gmongaras/CC12M_and_Imagenet21K_Recap
Viewer • Updated • 22.7M • 418 • 1
gmongaras/Imagenet21K_Recaption
Viewer • Updated • 13.1M • 1.24k • 2
gmongaras/EleutherAI_the_pile_deduplicated
Viewer • Updated • 134M • 218 • 3
gmongaras/Stable_Diffusion_3_Recaption
Viewer • Updated • 10.9M • 499
Gabriel Mongaras PRO
gmongaras
Recent Activity
liked a Space about 6 hours ago: nanotron/ultrascale-playbook
updated a dataset about 6 hours ago: gmongaras/CC12M_and_Imagenet21K_Recap_Highqual
Collections
6
Models for the paper Cottention: Linear Transformers With Cosine Attention https://arxiv.org/abs/2409.18747
Papers
1
models
19

gmongaras/Latent_Diffusion_Model_Imagenet2012_Softmax_250000
Updated

gmongaras/Softmax_Attention_BERT
Feature Extraction • Updated • 5

gmongaras/Cosine_Attention_BERT
Feature Extraction • Updated • 8

gmongaras/Cosine_Attention_GPT_1.2B
Feature Extraction • Updated • 6

gmongaras/Cosine_Attention_GPT_300M
Feature Extraction • Updated • 10

gmongaras/Softmax_Attention_GPT_1.2B
Feature Extraction • Updated • 5

gmongaras/Softmax_Attention_GPT_300M
Feature Extraction • Updated • 7

gmongaras/Yann_UWU
Text Generation • Updated • 7

gmongaras/Meta-Llama-3.1-8B
Text Generation • Updated • 5

gmongaras/reddit_negative_v1_13B
Text Generation • Updated • 15 • 1
datasets
31
gmongaras/CC12M_and_Imagenet21K_Recap_Highqual
Updated • 391
gmongaras/Imagenet21K_Recaption
Viewer • Updated • 13.1M • 1.24k • 2
gmongaras/Amazon-Reviews-2023
Viewer • Updated • 572M
gmongaras/CC12M_and_Imagenet21K_Recap
Viewer • Updated • 22.7M • 418 • 1
gmongaras/Imagenet21K
Viewer • Updated • 13.2M • 5.84k
gmongaras/ImageNet12
Viewer • Updated • 1.28M • 96
gmongaras/Stack
Updated • 6
gmongaras/Imagenet21
Updated • 1
gmongaras/Stable_Diffusion_3_Recaption
Viewer • Updated • 10.9M • 499
gmongaras/Pile_TokLlama
Updated • 2