3 42 54

Pritam Kumar Ravi

PritamcodesAGI

pritam5756

AI & ML interests

AI4Sci , Generative Models

Recent Activity

upvoted an article about 12 hours ago

Mixture of Experts (MoEs) in Transformers

upvoted an article about 12 hours ago

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

upvoted an article 4 months ago

We Got Claude to Build CUDA Kernels and teach open models!

View all activity

Organizations

upvoted 2 articles about 12 hours ago

Article

Mixture of Experts (MoEs) in Transformers

ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap

•

Feb 26

• 165

Article

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

ariG23498, sayakpaul, sergiopaniego, ror, pcuenq

•

6 days ago

• 72

upvoted an article 4 months ago

Article

We Got Claude to Build CUDA Kernels and teach open models!

burtenshaw, evalstate, merve, pcuenq

•

Jan 28

• 157

liked 2 models 5 months ago

black-forest-labs/FLUX.2-klein-4B

Image-to-Image • Updated Feb 24 • 337k • • 699

black-forest-labs/FLUX.2-klein-9B

Image-to-Image • Updated Feb 24 • 137k • • 827

upvoted an article 5 months ago

Article

Mixture of Experts Explained

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq

•

Dec 11, 2023

• 1.14k

upvoted a collection 5 months ago

📝 Research & Long-Form Blog Posts

Collection

In-depth technical articles and research pieces published by Hugging Face • 18 items • Updated 6 days ago • 25

upvoted 2 articles 5 months ago

Article

Deriving the PPO Loss from First Principles

garg-aayush

•

Dec 25, 2025

• 45

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

ariG23498, sergiopaniego, reach-vb, pcuenq, ArthurZ, SaylorTwift, cyrilvallez

•

Sep 11, 2025

• 188

liked 3 models 6 months ago

liked a Space 6 months ago

Sesame CSM

🌱

861

Conversational speech generation

upvoted an article 6 months ago

Article

The Annotated Diffusion Model

nielsr, kashif

•

Jun 7, 2022

• 359

liked a model 6 months ago

physical-intelligence/fast

Robotics • Updated Jan 16, 2025 • 172

liked a Space 6 months ago

Evaluation Guidebook

📝

325

Explore LLM benchmark scores over time

liked a Space 7 months ago

The Smol Training Playbook

📚

3.2k

The secrets to building world-class LLMs

upvoted a collection 7 months ago

📐 FineMath

Collection

FineMath datasets and ablation models • 14 items • Updated May 5, 2025 • 26

New activity in nampdn-ai/tiny-textbooks 7 months ago

I wanna create a new dataset from textbooks as well

#5 opened 7 months ago by

PritamcodesAGI

liked a dataset 7 months ago

nampdn-ai/tiny-codes

Viewer • Updated Sep 30, 2023 • 1.63M • 1.52k • 288

Pritam Kumar Ravi

AI & ML interests

Recent Activity

Organizations

PritamcodesAGI's activity

Mixture of Experts (MoEs) in Transformers

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

We Got Claude to Build CUDA Kernels and teach open models!

Mixture of Experts Explained

Deriving the PPO Loss from First Principles

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Sesame CSM

The Annotated Diffusion Model

Evaluation Guidebook

The Smol Training Playbook

I wanna create a new dataset from textbooks as well