In a Training Loop 🔄

Pratyay Banerjee

Neilblaze

https://neilblaze.live

AI & ML interests

HCI, Computer Vision, Object Detection, Pattern Recognition, NLP, Supervised Learning

Recent Activity

liked a model about 3 hours ago

Jiunsong/supergemma4-26b-uncensored-gguf-v2

liked a model about 3 hours ago

Jiunsong/supergemma4-26b-abliterated-multimodal-mlx-4bit

upvoted a paper about 22 hours ago

MISA: Mixture of Indexer Sparse Attention for Long-Context LLM Inference

liked 2 models about 3 hours ago

Jiunsong/supergemma4-26b-uncensored-gguf-v2

Text Generation • 25B • Updated about 1 month ago • 288k • 547

Jiunsong/supergemma4-26b-abliterated-multimodal-mlx-4bit

Image-Text-to-Text • 5B • Updated 24 days ago • 9.08k • 51

upvoted 10 papers about 22 hours ago

MISA: Mixture of Indexer Sparse Attention for Long-Context LLM Inference

Paper • 2605.07363 • Published 5 days ago • 12

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

Paper • 2605.06638 • Published 6 days ago • 13

AcademiClaw: When Students Set Challenges for AI Agents

Paper • 2605.02661 • Published 9 days ago • 15

D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models

Paper • 2605.05204 • Published 7 days ago • 25

SkillOS: Learning Skill Curation for Self-Evolving Agents

Paper • 2605.06614 • Published 6 days ago • 37

LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

Paper • 2605.08083 • Published 5 days ago • 57

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Paper • 2605.06130 • Published 6 days ago • 92

Flow-OPD: On-Policy Distillation for Flow Matching Models

Paper • 2605.08063 • Published 5 days ago • 82

ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration

Paper • 2605.03042 • Published 9 days ago • 107

Heterogeneous Scientific Foundation Model Collaboration

Paper • 2604.27351 • Published 13 days ago • 211

liked 2 models 4 days ago

ibm-granite/granite-speech-4.1-2b

Automatic Speech Recognition • 2B • Updated 13 days ago • 138k • 90

google/gemma-4-26B-A4B-it-assistant

Any-to-Any • 0.4B • Updated 1 day ago • 47.7k • 118

liked 4 models 6 days ago

unsloth/gemma-4-E4B-it-GGUF

Image-Text-to-Text • 8B • Updated 8 days ago • 1.31M • 397

bartowski/google_gemma-4-E4B-it-GGUF

Image-Text-to-Text • 8B • Updated 9 days ago • 138k • 54

bartowski/google_gemma-4-26B-A4B-it-GGUF

Image-Text-to-Text • 25B • Updated 9 days ago • 209k • 125

bartowski/google_gemma-4-31B-it-GGUF

Image-Text-to-Text • 31B • Updated 9 days ago • 171k • 70

liked a Space 7 days ago

The ultimate guide to RL environments: building and scaling them in the LLM era

Building and scaling RL environments for LLM training

liked a model 7 days ago

XiaomiMiMo/MiMo-V2.5

311B • Updated 4 days ago • 78.2k • 234