Pramit Choudhary

maverick84

https://github.com/pramitchoudhary

AI & ML interests

None yet

Recent Activity

liked a model about 11 hours ago

allenai/olmOCR-7B-0225-preview

liked a Space about 18 hours ago

oidlabs/Lexoid

liked a dataset 2 days ago

ChicagoHAI/CaseSumm

maverick84's activity

upvoted an article about 1 month ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Mar 12

• 396

upvoted a paper about 1 month ago

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 158

upvoted an article about 1 month ago

Article

Welcome to Inference Providers on the Hub 🔥

Jan 28

• 477

upvoted a paper 2 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 182

upvoted a collection 2 months ago

GAIA release

Gather the items of the GAIA release • 4 items • Updated Nov 23, 2023 • 23

upvoted a collection 3 months ago

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated Mar 13 • 96

upvoted 2 papers 4 months ago

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 197

Progressive Multimodal Reasoning via Active Retrieval

Paper • 2412.14835 • Published Dec 19, 2024 • 74

upvoted a collection 5 months ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated 22 days ago • 223

upvoted 4 papers 7 months ago

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12, 2024 • 73

PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation

Paper • 2409.06820 • Published Sep 10, 2024 • 69

MinerU: An Open-Source Solution for Precise Document Content Extraction

Paper • 2409.18839 • Published Sep 27, 2024 • 28

IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models

Paper • 2308.06721 • Published Aug 13, 2023 • 30

upvoted a collection 9 months ago

LLM Training Datasets

A collection of datasets for training LLMs. • 110 items • Updated 8 days ago • 21

upvoted a paper 10 months ago

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

Paper • 2402.13064 • Published Feb 20, 2024 • 49

upvoted a paper about 1 year ago

MambaByte: Token-free Selective State Space Model

Paper • 2401.13660 • Published Jan 24, 2024 • 58

upvoted a collection about 1 year ago

DRAGON Models

Production-grade RAG-optimized 6-7B parameter models - "Delivering RAG on ..." the leading foundation base models • 23 items • Updated Feb 23 • 46

upvoted a collection over 1 year ago

Open LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 65 items • Updated Mar 20 • 582

upvoted 2 papers over 1 year ago

WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation

Paper • 2312.14187 • Published Dec 20, 2023 • 52

FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness

Paper • 2205.14135 • Published May 27, 2022 • 13