GAIR-ProX

community

https://gair-nlp.github.io/ProX/

AI & ML interests

NLP Research

Recent Activity

Pengfei authored a paper 4 days ago

Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

SivilTaram authored a paper 12 days ago

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

SivilTaram authored a paper 12 days ago

When Attention Sink Emerges in Language Models: An Empirical View

View all activity

gair-prox's activity

Pengfei

authored a paper 4 days ago

Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

Paper • 2504.02587 • Published 4 days ago • 27

SivilTaram

authored 5 papers 12 days ago

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

Paper • 2411.07763 • Published Nov 12, 2024

When Attention Sink Emerges in Language Models: An Empirical View

Paper • 2410.10781 • Published Oct 14, 2024

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Paper • 2502.12982 • Published Feb 18 • 16

Predictive Data Selection: The Data That Predicts Is the Data That Teaches

Paper • 2503.00808 • Published Mar 2 • 57

Scaling up Masked Diffusion Models on Text

Paper • 2410.18514 • Published Oct 24, 2024

SivilTaram

authored a paper 14 days ago

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published 14 days ago • 28

SivilTaram

authored a paper 18 days ago

SkyLadder: Better and Faster Pretraining via Context Window Scheduling

Paper • 2503.15450 • Published 19 days ago • 11

SivilTaram

authored a paper about 2 months ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 100

koalazf99

authored a paper about 2 months ago

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Paper • 2502.12982 • Published Feb 18 • 16

koalazf99

in gair-prox/DCLM-pro about 2 months ago

[WIP] Upload folder using huggingface_hub (multi-commit f652a714e9bcaf40b69daaf465ca687697e8df0aa1db0cee063ddb5f2fb44d71)

#7 opened about 2 months ago by

koalazf99

updated a dataset about 2 months ago

gair-prox/DCLM-pro

Viewer • Updated Feb 15 • 366M • 5.37k • 8

koalazf99

in gair-prox/DCLM-pro about 2 months ago

[WIP] Upload folder using huggingface_hub (multi-commit 779e8d477ccfd8d7ede9f7ead7970c9792e9953fcf19cb11de5471794429a3eb)

#6 opened about 2 months ago by

SinclairWang

updated a dataset about 2 months ago

gair-prox/DCLM-pro

Viewer • Updated Feb 15 • 366M • 5.37k • 8

Pengfei

authored a paper 2 months ago

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published Feb 5 • 61

lockon

authored a paper 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 374

Pengfei

authored a paper 3 months ago

O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning

Paper • 2501.06458 • Published Jan 11 • 32

lockon

authored a paper 3 months ago

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published Dec 23, 2024 • 44

koalazf99

authored a paper 3 months ago

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published Dec 23, 2024 • 44

Pengfei

authored a paper 4 months ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 48