Alon Albalak's picture

Alon Albalak

alon-albalak

·

https://alon-albalak.github.io/

AI & ML interests

None yet

Recent Activity

new activity 20 days ago

SynthLabsAI/Big-Math-RL-Verified:Solution of the Problems

new activity 28 days ago

SynthLabsAI/Big-Math-RL-Verified:Adding an indicator of whether response requires LLM judge

updated a dataset 28 days ago

SynthLabsAI/Big-Math-RL-Verified

View all activity

Organizations

alon-albalak's activity

upvoted a collection about 2 months ago

Big-Math

This collection contains assets associated with the Big-Math dataset, a high-quality collection of over 250,000 math questions with verifiable answers • 4 items • Updated 7 days ago • 4

upvoted 2 papers about 2 months ago

Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs

Paper • 2503.01307 • Published Mar 3 • 38

Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models

Paper • 2502.17387 • Published Feb 24 • 6

upvoted a paper 3 months ago

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published Jan 8 • 98

upvoted a paper 5 months ago

Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models

Paper • 2412.02980 • Published Dec 4, 2024 • 14

upvoted 2 papers 6 months ago

Generative Reward Models

Paper • 2410.12832 • Published Oct 2, 2024 • 6

A Survey on Data Selection for Language Models

Paper • 2402.16827 • Published Feb 26, 2024 • 4

upvoted a collection 9 months ago

Common Pile

Datasets in the Common Pile. • 28 items • Updated Mar 22 • 5

upvoted a paper 9 months ago

PERSONA: A Reproducible Testbed for Pluralistic Alignment

Paper • 2407.17387 • Published Jul 24, 2024 • 20

upvoted a paper 10 months ago

DataComp-LM: In search of the next generation of training sets for language models

Paper • 2406.11794 • Published Jun 17, 2024 • 53