- InfinityMATH: A Scalable Instruction Tuning Dataset in Programmatic Mathematical Reasoning
  Paper • 2408.07089 • Published • 12
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
  Paper • 2409.16191 • Published • 41
- Training Language Models to Self-Correct via Reinforcement Learning
  Paper • 2409.12917 • Published • 131
- Self-Boosting Large Language Models with Synthetic Preference Data
  Paper • 2410.06961 • Published • 14
Sheikh Jubair
sheikhjubair
AI & ML interests
None yet
Organizations
None yet
Collections
2
models
None public yet
datasets
None public yet