🧠Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 6 items • Updated 3 days ago • 28
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • Updated 2 days ago • 354k • • 640
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments Paper • 2501.10893 • Published 16 days ago • 23
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training Paper • 2501.11425 • Published 14 days ago • 89
Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks Paper • 2501.11733 • Published 14 days ago • 27
Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement Paper • 2501.12273 • Published 13 days ago • 14
The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models Paper • 2501.09653 • Published 18 days ago • 12
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking Paper • 2501.09751 • Published 18 days ago • 47
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models Paper • 2501.09686 • Published 18 days ago • 36
PaSa: An LLM Agent for Comprehensive Academic Paper Search Paper • 2501.10120 • Published 17 days ago • 42
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published 20 days ago • 52