Understanding the Thinking Process of Reasoning Models: A Perspective from Schoenfeld's Episode Theory Paper • 2509.14662 • Published Sep 18 • 13
From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning Paper • 2308.12032 • Published Aug 23, 2023 • 2
Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning Paper • 2310.11716 • Published Oct 18, 2023 • 6
Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning Paper • 2402.00530 • Published Feb 1, 2024 • 2