SampleMix: A Sample-wise Pre-training Data Mixing Strategey by Coordinating Data Quality and Diversity Paper • 2503.01506 • Published 11 days ago • 9
Running 535 535 Scaling test-time compute 📈 Enhance math problem solving by scaling test-time compute
view article Article Zero to Hero with the TRL learning link bomb 💣 By burtenshaw • Nov 25, 2024 • 5
view article Article Outperforming Claude 3.5 Sonnet with Phi-3-mini-4k for graph entity relationship extraction tasks By rcaulk • Aug 19, 2024 • 7