Running 494 Scaling test-time compute 📈 Enhance math problem solving by scaling test-time compute
Dolphin: A Challenging and Diverse Benchmark for Arabic NLG Paper • 2305.14989 • Published May 24, 2023
Arabic Automatic Story Generation with Large Language Models Paper • 2407.07551 • Published Jul 10, 2024
The Geometry of Numerical Reasoning: Language Models Compare Numeric Properties in Linear Subspaces Paper • 2410.13194 • Published Oct 17, 2024 • 1
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment Paper • 2408.06266 • Published Aug 12, 2024 • 10
Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 • 147
Running 560 FineWeb: decanting the web for the finest text data at scale 🍷 Generate high-quality web text data for LLM training