📝 Cool LLM papers - a anakin87 Collection

anakin87 's Collections

📝 Cool LLM papers

🇮🇹 Italian Merges

📝 Cool LLM papers

updated 6 days ago

Starting from 2024-11-15

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 7 days ago • 328
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Paper • 2412.11605 • Published 10 days ago • 15
Running

384

📝

Scaling test-time compute
Reverse Thinking Makes LLMs Stronger Reasoners

Paper • 2411.19865 • Published 27 days ago • 19
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22 • 56
Scaling Laws for Precision

Paper • 2411.04330 • Published Nov 7 • 6

Note to read
LoRA vs Full Fine-tuning: An Illusion of Equivalence

Paper • 2410.21228 • Published Oct 28 • 2
Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback

Paper • 2406.09279 • Published Jun 13 • 2
Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data

Paper • 2404.14367 • Published Apr 22 • 1
Direct Language Model Alignment from Online AI Feedback

Paper • 2402.04792 • Published Feb 7 • 29