Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning Paper • 2502.14768 • Published 8 days ago • 42
DevQuasar/NousResearch.DeepHermes-3-Llama-3-8B-Preview-GGUF Text Generation • Updated 14 days ago • 887 • 2
LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters! Paper • 2502.07374 • Published 17 days ago • 35