deepseek-ai/DeepSeek-R1-Distill-Llama-8B Text Generation • Updated 23 days ago • 1.54M • • 654
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 352