DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 6 days ago • 226
nexa-collaboration/output_llama3-1_8b_distillation_from_sparse Text Generation • Updated 8 days ago • 7
nexa-collaboration/output_llama3-1_8b_distillation_from_sparse Text Generation • Updated 8 days ago • 7