An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging Paper • 2502.09056 • Published Feb 13 • 30
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding Paper • 2412.10302 • Published Dec 13, 2024 • 17
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 113
Running on Zero 3 3 Typhoon2 Vision Demo (Research Preview) 💬 Generate text from images and messages
scb10x/llama3.1-typhoon2-deepseek-r1-70b-preview Text Generation • Updated 4 days ago • 62 • 12