DeepSeek-R1-Distill - a Ayushnangia Collection

Ayushnangia 's Collections

DeepSeek-R1-Distill

Deep-RL

DeepSeek-R1-Distill

updated Jan 20

This is a collection of Llama and Qwen-based models ranging from 1.5B to 70B parameters with are distilled from DeepSeek's new R1 models.

deepseek-ai/DeepSeek-R1-Distill-Llama-8B

Text Generation • 8B • Updated Feb 24 • 1.11M • • 778
deepseek-ai/DeepSeek-R1-Distill-Llama-70B

Text Generation • 71B • Updated Feb 24 • 139k • • 711
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • 2B • Updated Feb 24 • 899k • • 1.29k
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

Text Generation • 8B • Updated Feb 24 • 1.11M • • 685
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B

Text Generation • 15B • Updated Feb 24 • 267k • • 542
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • 33B • Updated Feb 24 • 529k • • 1.42k