cyberagent/DeepSeek-R1-Distill-Qwen-32B-Japanese Text Generation • Updated 1 day ago • 1.39k • 155
tokyotech-llm/Llama-3.1-Swallow-70B-Instruct-v0.3 Text Generation • Updated 2 days ago • 1.22k • 8
nitky/Llama-3.1-SuperSwallow-70B-Instruct-v0.1 Text Generation • Updated Dec 13, 2024 • 136 • 1