NousResearch/DeepHermes-3-Llama-3-8B-Preview Text Generation • Updated 23 days ago • 66.5k • 303
NousResearch/DeepHermes-3-Mistral-24B-Preview Text Generation • Updated 23 days ago • 8.56k • 90
Running 2.41k 2.41k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters