neuralmagic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic Text Generation โข Updated Oct 17, 2024 โข 2.05k โข 14
Running 403 403 LLM Model VRAM Calculator ๐ Calculate VRAM requirements for running large language models