Running • 368 • LLM Model VRAM Calculator • Calculate VRAM requirements for running large language models
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF • Text Generation • Updated Oct 25, 2024 • 227k • 2.01k