RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-FP8-dynamic Text Generation • 561B • Updated about 4 hours ago • 6.97k • 1
RedHatAI/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 Text Generation • 67B • Updated about 4 hours ago • 1.72k • 2
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w4a16 Text Generation • 8B • Updated 7 days ago • 37k • 30
RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic Text Generation • 8B • Updated 7 days ago • 32.3k • 9
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a8 Text Generation • 8B • Updated 7 days ago • 15.8k • 20
RedHatAI/Mistral-Small-24B-Instruct-2501-quantized.w4a16 Text Generation • 24B • Updated 7 days ago • 279 • 1
RedHatAI/Llama-3.3-70B-Instruct-quantized.w4a16 Text Generation • 71B • Updated 7 days ago • 2.27k • 3
RedHatAI/Llama-3.3-70B-Instruct-quantized.w8a8 Text Generation • 71B • Updated 7 days ago • 15.4k • 14