inference-optimization/Ministral-3-14B-Instruct-2512-NVFP4 Text Generation • Updated 4 days ago • 171
inference-optimization/Qwen3-235B-A22B-Thinking-2507-quantized.w4a16 Text Generation • 32B • Updated 5 days ago • 175
inference-optimization/Qwen3-235B-A22B-Thinking-2507-quantized.w8a8 Text Generation • 235B • Updated 5 days ago • 169
inference-optimization/Qwen3-235B-A22B-Instruct-2507-quantized.w4a16 Text Generation • 32B • Updated 5 days ago • 153
inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-noise Image-Text-to-Text • 32B • Updated 5 days ago • 128
inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-hybrid Image-Text-to-Text • 32B • Updated 5 days ago • 124
inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-heuristic Image-Text-to-Text • 32B • Updated 5 days ago • 155
inference-optimization/Qwen3.6-35B-A3B-6.5-bits-mode-noise Image-Text-to-Text • 30B • Updated 5 days ago • 129
inference-optimization/Qwen3.6-35B-A3B-6.5-bits-mode-hybrid Image-Text-to-Text • 30B • Updated 5 days ago • 114
inference-optimization/Qwen3.6-35B-A3B-6.5-bits-mode-heuristic Image-Text-to-Text • 30B • Updated 5 days ago • 106
inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-noise Image-Text-to-Text • 28B • Updated 5 days ago • 111
inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-hybrid Image-Text-to-Text • 28B • Updated 5 days ago • 287
inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-heuristic Image-Text-to-Text • 28B • Updated 5 days ago • 117
inference-optimization/Qwen3.6-35B-A3B-5.5-bits-mode-noise Image-Text-to-Text • 26B • Updated 5 days ago • 119
inference-optimization/Qwen3.6-35B-A3B-5.5-bits-mode-hybrid Image-Text-to-Text • 26B • Updated 5 days ago • 124
inference-optimization/Qwen3.6-35B-A3B-5.5-bits-mode-heuristic Image-Text-to-Text • 26B • Updated 5 days ago • 114
inference-optimization/Qwen3.6-35B-A3B-5.0-bits-mode-noise Image-Text-to-Text • 24B • Updated 5 days ago • 108
inference-optimization/Qwen3.6-35B-A3B-5.0-bits-mode-hybrid Image-Text-to-Text • 24B • Updated 5 days ago • 145
inference-optimization/Qwen3.6-35B-A3B-5.0-bits-mode-heuristic Image-Text-to-Text • 24B • Updated 5 days ago • 449