Michael Goin
mgoin
·
AI & ML interests
LLM inference optimization, compression, quantization, pruning, distillation
Recent Activity
updated
a model
22 days ago
inference-optimization/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4
new activity
22 days ago
inference-optimization/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4:Fix invalid config
new activity
22 days ago
inference-optimization/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4:Fix invalid config