amd/DeepSeek-R1-Distill-Llama-8B-awq-asym-uint4-g128-lmhead-onnx-cpu Text Generation • Updated Jan 30, 2025
Quark Quantized ONNX LLMs for Ryzen AI 1.3 EA • Collection • ONNX Runtime generate() API-based models quantized by Quark and optimized for the Ryzen AI Strix Point NPU • 8 items • Updated Dec 5, 2025 • 8
Open LLM Leaderboard 🏆 • Track, rank and evaluate open LLMs and chatbots • Running on CPU Upgrade • 13.8k