RyzenAI-1.3_LLM_NPU_Models Collection Models quantized by Quark and prepared for the OGA-based NPU-only execution flow (Ryzen AI 1.3) • 14 items • Updated about 8 hours ago • 3
RyzenAI-1.3_LLM_NPU_Models Collection Models quantized by Quark and prepared for the OGA-based NPU-only execution flow (Ryzen AI 1.3) • 14 items • Updated about 8 hours ago • 3
amd/gemma-2-2b-awq-uint4-asym-g128-lmhead-g32-fp16-onnx-hybrid Text Generation • Updated 21 days ago • 9
amd/DeepSeek-R1-Distill-Llama-8B-awq-asym-uint4-g128-lmhead-onnx-hybrid Updated about 20 hours ago • 105 • 1
amd/DeepSeek-R1-Distill-Qwen-1.5B-awq-asym-uint4-g128-lmhead-onnx-hybrid Updated about 21 hours ago • 60 • 1
amd/DeepSeek-R1-Distill-Qwen-7B-awq-asym-uint4-g128-lmhead-onnx-hybrid Updated about 21 hours ago • 109
amd/Phi-3-mini-4k-instruct-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Dec 13, 2024 • 38 • 1
amd/Phi-3.5-mini-instruct-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Dec 13, 2024 • 1.06k • 1
amd/Llama-3.1-8B-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Dec 13, 2024 • 36 • 2
amd/Llama2-7b-chat-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Dec 13, 2024 • 34
amd/Llama2-7b-chat-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Dec 13, 2024 • 34
amd/Llama-3-8B-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Dec 13, 2024 • 69 • 1
amd/Llama-3-8B-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Dec 13, 2024 • 69 • 1