Latest SOTA models supported on Qualcomm NPU.
AI & ML interests
On Device AI Deployment and Research
Recent Activity
Multimodal models running on Qualcomm NPU for Snapdragon8 Gen4
Latest SOTA models supported on Intel NPU
Text Generations Models in MLX format, hand picked by Nexa Team.
Text Generations Models in GGUF format, hand picked by Nexa Team.
Tiny, multimodal on-device models developed by Nexa AI.
Latest SOTA models supported on Apple Neural Engine
Nexa AI infra to support Qwen3VL running on GPU/NPU/CPU
-
NexaAI/Qwen3-VL-4B-Instruct-GGUF
Image-Text-to-Text • 4B • Updated • 17.7k • 27 -
NexaAI/Qwen3-VL-4B-Thinking-GGUF
Image-Text-to-Text • 4B • Updated • 5.54k • 6 -
NexaAI/Qwen3-VL-8B-Instruct-GGUF
Image-Text-to-Text • 8B • Updated • 21.2k • 21 -
NexaAI/Qwen3-VL-8B-Thinking-GGUF
Image-Text-to-Text • 8B • Updated • 9.86k • 12
Language Models that takes vision input and/or audio input, hand picked by Nexa Team.
Language Models that takes vision input and/or audio input, hand picked by Nexa Team.
NexaQuant compresses models with 100% accuracy recovery.
Latest SOTA models supported on Qualcomm NPU.
Latest SOTA models supported on Apple Neural Engine
Multimodal models running on Qualcomm NPU for Snapdragon8 Gen4
Nexa AI infra to support Qwen3VL running on GPU/NPU/CPU
-
NexaAI/Qwen3-VL-4B-Instruct-GGUF
Image-Text-to-Text • 4B • Updated • 17.7k • 27 -
NexaAI/Qwen3-VL-4B-Thinking-GGUF
Image-Text-to-Text • 4B • Updated • 5.54k • 6 -
NexaAI/Qwen3-VL-8B-Instruct-GGUF
Image-Text-to-Text • 8B • Updated • 21.2k • 21 -
NexaAI/Qwen3-VL-8B-Thinking-GGUF
Image-Text-to-Text • 8B • Updated • 9.86k • 12
Latest SOTA models supported on Intel NPU
Language Models that takes vision input and/or audio input, hand picked by Nexa Team.
Text Generations Models in MLX format, hand picked by Nexa Team.
Language Models that takes vision input and/or audio input, hand picked by Nexa Team.
Text Generations Models in GGUF format, hand picked by Nexa Team.
NexaQuant compresses models with 100% accuracy recovery.
Tiny, multimodal on-device models developed by Nexa AI.