Collections
Discover the best community collections!
Collections trending this week
- mlx-community/Qwen2.5-VL-72B-Instruct-4bit
  Image-Text-to-Text • Updated • 176 • 2
- mlx-community/Qwen2.5-VL-72B-Instruct-3bit
  Image-Text-to-Text • Updated • 79 • 2
- mlx-community/Qwen2.5-VL-7B-Instruct-bf16
  Image-Text-to-Text • Updated • 226 • 2
- mlx-community/Qwen2.5-VL-7B-Instruct-8bit
  Image-Text-to-Text • Updated • 499 • 7

- MiniMax-01: Scaling Foundation Models with Lightning Attention
  Paper • 2501.08313 • Published • 271
- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
  Paper • 2501.12948 • Published • 281
- 2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining
  Paper • 2501.00958 • Published • 99
- The Lessons of Developing Process Reward Models in Mathematical Reasoning
  Paper • 2501.07301 • Published • 89

- mlx-community/Qwen2.5-7B-Instruct-1M-4bit
  Text Generation • Updated • 363 • 6
- mlx-community/Qwen2.5-7B-Instruct-1M-6bit
  Text Generation • Updated • 43 • 1
- mlx-community/Qwen2.5-7B-Instruct-1M-3bit
  Text Generation • Updated • 23
- mlx-community/Qwen2.5-7B-Instruct-1M-8bit
  Text Generation • Updated • 65

- deepseek-ai/deepseek-vl2-tiny
  Image-Text-to-Text • Updated • 29.1k • 90
- deepseek-ai/deepseek-vl2-small
  Image-Text-to-Text • Updated • 8.13k • 51
- deepseek-ai/deepseek-vl2
  Image-Text-to-Text • Updated • 4.32k • 166
- DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
  Paper • 2412.10302 • Published • 12