Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 8 days ago • 311
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models Paper • 2501.11873 • Published 14 days ago • 63
cognitivecomputations/Wizard-Vicuna-30B-Uncensored Text Generation • Updated May 20, 2024 • 2.01k • 151
lmstudio-community/DeepSeek-R1-Distill-Qwen-7B-GGUF Text Generation • Updated 14 days ago • 336k • 42