Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024 • 474
Marian COMET (Metrics and QE) Collection COMET models converted into Marian by Microsoft Translate Team! • 10 items • Updated May 28, 2024 • 1
xCOMET: Transparent Machine Translation Evaluation through Fine-grained Error Detection Paper • 2310.10482 • Published Oct 16, 2023 • 2
Beyond English-Centric Multilingual Machine Translation Paper • 2010.11125 • Published Oct 21, 2020 • 1
MT5 release Collection The MT5 release follows the T5 family, but is pretrained on multilingual data. The updated UMT5 models are pretrained on an updated corpus. • 10 items • Updated Dec 13, 2024 • 18
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs Paper • 2406.07476 • Published Jun 11, 2024 • 34
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Paper • 2501.13106 • Published 2 days ago • 61
VideoLLaMA3 Collection Frontier Multimodal Foundation Models for Video Understanding • 13 items • Updated about 10 hours ago • 7
Eagle 2 Collection Eagle 2 is a family of frontier vision-language models with vision-centric design. The model supports 4K HD input, long-context video, and grounding. • 9 items • Updated 1 day ago • 18
SmolVLM 256M & 500M Collection Collection of models & demos for the even smoller SmolVLM release • 12 items • Updated 1 day ago • 42
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos Paper • 2501.12375 • Published 3 days ago • 18
VideoWorld: Exploring Knowledge Learning from Unlabeled Videos Paper • 2501.09781 • Published 8 days ago • 20
Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong Paper • 2501.09775 • Published 9 days ago • 26