Orpheus Multilingual Research Release Collection Beta Release of multilingual models. • 12 items • Updated 2 days ago • 66
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought Paper • 2504.05599 • Published 5 days ago • 73
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published 4 days ago • 127
meta-llama/Llama-4-Maverick-17B-128E-Instruct Image-Text-to-Text • Updated 3 days ago • 26.8k • • 284
meta-llama/Llama-4-Scout-17B-16E-Instruct Image-Text-to-Text • Updated 3 days ago • 445k • • 743
Running 253 253 starvector-1b-im2svg 📈 Convert images and text into scalable vector graphics (SVG) code
MoshiVis v0.1 Collection MoshiVis is a Vision Speech Model built as a perceptually-augmented version of Moshi v0.1 for conversing about image inputs • 8 items • Updated 22 days ago • 22
MambaVision Collection MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models. • 13 items • Updated 2 days ago • 31