DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated 12 days ago • 81
Turkish Instruction Datasets Collection Collection of instruction datasets for Turkish. • 37 items • Updated 24 days ago • 2
SigLIP Collection Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated 12 days ago • 47
Turkish Vision-Language Datasets Collection Collection of Turkish vision-language datasets. • 20 items • Updated 24 days ago • 4
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models Paper • 2408.02718 • Published Aug 5 • 60
VITA: Towards Open-Source Interactive Omni Multimodal LLM Paper • 2408.05211 • Published Aug 9 • 47
Gemma 2: Improving Open Language Models at a Practical Size Paper • 2408.00118 • Published Jul 31 • 75
Vision Language Leaderboards Collection This collection has all the vision language leaderboards. • 7 items • Updated Aug 24 • 13
EVLM: An Efficient Vision-Language Model for Visual Understanding Paper • 2407.14177 • Published Jul 19 • 42
Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model Paper • 2407.07053 • Published Jul 9 • 42
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated 3 days ago • 204
Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing Paper • 2407.08770 • Published Jul 11 • 19
AgentInstruct: Toward Generative Teaching with Agentic Flows Paper • 2407.03502 • Published Jul 3 • 49