Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs Paper • 2412.21187 • Published 13 days ago • 34
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated 6 days ago • 53
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search Paper • 2412.18319 • Published 19 days ago • 35
Offline Reinforcement Learning for LLM Multi-Step Reasoning Paper • 2412.16145 • Published 23 days ago • 38
neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w8a8 Text Generation • Updated Oct 10, 2024 • 7.81k • 18
HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v1 Sentence Similarity • Updated 9 days ago • 16.1k • 29