Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published 11 days ago • 130
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated 27 days ago • 49
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 7 items • Updated 27 days ago • 29
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state of the art open post-training recipes. • 32 items • Updated 27 days ago • 62
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated 27 days ago • 289
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 27 days ago • 442
Skywork-Reward-Data-Collection Collection Open-source preference datasets used to train the Skywork reward model series • 17 items • Updated Oct 12 • 11
OLMoE Collection Artifacts for open mixture-of-experts language models. • 13 items • Updated 27 days ago • 27
Tulu V2.5 Suite Collection A suite of models trained using DPO and PPO across a wide variety (up to 14) of preference datasets. See https://arxiv.org/abs/2406.09279 for more! • 44 items • Updated 27 days ago • 14
SciRIFF Collection Data and models to enhance instruction-following for scientific literature understanding. • 9 items • Updated 27 days ago • 8
[lecture artifacts] aligning open language models Collection artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin • 63 items • Updated Apr 17 • 56
Reward Bench Collection Datasets, spaces, and models for the reward model benchmark! • 5 items • Updated 27 days ago • 8
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated 11 days ago • 327
Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12 • 218
OLMo Suite Collection Artifacts for the first set of OLMo models. • 18 items • Updated 27 days ago • 69
Awesome feedback datasets Collection A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. • 19 items • Updated Apr 12 • 66