Nathan Lambert

natolambert

https://www.natolambert.com/

AI & ML interests

Reinforcement learning, Ethics, Robotics, Dynamics Models

Recent Activity

updated a collection about 9 hours ago

liked a Space about 10 hours ago

Presidentlin/llm-pricing-calculator

liked a model about 10 hours ago

Qwen/Qwen2.5-VL-32B-Instruct

natolambert's activity

upvoted a collection about 1 month ago

OLMoE (January 2025)

Improved OLMoE for iOS app. Read more: https://allenai.org/blog/olmoe-app • 10 items • Updated 12 days ago • 10

upvoted an article 2 months ago

Article

Putting RL back in RLHF

Jun 12, 2024 • 84

upvoted a collection 3 months ago

2024 Interconnects Artifacts

Models & datasets mentioned in the bottom section of posts! • 280 items • Updated Jan 2 • 6

upvoted a paper 3 months ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 145

upvoted 4 collections 4 months ago

PixMo

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated 12 days ago • 67

OLMo 2

Artifacts for the second set of OLMo models. • 27 items • Updated 4 days ago • 105

Tulu 3 Models

All models released with Tulu 3 -- state-of-the-art open post-training recipes. • 11 items • Updated 12 days ago • 95

Tulu 3 Datasets

All datasets released with Tulu 3 -- state-of-the-art open post-training recipes. • 33 items • Updated 12 days ago • 76

upvoted 2 collections 6 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 12 days ago • 299

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 27 days ago • 569

upvoted 3 collections 7 months ago

Skywork-Reward-Data-Collection

Open-source preference datasets used to train the Skywork reward model series • 17 items • Updated Oct 12, 2024 • 16

OLMoE (November 2024)

Artifacts for open mixture-of-experts language models. • 13 items • Updated 12 days ago • 29

Hermes 3

The Hermes 3 Series of Models • 12 items • Updated Feb 13 • 112

upvoted a collection 9 months ago

Aligned Diffusion Model via DPO

18 items • Updated Jul 8, 2024 • 3

upvoted 2 collections 10 months ago

Tulu V2.5 Suite

A suite of models trained using DPO and PPO across a wide variety (up to 14) of preference datasets. See https://arxiv.org/abs/2406.09279 for more! • 44 items • Updated 12 days ago • 15

SciRIFF

Data and models to enhance instruction-following for scientific literature understanding. • 9 items • Updated 12 days ago • 9

upvoted a collection 12 months ago

[lecture artifacts] aligning open language models

artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin • 63 items • Updated Apr 17, 2024 • 56

upvoted 3 collections about 1 year ago

Reward Bench

Datasets, spaces, and models for the reward model benchmark! • 5 items • Updated 12 days ago • 9

Gemma release

Groups the Gemma models released by the Google team. • 40 items • Updated 13 days ago • 330

Model Merging

Model merging is a very popular technique for LLMs. Here is a chronological list of papers in the space to help you get started with it! • 30 items • Updated Jun 12, 2024 • 236