Nathan Lambert
natolambert
137 followers · 5 following
https://www.natolambert.com/
AI & ML interests
Reinforcement learning, Ethics, Robotics, Dynamics Models
Recent Activity
Updated a dataset 1 day ago: allenai/reward-bench-results
New activity 1 day ago in allenai/reward-bench: multilingual
Updated a collection 3 days ago: 2025 Artifacts
Articles
Ethics and Society Newsletter #4: Bias in Text-to-Image Models
Jun 26, 2023 • 2
Can foundation models label data like humans?
Jun 12, 2023 • 1
Creating a Coding Assistant with StarCoder
May 9, 2023 • 1
StackLLaMA: A hands-on guide to train LLaMA with RLHF
Apr 5, 2023 • 23
Red-Teaming Large Language Models
Feb 24, 2023 • 18
What Makes a Dialog Agent Useful?
Jan 24, 2023 • 1
Illustrating Reinforcement Learning from Human Feedback (RLHF)
Dec 9, 2022 • 127
Stable Diffusion with 🧨 Diffusers
Aug 22, 2022 • 42
natolambert's activity
New activity in allenai/reward-bench, 1 day ago
multilingual • 2
#8 opened 8 days ago by ehartford

New activity in allenai/reward-bench, about 1 month ago
add more contaminated models to the list • 2
#7 opened 3 months ago by arielgera

New activity in allenai/Llama-3.1-Tulu-3-70B, about 1 month ago
Reason behind not using special tokens in the prompt format? • 2
#2 opened about 2 months ago by Doctor-Shotgun

New activity in allenai/OLMo-2-1124-13B-Instruct-preview, about 1 month ago
What is that instruction template? • 1
#1 opened about 2 months ago by SerialKicked

New activity in allenai/Llama-3.1-Tulu-3-70B, about 1 month ago
Why do you use pass@10 to test coding performance... • 1
#4 opened about 2 months ago by Leon-Leee

New activity in allenai/OLMo-2-1124-13B-Instruct-preview, about 1 month ago
Has the data set been expanded? • 1
#2 opened about 1 month ago by win10

New activity in allenai/tulu-3-sft-personas-algebra, about 1 month ago
Librarian Bot: Add language metadata for dataset
#1 opened about 2 months ago by librarian-bot

New activity in allenai/tulu-3-sft-personas-math, about 1 month ago
Add link to Tulu 3 paper
#2 opened about 2 months ago by gabrielmbmb

New activity in allenai/llama-3.1-tulu-3-70b-preference-mixture, about 1 month ago
Librarian Bot: Add language metadata for dataset
#1 opened about 2 months ago by librarian-bot

New activity in allenai/llama-3.1-tulu-3-8b-preference-mixture, about 1 month ago
Easy way to separate permissive samples • 1
#1 opened about 2 months ago by RASMUS

New activity in allenai/tulu-3-sft-mixture, about 1 month ago
recommend filter • 1
#2 opened about 2 months ago by ehartford
NuminaMath-TIR License (Apache 2, not CC-BY-NC-4.0) • 1
#3 opened about 2 months ago by rbattle

New activity in allenai/Llama-3.1-Tulu-3-8B-RM, about 1 month ago
Adding `safetensors` variant of this model
#2 opened about 2 months ago by SFconvertbot

New activity in allenai/Llama-3.1-Tulu-3-70B-SFT, about 1 month ago
Adding Evaluation Results
#2 opened about 1 month ago by leaderboard-pr-bot

New activity in allenai/Llama-3.1-Tulu-3-8B-DPO, about 1 month ago
Adding `safetensors` variant of this model
#2 opened about 2 months ago by SFconvertbot

New activity in allenai/Llama-3.1-Tulu-3-70B-DPO, about 1 month ago
Adding `safetensors` variant of this model
#3 opened about 2 months ago by SFconvertbot

New activity in allenai/Llama-3.1-Tulu-3-70B, about 1 month ago
Spelling Error in Section 5.4 - "then" should be "than" • 1
#3 opened about 2 months ago by eliuakk

New activity in allenai/Llama-3.1-Tulu-3-8B, about 1 month ago
Feedback • 1
#2 opened about 2 months ago by KeyboardMasher

New activity in allenai/Llama-3.1-Tulu-3-8B-RM, about 2 months ago
Update README.md
#1 opened about 2 months ago by reach-vb

New activity in allenai/Llama-3.1-Tulu-3-70B-SFT, about 2 months ago
Update README.md
#1 opened about 2 months ago by reach-vb