Hugging Face
baidu's Collections
Pixel-based Pre-training (PixelGPT)
Multilingual Code Pre-training (ERNIE-Code)
Tool-Augmented Reward Models
Tool-Augmented Reward Models
Updated Oct 13, 2024
[ICLR'24 Spotlight] Tool-Augmented Reward Modeling
Tool-Augmented Reward Modeling
Paper • 2310.01045 • Published Oct 2, 2023 • 2
baidu/TARA
Preview • Updated Feb 20, 2024 • 205 • 1
baidu/Themis-7b
Updated Mar 9, 2024 • 20 • 4