Tool-Augmented Reward Models - a baidu Collection

baidu's Collections

Pixel-based Pre-training (PixelGPT)

Multilingual Code Pre-training (ERNIE-Code)

Tool-Augmented Reward Models

Tool-Augmented Reward Models

Updated Oct 13, 2024

[ICLR'24 Spotlight] Tool-Augmented Reward Modeling