SimpleRL-Zoo
Collection
The collection for the Paper "SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild"
•
12 items
•
Updated
•
6
No model card