RyanYr's Collections
Reward modeling
Updated Sep 30
RyanYr/pm • Text Generation • Updated Jul 21 • 10
RyanYr/bt-rm • Updated Aug 8 • 3
RyanYr/bt-rm-llama3.1-hendrydong-preference_700K • Text Classification • Updated Aug 8 • 39
RyanYr/bt-rm-llama3.1-RLHFlow-pair_data_v2_800K_wsafety • Updated Aug 11 • 3
RyanYr/llama31-it-preference_data_v2_800K_wsafety_v1 • Updated Aug 12
RyanYr/llama31-it-preference_data_v2_800K_wsafety_v2 • Updated Aug 14 • 3
RyanYr/llama31-it-preference_data_v2_800K_wsafety_v3 • Updated Aug 15
RyanYr/llama31-it-preference_data_v2_800K_wsafety_trainer • Updated Aug 17 • 2
RyanYr/openchat-3.6-8b-20240522_iter1 • Updated Aug 24 • 1
RyanYr/openchat-3.6-8b-20240522_iter2 • Updated Aug 24 • 2
RyanYr/openchat-3.6-8b-20240522_iter3 • Updated Aug 25 • 1