RLHF4MATH
Team members: 1
RLHF4MATH's activity
1231czx updated a dataset 9 months ago
RLHF4MATH/Gemma-7B-1.1-it-iter1-random-pairs • Viewer • Updated Jul 27, 2024 • 19k • 35 • 1
1231czx updated 6 models 9 months ago
RLHF4MATH/CodeGemma-7B-it-M-DPO • Text Generation • Updated Jul 26, 2024 • 1
RLHF4MATH/Gemma-7B-it-M-DPO • Text Generation • Updated Jul 26, 2024 • 1
RLHF4MATH/Gemma-9B-it-SFT3epoch • Text Generation • Updated Jul 26, 2024 • 1
RLHF4MATH/Mistral-7B-pt-SFT2epoch • Text Generation • Updated Jul 26, 2024 • 1
RLHF4MATH/Code-Gemma-7B-it-SFT3epoch • Text Generation • Updated Jul 26, 2024 • 10 • 1
RLHF4MATH/Gemma-7B-it-SFT3epoch • Text Generation • Updated Jul 26, 2024 • 1
1231czx updated 5 datasets 9 months ago
RLHF4MATH/SFT_510K • Viewer • Updated Jul 25, 2024 • 512k • 35 • 1
RLHF4MATH/prompt_iter4 • Viewer • Updated Jul 25, 2024 • 20.8k • 18
RLHF4MATH/prompt_iter3 • Viewer • Updated Jul 25, 2024 • 20.8k • 20
RLHF4MATH/prompt_iter2 • Viewer • Updated Jul 25, 2024 • 20.8k • 20
RLHF4MATH/prompt_iter1 • Viewer • Updated Jul 25, 2024 • 20.8k • 20
1231czx updated 2 models 9 months ago
RLHF4MATH/Gemma-2-9B-it-M-DPO • Text Generation • Updated Jul 15, 2024
RLHF4MATH/Mistral-7B-pt-M-DPO • Text Generation • Updated Jul 13, 2024