Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
50.5
TFLOPS
4
10
Max
reciprocate
Follow
OmbelineM's profile picture
shuyuej's profile picture
allknowingroger's profile picture
36 followers
·
0 following
maxreciprocate
AI & ML interests
Reward models
Organizations
models
18
Sort: Recently updated
reciprocate/mistral-7b-gsm8k-code-rm
Text Classification
•
Updated
Mar 24, 2024
•
2
•
3
reciprocate/mistral-7b-rm
Text Classification
•
Updated
Feb 15, 2024
•
2
•
2
reciprocate/rm_beluga-7b_hh-full
Text Classification
•
Updated
Sep 25, 2023
reciprocate/rm-llama2-7b-gsm8k
Text Generation
•
Updated
Sep 14, 2023
•
2
reciprocate/llama2-7b-gsm8k
Text Generation
•
Updated
Aug 29, 2023
•
1
•
1
reciprocate/shepherd-13b
Text Generation
•
Updated
Aug 24, 2023
•
1
•
1
reciprocate/tiny-llama
Text Generation
•
Updated
Aug 6, 2023
•
9
•
2
reciprocate/vicuna-13b_rm_oasst-hh
Text Classification
•
Updated
Jun 27, 2023
•
18
reciprocate/openllama-13b-rlhf-v0
Text Generation
•
Updated
Jun 22, 2023
•
3
reciprocate/openllama-13b_rm_oasst-hh
Text Classification
•
Updated
Jun 21, 2023
•
4
Expand 18 models
datasets
35
Sort: Recently updated
reciprocate/kaggle-lmarena-synth-50k
Viewer
•
Updated
20 days ago
•
50.7k
•
42
reciprocate/ultra-annotated-200k
Viewer
•
Updated
Sep 1, 2024
•
208k
•
28
reciprocate/dpo-objective-v0.2
Viewer
•
Updated
May 14, 2024
•
384
•
26
reciprocate/tinygsm_interpreter_1M
Viewer
•
Updated
May 6, 2024
•
1M
•
46
reciprocate/dpo_untoxic
Viewer
•
Updated
Apr 7, 2024
•
541
•
27
reciprocate/dpo_mix-zero-math-untoxic
Viewer
•
Updated
Mar 29, 2024
•
6.91k
•
38
reciprocate/dpo_mix-7k_untoxic
Viewer
•
Updated
Mar 26, 2024
•
7.29k
•
28
•
2
reciprocate/tinygsm_mixtral_12M
Viewer
•
Updated
Mar 24, 2024
•
12M
•
184
•
1
reciprocate/dpo_ultra-capybara-code_filtered-best
Viewer
•
Updated
Mar 19, 2024
•
35.2k
•
25
•
1
reciprocate/tinygsm_dpo
Viewer
•
Updated
Mar 15, 2024
•
6.17k
•
34
•
2
Expand 35 datasets