Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
reward-trainer
Inference Endpoints
AutoTrain Compatible
text-generation-inference
4-bit precision
Eval Results
8-bit precision
Misc with no match
Merge
custom_code
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
390
Full-text search
Edit filters
Sort: Trending
Active filters:
reward-trainer
Clear all
HFXM/RM_HHRLHF_Rule3
Text Classification
•
Updated
Dec 4, 2024
•
7
HFXM/RM_HHRLHF_Rule1
Text Classification
•
Updated
Dec 5, 2024
•
1
HFXM/RM_HHRLHF_Rule2
Text Classification
•
Updated
Dec 5, 2024
•
1
RLHF-And-Friends/Pythia-70M-Reward
Updated
Dec 18, 2024
blakenp/gpt-Reward
Text Classification
•
Updated
Dec 12, 2024
•
102
blakenp/Qwen2.5-1.5B-Reward
Text Classification
•
Updated
Dec 12, 2024
•
96
blakenp/Qwen2-0.5B-Reward
Text Classification
•
Updated
Dec 13, 2024
•
102
ZHIYII/Qwen2.5-7B-Reward
Text Classification
•
Updated
Dec 14, 2024
•
197
eth-dl-rewards/internlm2-7b-reward-code-30k
Updated
Dec 13, 2024
eth-dl-rewards/internlm2-7b-reward-code-100k
Updated
Dec 13, 2024
eth-dl-rewards/internlm2-7b-reward-code-60k
Updated
22 days ago
eth-dl-rewards/internlm2-7b-reward-math-30k
Updated
Dec 14, 2024
eth-dl-rewards/internlm2-7b-reward-math-60k
Updated
23 days ago
eth-dl-rewards/internlm2-7b-reward-math-100k
Updated
Dec 14, 2024
eth-dl-rewards/internlm2-7b-reward-math-100k-scratch
Updated
Dec 14, 2024
HFXM/DynamicRules_RM-5e-5-1epoch
Text Classification
•
Updated
26 days ago
•
6
HFXM/DynamicRules_RM-2e-5-1epoch
Text Classification
•
Updated
26 days ago
•
4
HFXM/DynamicRules_RM-5e-5-2epoch
Text Classification
•
Updated
25 days ago
•
5
HFXM/DynamicRules_RM-2e-5-2epoch
Text Classification
•
Updated
25 days ago
•
5
eth-dl-rewards/internlm2-7b-reward-code-20k
Updated
23 days ago
eth-dl-rewards/internlm2-7b-reward-code-40k
Updated
22 days ago
eth-dl-rewards/internlm2-7b-reward-code-60k-scratch
Updated
22 days ago
eth-dl-rewards/internlm2-7b-reward-math-20k
Updated
23 days ago
eth-dl-rewards/internlm2-7b-reward-math-40k
Updated
23 days ago
eth-dl-rewards/internlm2-7b-reward-math-60k-scratch
Updated
23 days ago
eth-dl-rewards/internlm2-7b-reward-code-to-math-20k
Updated
22 days ago
HFXM/SkyworkFinetunedRM-2e-5-1epoch
Text Classification
•
Updated
23 days ago
•
9
HFXM/SkyworkFinetunedRM-5e-5-1epoch
Text Classification
•
Updated
23 days ago
•
4
HFXM/SkyworkFinetunedRMTemp1-2e-5-1epoch
Text Classification
•
Updated
23 days ago
•
7
Ololade/smol-Reward
Text Classification
•
Updated
22 days ago
•
127
Previous
1
...
9
10
11
12
13
Next