Hugging Face
Models: 546
Sort: Trending
Active filters: reward-trainer
HFXM/RM_HHRLHF_Rule3_Seed2029 • Text Classification • Updated Feb 2 • 14
HFXM/RM_HHRLHF_Rule2_Seed2029 • Text Classification • Updated Feb 2 • 7
HFXM/RM_HHRLHF_Rule2_Seed2026 • Text Classification • Updated Feb 2 • 10
HFXM/RM_HHRLHF_Rule4_Seed2027 • Text Classification • Updated Feb 2 • 8
HFXM/RM_HHRLHF_Rule4_Seed2029 • Text Classification • Updated Feb 2 • 6
HFXM/RM_HHRLHF_Rule3_Seed2027 • Text Classification • Updated Feb 2 • 12
HFXM/RM_HHRLHF_Rule8_Seed2026 • Text Classification • Updated Feb 2 • 6
HFXM/RM_HHRLHF_Rule8_Seed2025 • Text Classification • Updated Feb 2 • 6
HFXM/RM_HHRLHF_Rule5_Seed2029 • Text Classification • Updated Feb 2 • 14
HFXM/RM_HHRLHF_Rule3_Seed2028 • Text Classification • Updated Feb 2 • 11
HFXM/RM_HHRLHF_Rule5_Seed2028 • Text Classification • Updated Feb 2 • 6
HFXM/RM_HHRLHF_Rule5_Seed2025 • Text Classification • Updated Feb 2 • 6
HFXM/RM_HHRLHF_Rule9_Seed2026 • Text Classification • Updated Feb 2 • 10
HFXM/RM_HHRLHF_Rule8_Seed2027 • Text Classification • Updated Feb 2 • 6
HFXM/RM_HHRLHF_Rule9_Seed2029 • Text Classification • Updated Feb 2 • 6
HFXM/RM_HHRLHF_Rule9_Seed2027 • Text Classification • Updated Feb 2 • 8
HFXM/RM_HHRLHF_Rule9_Seed2028 • Text Classification • Updated Feb 2 • 6
HFXM/RM_HHRLHF_Rule8_Seed2028 • Text Classification • Updated Feb 2 • 6
HFXM/RM_HHRLHF_Rule5_Seed2027 • Text Classification • Updated Feb 2 • 6
HFXM/RM_HHRLHF_Rule5_Seed2026 • Text Classification • Updated Feb 2 • 6
HFXM/RM_HHRLHF_Rule9_Seed2025 • Text Classification • Updated Feb 2 • 7
HFXM/RM_HHRLHF_Rule4_Seed2028 • Text Classification • Updated Feb 2 • 6
HFXM/RM_HHRLHF_Rule8_Seed2029 • Text Classification • Updated Feb 2 • 10
HFXM/RM_HHRLHF_Rule3_Seed2026 • Text Classification • Updated Feb 3 • 12
fjxdaisy/RM_HHRLHF_Rule0_Seed2029 • Text Classification • Updated Feb 3 • 6
fjxdaisy/RM_HHRLHF_Rule1_Seed2028 • Text Classification • Updated Feb 3 • 13
fjxdaisy/RM_HHRLHF_Rule0_Seed2028 • Text Classification • Updated Feb 3 • 10
MilyaShams/SmolLM2-135M-Instruct-Reward-probabilistic • Text Classification • Updated Feb 3 • 11
abhayesian/gpt2-large_helpful-only-reward-model • Text Classification • Updated Feb 3 • 29
HFXM/RM_HHRLHF_Rule7_Seed2029 • Text Classification • Updated Feb 3 • 6