Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
lewtun
/
Qwen2-0.5B-Reward
like
0
Text Classification
Transformers
Safetensors
qwen2
trl
reward-trainer
Generated from Trainer
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
f52a123
Qwen2-0.5B-Reward
/
training_args.bin
Commit History
End of training
f52a123
verified
lewtun
HF staff
commited on
Sep 23
End of training
0d0c7a3
verified
lewtun
HF staff
commited on
Sep 23
End of training
b083665
verified
lewtun
HF staff
commited on
Sep 5