Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
kapfy78
/
qwen-2.5-3b-r1-countdown
like
0
Text Generation
Transformers
TensorBoard
Safetensors
llama
Generated from Trainer
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
main
qwen-2.5-3b-r1-countdown
/
runs
Ctrl+K
Ctrl+K
1 contributor
History:
24 commits
kapfy78
Training in progress, step 450
c085d01
verified
4 months ago
Feb06_18-45-37_ip-172-31-22-46
Training in progress, step 150
4 months ago
Feb07_05-46-43_ip-172-31-22-46
Training in progress, step 450
4 months ago