Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
qihoo360
/
Light-R1-14B-DS
like
30
Follow
北京奇虎科技有限公司
187
Text Generation
Transformers
Safetensors
qwen2
conversational
text-generation-inference
Inference Endpoints
arxiv:
2503.10460
License:
apache-2.0
Model card
Files
Files and versions
Community
5
Train
Deploy
Use this model
GRPO dataset?
#5
by
Armaan11
- opened
5 days ago
Discussion
Armaan11
5 days ago
Please can the GRPO dataset also be published?
See translation
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Your need to confirm your account before you can post a new comment.
Comment
·
Sign up
or
log in
to comment