Feynman-Grpo-Exp / README.md
---
license: apache-2.0
datasets:
- openai/gsm8k
language:
- en
base_model:
- Qwen/Qwen2.5-0.5B-Instruct
pipeline_tag: text-generation
library_name: transformers
tags:
- math
- reasoning
- grpo
- trl
- code
---