Feynman-Grpo-Exp / README.md
prithivMLmods's picture
Update README.md
45ffd50 verified
|
raw
history blame
210 Bytes
metadata
license: apache-2.0
datasets:
  - openai/gsm8k
language:
  - en
base_model:
  - Qwen/Qwen2.5-0.5B-Instruct
pipeline_tag: text-generation
library_name: transformers
tags:
  - math
  - reasoning
  - grpo
  - trl
  - code