xyj787878
/

Qwen2.5-0.5B-GRPO-kuakua

Reinforcement Learning

Model card Files Files and versions Community

Qwen2.5-0.5B-GRPO-kuakua / README.md

xyj787878's picture

initial commit

13f298f verified 7 days ago

|

31 Bytes

	---
	license: apache-2.0
	---