zztheaven
/

Llama-3.2-3B-Instruct-Open-R1-GRPO

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Llama-3.2-3B-Instruct-Open-R1-GRPO / trainer_state.json

zztheaven's picture

Model save

54e00b3 verified 21 days ago

history contribute delete

339 kB

File too large to display, you can check the raw version instead.