nbd22
/

Llama-3.1-8B-Instruct-GRPO-gsm8k-ft-lora

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

Llama-3.1-8B-Instruct-GRPO-gsm8k-ft-lora

1 contributor

History: 1 commit

nbd22's picture

initial commit

37f8b3c verified about 1 month ago

.gitattributes

1.52 kB

initial commit about 1 month ago