Text Generation
Transformers
Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
MagpieLM-8B-Chat-v0.1 / trainer_state.json

Commit History

Model save
d52283b
verified

flydust commited on