Text Generation
Transformers
Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
MagpieLM-8B-Chat-v0.1 / model-00002-of-00004.safetensors

Commit History

Training in progress, step 1531
e0f4dd7
verified

flydust commited on

Training in progress, step 1500
72f60d1
verified

flydust commited on

Training in progress, step 1000
c53d548
verified

flydust commited on

Training in progress, step 500
8656d23
verified

flydust commited on