Text Generation
Transformers
Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
MagpieLM-4B-Chat-v0.1 / model-00001-of-00002.safetensors

Commit History

Training in progress, step 1531
b188e2b
verified

flydust commited on

Training in progress, step 1500
ede8b90
verified

flydust commited on

Training in progress, step 1000
590ab4f
verified

flydust commited on

Training in progress, step 500
6a45477
verified

flydust commited on