pythia-6.9b-tldr-dpo-beta-0.0175-alpha-0-step-19968 / model-00004-of-00006.safetensors

Commit History