This collections contains a list of SimPO and baseline models.
-
princeton-nlp/gemma-2-9b-it-SimPO
Text Generation • 9B • Updated • 20.5k • • 170 -
princeton-nlp/gemma-2-9b-it-DPO
Text Generation • 9B • Updated • 1.14k • • 9 -
princeton-nlp/Llama-3-Base-8B-SFT-IPO
Text Generation • 8B • Updated • 1.1k • • 1 -
princeton-nlp/Llama-3-Base-8B-SFT-DPO
Text Generation • 8B • Updated • 1.44k •