Uploaded two basic SLERP merges of princeton-nlp/Llama-3-Instruct-8B-SimPO and UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3, alternating the choice of base model, for people to test out and potentially use as merge fuel. (Personally, I am drawn to intelligent and attentive models, hence the experimentation.)
grimjim/Llama-3-Instruct-8B-SPPO-Iter3-SimPO-merge
grimjim/Llama-3-Instruct-8B-SimPO-SPPO-Iter3-merge
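
For anyone unfamiliar with SLERP, here is a minimal sketch of spherical linear interpolation on a single flattened weight vector. It is not the actual merge recipe used for the two uploads. The function name `slerp`, the `eps` threshold, and the fallback to plain linear interpolation are assumptions made for illustration; a real merge tool applies something like this per tensor and may handle the base model differently.

```python
import math

def slerp(v0, v1, t, eps=1e-8):
    """Spherically interpolate between two flat weight vectors (hypothetical helper).

    t=0 returns v0 (treated here as the "base"), t=1 returns v1.
    """
    # Plain linear interpolation, used as a fallback for degenerate cases.
    lerp = [(1 - t) * a + t * b for a, b in zip(v0, v1)]

    norm0 = math.sqrt(sum(a * a for a in v0))
    norm1 = math.sqrt(sum(b * b for b in v1))
    if norm0 < eps or norm1 < eps:
        return lerp

    # Angle between the two vectors, with the cosine clamped for safety.
    cos_theta = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    cos_theta = max(-1.0, min(1.0, cos_theta))
    theta = math.acos(cos_theta)
    sin_theta = math.sin(theta)
    if abs(sin_theta) < eps:
        # Vectors are (anti)parallel: SLERP is undefined, so use the fallback.
        return lerp

    w0 = math.sin((1 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]
```

In this sketch, swapping which model is passed first only changes which endpoint t=0 returns, so `slerp(a, b, t)` matches `slerp(b, a, 1 - t)`. In an actual merge, the choice of base model may also affect other things, such as which config or tokenizer is carried over, which is part of why both orderings are worth testing.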