Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
UCLA-AGI
's Collections
zephyr-7b-sft-full-SPIN
datasets-SPIN
SPIN-Diffusion
SPPO
SPPO
updated
Jun 29
Self-Play Preference Optimization
Upvote
12
+2
UCLA-AGI/Mistral7B-PairRM-SPPO
Text Generation
•
Updated
May 7
•
3.47k
•
6
UCLA-AGI/Mistral7B-PairRM-SPPO-Iter1
Text Generation
•
Updated
May 6
•
2.63k
•
2
UCLA-AGI/Mistral7B-PairRM-SPPO-Iter2
Text Generation
•
Updated
May 6
•
7.91k
•
1
UCLA-AGI/Mistral7B-PairRM-SPPO-Iter3
Text Generation
•
Updated
May 7
•
7.92k
•
5
UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter1
Text Generation
•
Updated
Jun 25
•
4.83k
•
1
UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter2
Text Generation
•
Updated
Jun 25
•
7.88k
UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3
Text Generation
•
Updated
Jun 28
•
7.63k
•
77
UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3
Text Generation
•
Updated
Jul 1
•
9.09k
•
118
UCLA-AGI/Gemma-2-9B-It-SPPO-Iter2
Text Generation
•
Updated
Jul 1
•
6.65k
•
2
UCLA-AGI/Gemma-2-9B-It-SPPO-Iter1
Text Generation
•
Updated
Jul 1
•
5.15k
•
3
Upvote
12
+8
Share collection
View history
Collection guide
Browse collections