CriteriaPO/qwen2.5-3b-dpo-ablation-finegrained-all-vanilla-400k
Updated
CriteriaPO/qwen2.5-3b-dpo-ablation-finegrained-500k-vanilla-100k
Text Generation
• 3B • Updated • 2
CriteriaPO/qwen2.5-3b-dpo-ablation-finegrained-200k
Text Generation
• 3B • Updated • 2
CriteriaPO/qwen2.5-3b-dpo-vanilla-in-finegrained
Text Generation
• 3B • Updated • 3
CriteriaPO/qwen2.5-3b-dpo-finegrained
Text Generation
• 3B • Updated • 13
• CriteriaPO/qwen2.5-3b-dpo-coarse
Text Generation
• 3B • Updated • 6
• CriteriaPO/qwen2.5-3b-dpo-mini
Text Generation
• 3B • Updated • 3
• CriteriaPO/qwen2.5-3b-dpo-vanilla
Text Generation
• 3B • Updated • 48
• CriteriaPO/llama3.2-3b-dpo-coarse
Text Generation
• 3B • Updated • 7
• CriteriaPO/llama3.2-3b-dpo-finegrained
Text Generation
• 3B • Updated • 10
• CriteriaPO/llama3.2-3b-dpo-vanilla
Text Generation
• 3B • Updated • 1
• CriteriaPO/llama3.2-3b-dpo-mini
Text Generation
• 3B • Updated • 2
• CriteriaPO/qwen2.5-3b-sft-10
Text Generation
• 3B • Updated • 37
• CriteriaPO/llama3.2-3b-sft-10
Text Generation
• 3B • Updated • 1
•