·
AI & ML interests
None yet
Organizations
xiwenc1/dpo_llama3.1-8b_beta0.1
Updated
xiwenc1/dpo_llama3.2-3b_beta0.1
Updated
xiwenc1/dpo_qwen2.5-7b_beta0.1
Updated
1.0B • Updated • 1
xiwenc1/sft_qwen2.5-3b_v3
Text Generation
• 0.4B • Updated • 6
xiwenc1/dpo_qwen2.5-3b_beta0.1
Text Generation
• 0.4B • Updated • 3
Text Generation
• 4B • Updated • 2
xiwenc1/OpenRS-DR_GRPO_DPP3334
2B • Updated • 1
xiwenc1/OpenRS-DR_GRPO_dra-qwen2
Text Generation
• 3B • Updated • 4
• xiwenc1/OpenRS-GRPO-qwen2
Text Generation
• 3B • Updated • 2
xiwenc1/OpenRS-DR_GRPO_dra-qwen
4B • Updated • 1
xiwenc1/OpenRS-DR_GRPO-qwen
4B • Updated • 1
xiwenc1/OpenRS-grpodra_nomic1
2B • Updated • 1
xiwenc1/OpenRS-dr_grpodra_nomic2
2B • Updated • 1
xiwenc1/OpenRS-grpodra_nomic2
2B • Updated • 1
xiwenc1/OpenRS-dr_grpodra_nomic1
2B • Updated • 1
xiwenc1/OpenRS-GRPO-DPPv3-savemore
2B • Updated • 1
xiwenc1/OpenRS-DR_GRPO_DPP2
2B • Updated • 1
Text Generation
• 2B • Updated • 2
2B • Updated • 1
xiwenc1/OpenRS-DR_GRPO_DPP
Text Generation
• 2B • Updated • 3
xiwenc1/OpenRS-GRPO-DPPv5
Text Generation
• 2B • Updated • 2
xiwenc1/OpenRS-GRPO-DPPv3
2B • Updated • 1
xiwenc1/OpenRS-GRPO-DPPv4
Text Generation
• 2B • Updated • 2
xiwenc1/OpenRS-GRPO-DPP_dropv0
Updated
xiwenc1/OpenRS-GRPO-DPPv1
Text Generation
• 2B • Updated • 7