YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs64_lr1e-06_42 Updated about 21 hours ago
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs256_lr1e-07_0 Text Generation • Updated Mar 3 • 36
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs256_lr5e-06_0 Text Generation • Updated Mar 2 • 47
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs256_lr1e-06_0 Updated Mar 2 • 19
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs256_lr5e-07_0 Text Generation • Updated Mar 2 • 94
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr1e-06_43 Text Generation • Updated Feb 19 • 46
YuchenLi01/Math-Step-DPO-10K-augmented-Qwen2.5MathRM72B Viewer • Updated about 7 hours ago • 10.8k • 71