selfcorrexp2

models 33

selfcorrexp2/llama31_ace_1ep

Text Generation • 8B • Updated Jan 28, 2025 • 1

selfcorrexp2/beta01_balanced_dpo_step100

Text Generation • 8B • Updated Jan 22, 2025 • 1

selfcorrexp2/llama3sft_balanced_dpo_step550

Text Generation • 8B • Updated Jan 22, 2025 • 2

selfcorrexp2/type12_70b_step300

Text Generation • 8B • Updated Jan 20, 2025 • 1

selfcorrexp2/type12_math_augmath_beta05_nosftloss_step400

Text Generation • 8B • Updated Jan 18, 2025 • 1

selfcorrexp2/type12_math_augmath_dpo_sftlossbeta05_step400

Text Generation • 8B • Updated Jan 18, 2025 • 1

selfcorrexp2/nosft_llama3sft_dpo_type3_7k_ver2_step100

Text Generation • 8B • Updated Jan 15, 2025 • 2

selfcorrexp2/llama3_sft_more_corr_rr0k_3ep

Text Generation • 8B • Updated Jan 10, 2025 • 2

selfcorrexp2/llama3_sft_less_corr_rr0k_ep3_train_on_reasoning

Text Generation • 8B • Updated Jan 10, 2025 • 3

selfcorrexp2/llama3_sft_balanced_corr_rr0k_ep3_train_on_reasoning

Text Generation • 8B • Updated Jan 8, 2025 • 4

datasets 288

selfcorrexp2/llama31_ace_kumar_testtmp07

Viewer • Updated Jan 28, 2025 • 15k • 6

selfcorrexp2/llama31_ace_kumar_testtmp10

Viewer • Updated Jan 28, 2025 • 15k • 4

selfcorrexp2/balanced_model_as_rm_2prompt

Viewer • Updated Jan 23, 2025 • 5k • 4 • 1

selfcorrexp2/balanced_model_as_rm

Viewer • Updated Jan 23, 2025 • 5k • 6

selfcorrexp2/selfcorrexp2_llama3_openmath_1m_ep1_tmp10_goldrm_labeled

Viewer • Updated Jan 23, 2025 • 15k • 11

selfcorrexp2/HanningZhang_Llama3-sft-more-corr-rr60k-3ep_moredatatmp10_vllmexp3

Viewer • Updated Jan 23, 2025 • 15k • 2

selfcorrexp2/HanningZhang_Llama3-sft-more-corr-rr60k-3ep_moredatatmp10

Viewer • Updated Jan 23, 2025 • 15k • 1

selfcorrexp2/HanningZhang_Llama3-sft-more-corr-rr60k-3ep_moredatatmp10_gold_reward

Viewer • Updated Jan 23, 2025 • 15k • 4

selfcorrexp2/balanced_self_rewarding_rm_labeled_llama3_sft_gen_1round_prompt

Viewer • Updated Jan 23, 2025 • 15k • 4

selfcorrexp2/llama3_sft_more_corr_rr0k_3ep_more_datatmp10_vllmexp3

Viewer • Updated Jan 23, 2025 • 15k • 3
