RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter3 Text Generation • Updated 21 days ago • 86
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter2-only2nd-2e-7 Text Generation • Updated 20 days ago • 45
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter2-only2nd-4e-7 Text Generation • Updated 20 days ago • 30
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter2-only2nd-6e-7 Text Generation • Updated 20 days ago • 28