RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter5 Text Generation • Updated 18 days ago • 472
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter4_OpenMathIt2_iter1 Text Generation • Updated 16 days ago • 26
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter4_metaMathQA_dpo_iter5 Text Generation • Updated 16 days ago • 26
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter5_lr1e-7 Text Generation • Updated 16 days ago • 31
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter5_lr3e-7 Text Generation • Updated 16 days ago • 29