RichardErkhov/RyanYr_-_self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter5-gguf Updated 17 days ago • 588
RichardErkhov/RyanYr_-_self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter2-only2nd-2e-7-gguf Updated 17 days ago • 627
RyanYr/self-correct_Llama-3.2-3B-Instruct_OpenMathInstruct-2_dpo_iter1 Text Generation • Updated 16 days ago • 31
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter4_OpenMathIt2_iter1 Text Generation • Updated 16 days ago • 26
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter4_metaMathQA_dpo_iter5 Text Generation • Updated 16 days ago • 26
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter5_lr1e-7 Text Generation • Updated 16 days ago • 31
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter5_lr3e-7 Text Generation • Updated 16 days ago • 29
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter7 Text Generation • Updated 16 days ago • 50
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter7_ds-iter3-metaMathQA Text Generation • Updated 16 days ago • 34
RyanYr/self-correct_Ministral-8B-Instruct-2410_metaMathQA_dpo_iter1 Text Generation • Updated 6 days ago • 273
RyanYr/self-correct_Ministral-8B-Instruct-2410_metaMathQA_dpo_iter2 Text Generation • Updated 13 days ago • 75
RyanYr/self-correct_Ministral-8B-Instruct-2410_metaMathQA_dpo_iter3 Text Generation • Updated 13 days ago • 31
RyanYr/self-correct_Ministral-8B-Instruct-2410_metaMathQA_dpo_iter3_lr4e-7 Text Generation • Updated 13 days ago • 22
RyanYr/self-correct_Ministral-8B-Instruct-2410_metaMathQA_dpo_iter3_lr1e-7 Text Generation • Updated 13 days ago • 26
RyanYr/self-correct_Ministral-8B-Instruct-2410_metaMathQA_dpo_iter4_lr1e-7 Text Generation • Updated 13 days ago • 43
RyanYr/self-correct_Ministral-8B-Instruct-2410_metaMathQA_dpo_iter5_lr1e-7 Text Generation • Updated 12 days ago • 22
RyanYr/self-correct_Ministral-8B-Instruct-2410_metaMathQA_dpo_iter5_lr2e-7 Text Generation • Updated 12 days ago • 42
RyanYr/self-correct_Ministral-8B-Instruct-2410_metaMathQA_dpo_iter6_lr2e-7 Text Generation • Updated 12 days ago • 29
RichardErkhov/RyanYr_-_self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter6-gguf Updated 12 days ago • 430
RichardErkhov/RyanYr_-_self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter7-gguf Updated 12 days ago • 398
RichardErkhov/RyanYr_-_self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter2-only2nd-4e-7-gguf Updated 12 days ago • 476
RichardErkhov/RyanYr_-_self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter7_ds-iter3-metaMathQA-gguf Updated 12 days ago • 430
RichardErkhov/RyanYr_-_self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter2-only2nd-6e-7-gguf Updated 12 days ago • 465
RichardErkhov/RyanYr_-_self-correct_Llama-3.2-3B-Instruct_OpenMathInstruct-2_dpo_iter1-gguf Updated 12 days ago • 445
RichardErkhov/RyanYr_-_self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter5_lr3e-7-gguf Updated 12 days ago • 426
RichardErkhov/RyanYr_-_self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter4_OpenMathIt2_iter1-gguf Updated 11 days ago • 423
RichardErkhov/RyanYr_-_self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter5_lr1e-7-gguf Updated 11 days ago • 568