plaguss/Llama-3.1-8B-Instruct-FineTome-APO-zero-6epoch-rmsprop Text Generation • Updated 23 days ago • 25
plaguss/Llama-3.1-8B-Instruct-FineTome-APO-zero-12epoch-rmsprop-2048 Text Generation • Updated 22 days ago • 28
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter1 Text Generation • Updated 20 days ago • 266
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter2 Text Generation • Updated 21 days ago • 69
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter3 Text Generation • Updated 21 days ago • 86
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter2-only2nd-2e-7 Text Generation • Updated 20 days ago • 45
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter2-only2nd-4e-7 Text Generation • Updated 20 days ago • 30
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter2-only2nd-6e-7 Text Generation • Updated 20 days ago • 28
RyanYr/self-correct_Llama-3.1-8B-Instruct_metaMathQA_dpo_iter1 Text Generation • Updated 20 days ago • 24
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter4 Text Generation • Updated 18 days ago • 99
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter5 Text Generation • Updated 18 days ago • 472
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter6 Text Generation • Updated 17 days ago • 65
RichardErkhov/RyanYr_-_self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter3-gguf Updated 17 days ago • 639
RichardErkhov/RyanYr_-_self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter2-gguf Updated 17 days ago • 766
RichardErkhov/RyanYr_-_self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter4-gguf Updated 17 days ago • 631