RichardErkhov/RyanYr_-_self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter4_metaMathQA_dpo_iter5-gguf Updated 8 days ago • 412
RichardErkhov/RyanYr_-_self-correct_Ministral-8B-Instruct-2410_metaMathQA_dpo_iter1-gguf Updated 7 days ago • 437
RichardErkhov/RyanYr_-_self-correct_Ministral-8B-Instruct-2410_metaMathQA_dpo_iter2-gguf Updated 7 days ago • 415
RyanYr/self-correct_mistral-small-it_mMQA_dpo_iter2_ref-iter1 Text Generation • Updated 4 days ago • 17
RyanYr/self-correct_mistral-small-it_mMQA_dpo_iter3_ref-iter2 Text Generation • Updated 4 days ago • 16
RyanYr/self-correct_mistral-small-it_mMQA_dpo_iter4_beta.05 Text Generation • Updated 4 days ago • 15
RyanYr/self-correct_mistral-small-it_mMQA_dpo_iter2_beta.05 Text Generation • Updated 4 days ago • 15
RyanYr/self-correct_mistral-small-it_mMQA_dpo_iter3_beta.05 Text Generation • Updated 3 days ago • 14
RyanYr/self-correct_mistral-small-it_mMQA_dpo_iter1_beta.05 Text Generation • Updated 3 days ago • 13
AlekseyKorshuk/ai-detection-gutenberg-human-v2-formatted-ai-sft-qwen-7b-dpo-3epochs Text Generation • Updated about 16 hours ago