SFT final models merged with the base model in full precision, as observed to preserve the results
clembench-project-playpen
community
AI & ML interests
None defined yet.
models 337
clembench-playpen/Qwen2-7B-DPO_dialogue
Updated
clembench-playpen/Qwen2-7B-DPO_turn
Updated
clembench-playpen/Qwen2-7B-SFT_merged
Text Generation • 8B • Updated
• 2
clembench-playpen/Llama8B_DPO_turn_solved
Updated
clembench-playpen/Qwen2-7B-Instruct
Updated
clembench-playpen/llama-3.1-70B-Instruct_playpen_SFT_DFINAL_0.6K-steps_merged_fp16_turn
Updated
clembench-playpen/llama-3.1-70B-Instruct_playpen_SFT_DFINAL_0.6K-steps_merged_fp16_dialogue
Updated
clembench-playpen/Qwen2.5-7B-Instruct_dialogue
Updated
clembench-playpen/Mistral-Small-24B-Instruct-less-steps_playpen_SFT-e3_DFINAL_0.35K-steps
Updated
clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps_merged_full_precision_copy_turn
Updated
datasets 51
clembench-playpen/DPO_turn
Viewer
• Updated
• 58.9k • 75
clembench-playpen/DPO_turn_solved_old
Viewer
• Updated
• 87.6k • 6
clembench-playpen/DPO_dialogue
Viewer
• Updated
• 10.1k • 9
clembench-playpen/DPO_turn_bug
Viewer
• Updated
• 87.6k • 24
clembench-playpen/SFT-Final-Dataset
Viewer
• Updated
• 7.37k • 24
clembench-playpen/DPO_turn_allneg_old_and_new
Viewer
• Updated
• 202k • 3
clembench-playpen/DPO_turn_allneg_old
Viewer
• Updated
• 34k • 6
clembench-playpen/DPO_dialogue_1neg_old
Viewer
• Updated
• 6.7k • 50
clembench-playpen/DPO_turn_allneg_old_6m
Viewer
• Updated
• 34k • 21
clembench-playpen/DPO_dialogue_1neg_best_models_old_6m
Viewer
• Updated
• 2.33k • 4