kushal-tri/sft-codecontests-qwen_ds-code-contests_model-Qwen2.5-Coder-7B-Instruct_sch-cosine_lr-1e-7_bs-64_ Updated Nov 13 • 3
kushal-tri/sft-codecontests-qwen_ds-code-contests_model-Qwen2.5-Coder-7B-Instruct_sch-cosine_lr-1e-6_bs-64_ Updated Nov 13 • 3
kushal-tri/sft-codecontests-qwen_ds-code-contests_model-Qwen2.5-Coder-7B-Instruct_sch-cosine_lr-1e-5_bs-64_ Updated Nov 13 • 5
kushal-tri/sft_ds-prm800k_model-Meta-Llama-3-8B-Instruct_sch-constant_lr-1e-5_bs-128_acc-4_len-2048 Updated Nov 8