mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.15_dcftv1.2 Text Generation • Updated 21 days ago • 100
mlfoundations-dev/hp_ablations_mistral_scheduler_inverse_sqrt_dcftv1.2 Text Generation • Updated 21 days ago • 125
mlfoundations-dev/hp_ablations_mistral_scheduler_linear_warmup0.10_dcftv1.2 Text Generation • Updated 21 days ago • 84
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_minlr1e-6_dcftv1.2 Text Generation • Updated 21 days ago • 221
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_dcftv1.2 Text Generation • Updated 20 days ago • 275
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_minlr5e-7_dcftv1.2 Text Generation • Updated 20 days ago • 100
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_minlr1e-7_dcftv1.2 Text Generation • Updated 21 days ago • 125
mlfoundations-dev/hp_ablations_grid_mistral_bsz512_lr5e-6_scheduler-cosine-warmup0.05-minlr5e-7 Text Generation • Updated 9 days ago • 87
mlfoundations-dev/hp_ablations_grid_mistral_bsz512_lr2e-6_scheduler-cosine-warmup0.15 Text Generation • Updated 9 days ago • 86
mlfoundations-dev/hp_ablations_grid_mistral_bsz512_lr2e-6_scheduler-cosine-warmup0.15-minlr5e-7 Text Generation • Updated 9 days ago • 85
mlfoundations-dev/hp_ablations_grid_mistral_bsz512_lr5e-6_scheduler-cosine-warmup0.15 Text Generation • Updated 9 days ago • 85
mlfoundations-dev/hp_ablations_grid_mistral_bsz512_lr2e-6_scheduler-cosine-warmup0.05-minlr5e-7 Text Generation • Updated 9 days ago • 86
mlfoundations-dev/hp_ablations_grid_mistral_bsz512_lr5e-6_scheduler-cosine-warmup0.15-minlr5e-7 Text Generation • Updated 9 days ago • 83