mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05 • Text Generation • Updated Dec 2, 2024 • 195
mlfoundations-dev/hp_ablations_mistral_scheduler_linear_warmup0.10 • Text Generation • Updated Dec 2, 2024 • 194
mlfoundations-dev/hp_ablations_mistral_scheduler_inverse_sqrt • Text Generation • Updated Dec 2, 2024 • 190
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.15 • Text Generation • Updated Dec 2, 2024 • 106
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10 • Text Generation • Updated Dec 1, 2024 • 148
mlfoundations-dev/hp_ablations_mistral_scheduler_linear_warmup0.05 • Text Generation • Updated Dec 2, 2024 • 187
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05_minlr1e-7 • Text Generation • Updated Dec 3, 2024 • 128
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05_minlr1e-6 • Text Generation • Updated Dec 3, 2024 • 190
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05_minlr5e-7 • Text Generation • Updated Dec 3, 2024 • 79
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_minlr5e-7 • Text Generation • Updated Dec 3, 2024 • 79