mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_minlr1e-7 Text Generation • Updated Dec 3, 2024 • 76
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_minlr1e-6 Text Generation • Updated Dec 3, 2024 • 151
mlfoundations-dev/hp_ablations_mistral_adambeta1_0.85_dcftv1.2 Text Generation • Updated Dec 5, 2024 • 125
mlfoundations-dev/hp_ablations_mistral_adambeta1_0.95_dcftv1.2 Text Generation • Updated Dec 5, 2024 • 82
mlfoundations-dev/hp_ablations_mistral_adambeta1_0.9_dcftv1.2 Text Generation • Updated Dec 5, 2024 • 29
mlfoundations-dev/hp_ablations_mistral_adambeta1_0.92_dcftv1.2 Text Generation • Updated Dec 5, 2024 • 192
mlfoundations-dev/hp_ablations_mistral_adambeta2_0.999_dcftv1.2 Text Generation • Updated Dec 5, 2024 • 104
mlfoundations-dev/hp_ablations_mistral_adambeta2_0.995_dcftv1.2 Text Generation • Updated Dec 5, 2024 • 190
mlfoundations-dev/hp_ablations_mistral_adambeta2_0.95_dcftv1.2 Text Generation • Updated Dec 5, 2024 • 131
mlfoundations-dev/hp_ablations_mistral_adambeta2_0.99_dcftv1.2 Text Generation • Updated Dec 5, 2024 • 131
mlfoundations-dev/hp_ablations_mistral_adambeta2_0.9995_dcftv1.2 Text Generation • Updated Dec 5, 2024 • 171
mlfoundations-dev/hp_ablations_mistral_adambeta2_0.98_dcftv1.2 Text Generation • Updated Dec 5, 2024 • 80
mlfoundations-dev/hp_ablations_mistral_scheduler_constant_dcftv1.2 Text Generation • Updated Dec 5, 2024 • 21
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05_minlr1e-6_dcftv1.2 Text Generation • Updated Dec 5, 2024 • 104
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05_minlr5e-7_dcftv1.2 Text Generation • Updated Dec 5, 2024 • 84
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05_dcftv1.2 Text Generation • Updated Dec 5, 2024 • 79
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05_minlr1e-7_dcftv1.2 Text Generation • Updated Dec 5, 2024 • 79
mlfoundations-dev/hp_ablations_mistral_scheduler_linear_warmup0.05_dcftv1.2 Text Generation • Updated Dec 4, 2024 • 20
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.15_dcftv1.2 Text Generation • Updated Dec 4, 2024 • 79
mlfoundations-dev/hp_ablations_mistral_scheduler_inverse_sqrt_dcftv1.2 Text Generation • Updated Dec 4, 2024 • 103
mlfoundations-dev/hp_ablations_mistral_scheduler_linear_warmup0.10_dcftv1.2 Text Generation • Updated Dec 4, 2024 • 12