mlfoundations-dev/hp_ablations_qwen_scheduler_linear_warmup0.05 Text Generation • Updated Dec 2, 2024 • 21
mlfoundations-dev/hp_ablations_qwen_scheduler_linear_warmup0.10 Text Generation • Updated Dec 2, 2024 • 4
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.10 Text Generation • Updated Dec 2, 2024 • 21
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.05 Text Generation • Updated Dec 2, 2024 • 45
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.15 Text Generation • Updated Dec 2, 2024 • 21
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.05_minlr1e-6 Text Generation • Updated Dec 2, 2024 • 45
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.10_minlr1e-6 Text Generation • Updated Dec 3, 2024 • 45
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.05_minlr1e-7 Text Generation • Updated Dec 3, 2024 • 4
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.05_minlr5e-7 Text Generation • Updated Dec 3, 2024 • 45
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.10_minlr5e-7 Text Generation • Updated Dec 3, 2024 • 4
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.10_minlr1e-7 Text Generation • Updated Dec 3, 2024 • 20
mlfoundations-dev/hp_ablations_qwen_adambeta1_0.95_dcftv1.2 Text Generation • Updated Dec 5, 2024 • 9
mlfoundations-dev/hp_ablations_qwen_adambeta1_0.92_dcftv1.2 Text Generation • Updated Dec 5, 2024 • 44
mlfoundations-dev/hp_ablations_qwen_adambeta2_0.95_dcftv1.2 Text Generation • Updated Dec 5, 2024 • 12
mlfoundations-dev/hp_ablations_qwen_adambeta2_0.98_dcftv1.2 Text Generation • Updated Dec 5, 2024 • 9
mlfoundations-dev/hp_ablations_qwen_adambeta1_0.85_dcftv1.2 Text Generation • Updated Dec 5, 2024 • 45
mlfoundations-dev/hp_ablations_qwen_adambeta2_0.999_dcftv1.2 Text Generation • Updated Dec 5, 2024 • 45