Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models Paper • 2501.11873 • Published 1 day ago • 45
HanxuHU/gemma-llama-2-9b-it-ultrafeedback-annotate-ultrafb-judge-5-maj Viewer • Updated Nov 28, 2024 • 60k • 46
HanxuHU/gemma2-9B-it-ultrafeedback-annotate-ultrafb-merge-single-filtered Viewer • Updated Nov 26, 2024 • 56.4k • 51
HanxuHU/gemma2-9B-it-ultrafeedback-annotate-ultrafb-judge-5-majority-filtered Viewer • Updated Nov 26, 2024 • 55.2k • 40
HanxuHU/gemma2-9B-it-ultrafeedback-annotate-ultrafb-merge-single-judge Viewer • Updated Nov 25, 2024 • 60.7k • 45
HanxuHU/gemma-2-9b-it-ultrafeedback-annotate-ultrafb-merge-single-judge Viewer • Updated Nov 25, 2024 • 1.96k • 47
HanxuHU/gemma-2-9b-it-ultrafeedback-annotate-honesty-judge Viewer • Updated Nov 24, 2024 • 1.96k • 40
HanxuHU/gemma-2-9b-it-ultrafeedback-annotate-ultrafb-judge-5-maj Viewer • Updated Nov 24, 2024 • 60.7k • 44
HanxuHU/gemma-2-9b-it-ultrafeedback-annotate-5aspect-judge Viewer • Updated Nov 17, 2024 • 60.7k • 42
HanxuHU/gemma-2-9b-it-ultrafeedback-annotate-coherence-judge Viewer • Updated Nov 16, 2024 • 58.8k • 44
HanxuHU/gemma-2-9b-it-ultrafeedback-annotate-helpsteer-judge-helpfulness Viewer • Updated Nov 15, 2024 • 60.7k • 41
HanxuHU/gemma-2-9b-it-ultrafeedback-annotate-helpsteer-judge-5-llm Viewer • Updated Nov 13, 2024 • 60.7k • 39
HanxuHU/Llama-3-8B-Instruct-ultrafeedback-annotate-helpsteer-verbose Viewer • Updated Nov 13, 2024 • 1.2M • 66