Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation Paper • 2410.13848 • Published Oct 17, 2024 • 32
B4: Towards Optimal Assessment of Plausible Code Solutions with Plausible Tests Paper • 2409.08692 • Published Sep 13, 2024 • 26
BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline Paper • 2408.15079 • Published Aug 27, 2024 • 52
CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis Paper • 2407.13301 • Published Jul 18, 2024 • 56
DynMoE Family Collection DynMoE model checkpoints and paper on huggingface • 4 items • Updated Aug 19, 2024 • 4
Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models Paper • 2405.14297 • Published May 23, 2024 • 2