view article Article πΊπ¦ββ¬ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram β’ 1 day ago β’ 24
VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control Paper β’ 2412.20800 β’ Published 5 days ago β’ 4
view article Article Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 β’ 5 days ago β’ 17
PERSE: Personalized 3D Generative Avatars from A Single Portrait Paper β’ 2412.21206 β’ Published 4 days ago β’ 14
Training Software Engineering Agents and Verifiers with SWE-Gym Paper β’ 2412.21139 β’ Published 4 days ago β’ 16
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization Paper β’ 2412.21037 β’ Published 5 days ago β’ 20
On the Compositional Generalization of Multimodal LLMs for Medical Imaging Paper β’ 2412.20070 β’ Published 7 days ago β’ 39
YuLan-Mini: An Open Data-efficient Language Model Paper β’ 2412.17743 β’ Published 11 days ago β’ 59
YuLan-Mini Collection A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. β’ 5 items β’ Updated 6 days ago β’ 10
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment Paper β’ 2412.19326 β’ Published 8 days ago β’ 17
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey Paper β’ 2412.18619 β’ Published 19 days ago β’ 44
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search Paper β’ 2412.18319 β’ Published 11 days ago β’ 33
WavePulse: Real-time Content Analytics of Radio Livestreams Paper β’ 2412.17998 β’ Published 11 days ago β’ 9
How "Real" is Your Real-Time Simultaneous Speech-to-Text Translation System? Paper β’ 2412.18495 β’ Published 11 days ago β’ 8
Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding Paper β’ 2412.17295 β’ Published 12 days ago β’ 9
Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching Paper β’ 2412.17153 β’ Published 12 days ago β’ 33