In Case You Missed It: ARC 'Challenge' Is Not That Challenging Paper • 2412.17758 • Published 9 days ago • 14
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing Paper • 2412.14711 • Published 13 days ago • 13
RAG-Star: Enhancing Deliberative Reasoning with Retrieval Augmented Verification and Refinement Paper • 2412.12881 • Published 15 days ago • 1
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought Paper • 2412.17498 • Published 9 days ago • 17
The Open Source Advantage in Large Language Models (LLMs) Paper • 2412.12004 • Published 16 days ago • 9
EXAONE 3.5: Series of Large Language Models for Real-world Use Cases Paper • 2412.04862 • Published 26 days ago • 48
SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance Paper • 2412.02687 • Published 29 days ago • 108
NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images Paper • 2412.03517 • Published 28 days ago • 18
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated 19 days ago • 120
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper • 2412.03555 • Published 28 days ago • 119
X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models Paper • 2412.01824 • Published 30 days ago • 65
view article Article Use Models from the Hugging Face Hub in LM Studio By yagilb • Nov 28, 2024 • 127
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models Paper • 2411.04996 • Published Nov 7, 2024 • 49