Collections
Collections including paper arxiv:2312.00752
- Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
  Paper • 2312.03818 • Published • 32
- Scaling Laws of Synthetic Images for Model Training ... for Now
  Paper • 2312.04567 • Published • 8
- Large Language Models for Mathematicians
  Paper • 2312.04556 • Published • 12
- LooseControl: Lifting ControlNet for Generalized Depth Conditioning
  Paper • 2312.03079 • Published • 13

- Mamba: Linear-Time Sequence Modeling with Selective State Spaces
  Paper • 2312.00752 • Published • 140
- SparQ Attention: Bandwidth-Efficient LLM Inference
  Paper • 2312.04985 • Published • 39
- Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
  Paper • 2402.00159 • Published • 62
- Neural Network Diffusion
  Paper • 2402.13144 • Published • 95

- Mamba: Linear-Time Sequence Modeling with Selective State Spaces
  Paper • 2312.00752 • Published • 140
- Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis
  Paper • 2312.03491 • Published • 34
- Order Matters in the Presence of Dataset Imbalance for Multilingual Learning
  Paper • 2312.06134 • Published • 3
- LLM in a flash: Efficient Large Language Model Inference with Limited Memory
  Paper • 2312.11514 • Published • 259

- Orca 2: Teaching Small Language Models How to Reason
  Paper • 2311.11045 • Published • 73
- ToolTalk: Evaluating Tool-Usage in a Conversational Setting
  Paper • 2311.10775 • Published • 8
- Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning
  Paper • 2311.11077 • Published • 25
- MultiLoRA: Democratizing LoRA for Better Multi-Task Learning
  Paper • 2311.11501 • Published • 34