-
Attention Is All You Need
Paper ā¢ 1706.03762 ā¢ Published ā¢ 50 -
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
Paper ā¢ 2307.08691 ā¢ Published ā¢ 8 -
Mixtral of Experts
Paper ā¢ 2401.04088 ā¢ Published ā¢ 158 -
Mistral 7B
Paper ā¢ 2310.06825 ā¢ Published ā¢ 47
Collections
Discover the best community collections!
Collections including paper arxiv:2402.17177
-
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper ā¢ 2402.17764 ā¢ Published ā¢ 605 -
Mixtral of Experts
Paper ā¢ 2401.04088 ā¢ Published ā¢ 158 -
Mistral 7B
Paper ā¢ 2310.06825 ā¢ Published ā¢ 47 -
Don't Make Your LLM an Evaluation Benchmark Cheater
Paper ā¢ 2311.01964 ā¢ Published ā¢ 1
-
OmnimatteRF: Robust Omnimatte with 3D Background Modeling
Paper ā¢ 2309.07749 ā¢ Published ā¢ 7 -
AudioSR: Versatile Audio Super-resolution at Scale
Paper ā¢ 2309.07314 ā¢ Published ā¢ 25 -
Generative Image Dynamics
Paper ā¢ 2309.07906 ā¢ Published ā¢ 53 -
MagiCapture: High-Resolution Multi-Concept Portrait Customization
Paper ā¢ 2309.06895 ā¢ Published ā¢ 27
-
Large-Scale Automatic Audiobook Creation
Paper ā¢ 2309.03926 ā¢ Published ā¢ 54 -
Agents: An Open-source Framework for Autonomous Language Agents
Paper ā¢ 2309.07870 ā¢ Published ā¢ 42 -
PDFTriage: Question Answering over Long, Structured Documents
Paper ā¢ 2309.08872 ā¢ Published ā¢ 53 -
StarCoder: may the source be with you!
Paper ā¢ 2305.06161 ā¢ Published ā¢ 29