Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models Paper ā¢ 2408.15518 ā¢ Published Aug 28, 2024 ā¢ 42
The Mamba in the Llama: Distilling and Accelerating Hybrid Models Paper ā¢ 2408.15237 ā¢ Published Aug 27, 2024 ā¢ 38 ā¢ 4