Article Train 400x faster Static Embedding Models with Sentence Transformers 5 days ago • 105
Article Use Models from the Hugging Face Hub in LM Studio By yagilb • Nov 28, 2024 • 132
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 640
FlashSpeech: Efficient Zero-Shot Speech Synthesis Paper • 2404.14700 • Published Apr 23, 2024 • 30
Proactive Detection of Voice Cloning with Localized Watermarking Paper • 2401.17264 • Published Jan 30, 2024 • 18
Masked Audio Generation using a Single Non-Autoregressive Transformer Paper • 2401.04577 • Published Jan 9, 2024 • 42
Pheme: Efficient and Conversational Speech Generation Paper • 2401.02839 • Published Jan 5, 2024 • 17
CoMoSVC: Consistency Model-based Singing Voice Conversion Paper • 2401.01792 • Published Jan 3, 2024 • 8
LLM in a flash: Efficient Large Language Model Inference with Limited Memory Paper • 2312.11514 • Published Dec 12, 2023 • 257
Amphion: An Open-Source Audio, Music and Speech Generation Toolkit Paper • 2312.09911 • Published Dec 15, 2023 • 53
StemGen: A music generation model that listens Paper • 2312.08723 • Published Dec 14, 2023 • 47
Order Matters in the Presence of Dataset Imbalance for Multilingual Learning Paper • 2312.06134 • Published Dec 11, 2023 • 2
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration Paper • 2311.04257 • Published Nov 7, 2023 • 20
Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis Paper • 2312.03491 • Published Dec 6, 2023 • 33
Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models Paper • 2312.03632 • Published Dec 6, 2023 • 4
Mamba: Linear-Time Sequence Modeling with Selective State Spaces Paper • 2312.00752 • Published Dec 1, 2023 • 139