Going multimodal: How Prezi is leveraging the Hub and the Expert Support Program to accelerate their ML roadmap Jun 19, 2024 β’ 11
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi β’ 13 items β’ Updated Sep 18, 2024 β’ 224
view article Article Memory-efficient Diffusion Transformers with Quanto and Diffusers Jul 30, 2024 β’ 61
Writing in the Margins: Better Inference Pattern for Long Context Retrieval Paper β’ 2408.14906 β’ Published Aug 27, 2024 β’ 138
NIM Serverless Inference API Collection Models in this collection are available for inference via a serverless API powered by NVIDIA NIM. β’ 8 items β’ Updated Oct 14, 2024 β’ 22
view article Article π₯ Argilla 2.0: the data-centric tool for AI makers π€ By dvilasuero β’ Jul 30, 2024 β’ 37
view article Article Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth By mlabonne β’ Jul 29, 2024 β’ 260
Consent in Crisis: The Rapid Decline of the AI Data Commons Paper β’ 2407.14933 β’ Published Jul 20, 2024 β’ 12
view article Article From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate Jun 13, 2024 β’ 45
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report Paper β’ 2405.00732 β’ Published Apr 29, 2024 β’ 118
Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers β’ 67 items β’ Updated Jul 3, 2024 β’ 89
view article Article Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints May 1, 2024 β’ 69
view article Article Making thousands of open LLMs bloom in the Vertex AI Model Garden Apr 10, 2024 β’ 18