Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints May 1, 2024 • 69
Distributed Training: Train BART/T5 for Summarization using 🤗 Transformers and Amazon SageMaker Apr 8, 2021
LLM Reasoning Papers Collection Papers to improve reasoning capabilities of LLMs • 17 items • Updated 9 days ago • 91
Large Language Monkeys: Scaling Inference Compute with Repeated Sampling Paper • 2407.21787 • Published Jul 31, 2024 • 12
LLM Reasoning Papers Collection Papers to improve reasoning capabilities of LLMs • 17 items • Updated 9 days ago • 91
Post-Training Releases November 2024 Collection Includes papers with post-training sides from best open-models from November, including OpenCoder, SmolLM-v2, Orca Agent Instruct, Tülü 3 • 3 items • Updated Nov 23, 2024
Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published Nov 20, 2024 • 39