Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published 19 days ago • 117
Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy Sep 18, 2024 • 215