Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy Sep 18, 2024 • 216
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model Paper • 2409.01704 • Published Sep 3, 2024 • 83
Article Crazy Challenge: Run Llama 405B on a 8GB VRAM GPU By lyogavin • Aug 2, 2024 • 10
Article Falcon 2: An 11B parameter pretrained language model and VLM, trained on over 5000B tokens and 11 languages May 24, 2024 • 25