A Survey on Model Compression for Large Language Models Paper • 2308.07633 • Published Aug 15, 2023 • 3
A Survey on Efficient Inference for Large Language Models Paper • 2404.14294 • Published Apr 22, 2024 • 2
Model Compression and Efficient Inference for Large Language Models: A Survey Paper • 2402.09748 • Published Feb 15, 2024 • 1