"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization Paper • 2411.02355 • Published Nov 4, 2024 • 46
VisionZip: Longer is Better but Not Necessary in Vision Language Models Paper • 2412.04467 • Published Dec 5, 2024 • 105
Efficient Vision-Language Models by Summarizing Visual Tokens into Compact Registers Paper • 2410.14072 • Published Oct 17, 2024
FoPru: Focal Pruning for Efficient Large Vision-Language Models Paper • 2411.14164 • Published Nov 21, 2024