Quantized 4-bit models Collection Large model quantized with post-quantization performance very close to the original models, allowing it to run on reasonable infrastructure. • 9 items • Updated Nov 14, 2024 • 1
Quantized 4-bit models Collection Large model quantized with post-quantization performance very close to the original models, allowing it to run on reasonable infrastructure. • 9 items • Updated Nov 14, 2024 • 1