
Request for quantized version

#2
by sudhir2016 - opened

A quantized version of the model that can be used for inference in a free-tier Google Colab notebook would be nice.

MaLA-LM org

Yes please. Will it work with `load_in_4bit=True`?
