Fix q8 weights (use uint8 for q8; int8 produces poor results)

#18
by Xenova HF staff - opened
Hugging Face TB Research org
edited Nov 26, 2024

Slightly better, but not great. Will play around with other settings

Xenova changed pull request title from Upload fixed q8 ONNX models (reduce_range=True, per_channel=True) to Fix q8 weights (use uint8 for q8; int8 produces poor results)
Xenova changed pull request status to merged

Sign up or log in to comment