Updated README.md with a description of the available branches.
Original model: https://huggingface.co/yentinglin/Taiwan-LLM-13B-v2.0-chat
This is a quantized model from [yentinglin/Taiwan-LLM-13B-v2.0-chat](https://huggingface.co/yentinglin/Taiwan-LLM-13B-v2.0-chat) in exl2 format.
You are currently on the [main](https://huggingface.co/kennylam/Taiwan-LLM-13B-v2.0-chat-exl2/tree/main) branch, which provides only the [measurement.json](measurement.json) file used for the ExLlamaV2 quantization. Please pick a quantization from the branches listed below.
- [8.0bpw-h8](/kennylam/Taiwan-LLM-13B-v2.0-chat-exl2/tree/8.0bpw-h8): 8 bits per weight.
- [6.0bpw-h6](/kennylam/Taiwan-LLM-13B-v2.0-chat-exl2/tree/6.0bpw-h6): 6 bits per weight.
- [4.0bpw-h6](/kennylam/Taiwan-LLM-13B-v2.0-chat-exl2/tree/4.0bpw-h6): 4 bits per weight.
- [3.0bpw-h6](/kennylam/Taiwan-LLM-13B-v2.0-chat-exl2/tree/3.0bpw-h6): 3 bits per weight.
- [2.0bpw-h6](/kennylam/Taiwan-LLM-13B-v2.0-chat-exl2/tree/2.0bpw-h6): 2 bits per weight.
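To fetch one of these branches locally, a minimal sketch using the `huggingface_hub` Python package is shown below; the branch name (`6.0bpw-h6`) and target directory are example values, not requirements.

```python
# Sketch: download a single quantization branch of this repo.
# Assumes `pip install huggingface_hub`; adjust revision/local_dir as needed.
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="kennylam/Taiwan-LLM-13B-v2.0-chat-exl2",
    revision="6.0bpw-h6",  # branch with the desired bits per weight
    local_dir="Taiwan-LLM-13B-v2.0-chat-exl2-6.0bpw-h6",
)
```

Cloning a single branch with `git clone --single-branch --branch 6.0bpw-h6 https://huggingface.co/kennylam/Taiwan-LLM-13B-v2.0-chat-exl2` should also work, provided git-lfs is installed.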
## Citation
If you find Taiwan LLM useful in your work, please cite it with: