Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
mohitsha
/
Llama-2-7b-chat-hf-AMMO-TRT
like
0
Model card
Files
Files and versions
Community
mohitsha
HF staff
commited on
Jun 25, 2024
Commit
b11cd75
•
1 Parent(s):
30cfe57
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+3
-1
README.md
CHANGED
Viewed
@@ -1 +1,3 @@
1
-
# LLama2 Model with FP8 KV Cache checkpoint for TRTLM
1
+
# LLama2 Model with FP8 KV Cache checkpoint for TRTLM
2
+
3
+
Generated using https://github.com/vllm-project/vllm/blob/main/examples/fp8/quantizer/quantize.py