mohitsha
/

Llama-2-7b-chat-hf-AMMO-TRT

Model card Files Files and versions Community

mohitsha HF staff commited on Jun 25, 2024

Commit

b11cd75

•

1 Parent(s): 30cfe57

Update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

	@@ -1 +1,3 @@
1	- # LLama2 Model with FP8 KV Cache checkpoint for TRTLM


1	+ # LLama2 Model with FP8 KV Cache checkpoint for TRTLM
2	+
3	+ Generated using https://github.com/vllm-project/vllm/blob/main/examples/fp8/quantizer/quantize.py