Fishfishfishfishfish committed on
Commit f3e69d1 · verified · 1 Parent(s): 4b076dc

Update README.md

Files changed (1)
  1. README.md +3 -1
README.md CHANGED
@@ -4,4 +4,6 @@ language:
  - en
  base_model: google/gemma-2-2b-it
  ---
- Gemma 2 2B quantized for wllama (under 2gb).
+ Gemma 2 2B quantized for wllama (under 2gb).
+
+ q4_0_4_8 is WAY faster when using llama.cpp, with wllama, it's about the same as q4_k.