Update README.md
Browse files
README.md
CHANGED
@@ -8,7 +8,7 @@ inference: true
|
|
8 |
- 4-bit quantized
|
9 |
- Based on version 1.1
|
10 |
- Used PR "More accurate Q4_0 and Q4_1 quantizations #896" (should be closer in quality to unquantized)
|
11 |
-
- Uncensored variant is available, but it's based on version 1.0
|
12 |
- For q4_2, "Q4_2 ARM #1046" was used. Will update regularly if new changes are made.
|
13 |
- **Choosing between q4_0, q4_1, and q4_2:**
|
14 |
- 4_0 is the fastest. The quality is the poorest.
|
|
|
8 |
- 4-bit quantized
|
9 |
- Based on version 1.1
|
10 |
- Used PR "More accurate Q4_0 and Q4_1 quantizations #896" (should be closer in quality to unquantized)
|
11 |
+
- Uncensored variant is available, but it's based on version 1.0 (worse quality wise)
|
12 |
- For q4_2, "Q4_2 ARM #1046" was used. Will update regularly if new changes are made.
|
13 |
- **Choosing between q4_0, q4_1, and q4_2:**
|
14 |
- 4_0 is the fastest. The quality is the poorest.
|