Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,44 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
language:
|
4 |
+
- en
|
5 |
+
tags:
|
6 |
+
- creative
|
7 |
+
- story
|
8 |
+
- roleplay
|
9 |
+
- rp
|
10 |
+
- 32 bit upscale
|
11 |
+
- remastered
|
12 |
+
- writing
|
13 |
+
---
|
14 |
+
<h3><font color="red"> Dark Sapling V1.1 7B - 32k Context - Ultra Quality - 32 bit upscale.</font></h3>
|
15 |
+
|
16 |
+
Complete remerge, and remaster of the incredible Dark Sapling V1 7B - 32k Context from source files.
|
17 |
+
|
18 |
+
Registering an impressive drop of 240 points (lower is better) at Q4KM.
|
19 |
+
|
20 |
+
This puts "Q4KM" operating at "Q6" levels, and further elevates Q6 and Q8 as well.
|
21 |
+
|
22 |
+
Likewise, even Q2K (smallest quant) will operate at much higher levels than it's original source counterpart.
|
23 |
+
|
24 |
+
<B>RESULTS:</b>
|
25 |
+
|
26 |
+
The result is superior performance in instruction following, reasoning, depth, nuance and emotion.
|
27 |
+
|
28 |
+
Reduction in prompt size, as it understands nuance better.
|
29 |
+
|
30 |
+
And as a side effect more context available for output due to reduction in prompt size.
|
31 |
+
|
32 |
+
Note that there will be an outsized difference between quants especially for creative and/or "no right answer" use cases.
|
33 |
+
|
34 |
+
Because of this it is suggested to download the highest quant you can operate, and it's closest neighbours so to speak.
|
35 |
+
|
36 |
+
IE: Q4KS, Q4KM, Q5KS as an example.
|
37 |
+
|
38 |
+
Imatrix Plus versions to be uploaded at a separate repo shortly.
|
39 |
+
|
40 |
+
Special thanks to "TEEZEE" the original model creator:
|
41 |
+
|
42 |
+
[ https://huggingface.co/TeeZee/DarkSapling-7B-v1.1 ]
|
43 |
+
|
44 |
+
NOTE: Version 1 and Version 2 are also remastered.
|