DavidAU
/

DarkSapling-V1.1-Ultra-Quality-7B-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

DavidAU commited on Jun 3, 2024

Commit

e49d39c

·

verified ·

1 Parent(s): a1b1fd6

Create README.md

Files changed (1) hide show

README.md +44 -0

README.md ADDED Viewed

	@@ -0,0 +1,44 @@

+---
+license: apache-2.0
+language:
+- en
+tags:
+- creative
+- story
+- roleplay
+- rp
+- 32 bit upscale
+- remastered
+- writing
+---
+<h3><font color="red"> Dark Sapling V1.1 7B - 32k Context - Ultra Quality - 32 bit upscale.</font></h3>
+Complete remerge, and remaster of the incredible Dark Sapling V1 7B - 32k Context from source files.
+Registering an impressive drop of 240 points (lower is better) at Q4KM.
+This puts "Q4KM" operating at "Q6" levels, and further elevates Q6 and Q8 as well.
+Likewise, even Q2K (smallest quant) will operate at much higher levels than it's original source counterpart.
+<B>RESULTS:</b>
+The result is superior performance in instruction following, reasoning, depth, nuance and emotion.
+Reduction in prompt size, as it understands nuance better.
+And as a side effect more context available for output due to reduction in prompt size.
+Note that there will be an outsized difference between quants especially for creative and/or "no right answer" use cases.
+Because of this it is suggested to download the highest quant you can operate, and it's closest neighbours so to speak.
+IE: Q4KS, Q4KM, Q5KS as an example.
+Imatrix Plus versions to be uploaded at a separate repo shortly.
+Special thanks to "TEEZEE" the original model creator:
+[ https://huggingface.co/TeeZee/DarkSapling-7B-v1.1 ]
+NOTE: Version 1 and Version 2 are also remastered.