Update README.md
Browse files
README.md
CHANGED
@@ -45,6 +45,17 @@ This is the third iteration "Emerald" of the final four and the one I liked the
|
|
45 |
|
46 |
Findings: o_proj and down_proj can be stolen from Aethora-v2 so new 15B frankenmerges don't seem to really ***need*** finetuning to "heal" the layers, though it has a big influence on the output so it's slightly censored in one-shot.
|
47 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
48 |
# <h1>merge</h1>
|
49 |
|
50 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
|
|
45 |
|
46 |
Findings: o_proj and down_proj can be stolen from Aethora-v2 so new 15B frankenmerges don't seem to really ***need*** finetuning to "heal" the layers, though it has a big influence on the output so it's slightly censored in one-shot.
|
47 |
|
48 |
+
# <p>Samplers</p>
|
49 |
+
```bash
|
50 |
+
I found success with:
|
51 |
+
temperature 0.9
|
52 |
+
min_p 0.08
|
53 |
+
tfs 0.97
|
54 |
+
smoothing_factor 0.3
|
55 |
+
smoothing_curve 1.1
|
56 |
+
|
57 |
+
```
|
58 |
+
|
59 |
# <h1>merge</h1>
|
60 |
|
61 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|