matchaaaaa/MN-Tiramisu-12B

matchaaaaa committed on Oct 21, 2024

Commit 3e17757 · verified · 1 Parent(s): fad8774

Update README.md

Files changed (1)

README.md +24 -1

@@ -4,12 +4,35 @@ library_name: transformers
 tags:
 - mergekit
 - merge
 ---
+![cute](https://huggingface.co/matchaaaaa/MN-Tiramisu-12B/resolve/main/tiramisu-cute.png)
 # MN-Tiramisu-12B
 This is a really yappity-yappy yapping model that's good for long-form RP. Tried to rein it in with Mahou and give it some more character understanding with Pantheon. Feedback is always welcome.
+**Native Context Length: 16K/16384** *(can be extended using RoPE, YMMV)*
+## Prompt Template: Chat
+```
+<|im_start|>system
+{system prompt}<|im_end|>
+<|im_start|>user
+{message}<|im_end|>
+<|im_start|>assistant
+{response}
+```
+## Recommended Settings:
+Here are some settings ranges that tend to work for me. They aren't strict values, and there's a bit of leeway in them. Feel free to experiment a bit!
+* Temperature:        **1.0** (maybe less, a little bit goes a long way with Nemo)
+* Min-P:              **0.1** to **0.2**
+* *(all other samplers disabled)*
 ## Merge Details
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
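
The prompt template and sampler settings added in this commit can be sketched in plain Python. This is an illustrative sketch only, not code from the model repository: the helper names `build_chatml_prompt`, `apply_temperature`, and `min_p_filter` are hypothetical, and the order in which temperature and Min-P are applied is an assumption, since backends differ on this.

```python
import math


def build_chatml_prompt(system_prompt, messages):
    """Format a conversation using the ChatML layout shown in the README.

    `messages` is a list of (role, text) pairs, where role is "user" or
    "assistant". The prompt ends with an open assistant turn so the model
    writes the next response. (Hypothetical helper for illustration.)
    """
    parts = [f"<|im_start|>system\n{system_prompt}<|im_end|>\n"]
    for role, text in messages:
        parts.append(f"<|im_start|>{role}\n{text}<|im_end|>\n")
    parts.append("<|im_start|>assistant\n")
    return "".join(parts)


def apply_temperature(logits, temperature=1.0):
    """Turn logits into probabilities with a temperature-scaled softmax."""
    scaled = [value / temperature for value in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(value - peak) for value in scaled]
    total = sum(exps)
    return [value / total for value in exps]


def min_p_filter(probs, min_p=0.1):
    """Drop tokens whose probability is below min_p times the top probability.

    The surviving probabilities are renormalized so they sum to 1.
    """
    threshold = min_p * max(probs)
    kept = [p if p >= threshold else 0.0 for p in probs]
    total = sum(kept)
    return [p / total for p in kept]


if __name__ == "__main__":
    prompt = build_chatml_prompt(
        "You are a helpful narrator.",
        [("user", "Describe the tavern.")],
    )
    print(prompt)

    # Assumed order: temperature first, then Min-P (backends may differ).
    probs = apply_temperature([2.0, 1.0, 0.5, -1.0], temperature=1.0)
    print(min_p_filter(probs, min_p=0.1))
```

With Min-P in the recommended 0.1 to 0.2 range, a token survives only if it is at least 10% to 20% as likely as the most likely token, which is why the README can leave all other samplers disabled.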