Gryphe/Pantheon-RP-1.6-12b-Nemo-KTO


Gryphe committed on Aug 31, 2024

Commit 6cb6d8d · verified · 1 Parent(s): a43cce9

Update README.md

Files changed (1)

README.md +6 -6


@@ -12,7 +12,9 @@ language:
 ---
 ![image/png](Pantheon.png)
 # Pantheon-RP-1.6-12b-Nemo-KTO
-Welcome to the next iteration of my Pantheon model series, in which I strive to introduce a whole collection of diverse personas that can be summoned with a simple activation phrase. The huge variety in personalities introduced also serve to enhance the general roleplay experience, helping to encompass personality traits and accents that language models might otherwise find difficult to convey well.
 **KTO Edition:** This is an improved version of 1.6 in which I applied KTO preference training to further refine, deslopify and diversify the model's responses. Moving forward, future versions will have this type of additional training by default but for now the two 1.6 versions will live side by side.
@@ -38,13 +40,11 @@ Just like 1.5, I used a multi-stage finetuning process as Mistral Nemo was provi
 ## Inference
-Nemo is a somewhat strange model when it comes to temperatures so I highly encourage you to experiment to see which works best.
 ```
-"temperature": 0.3-1.0,
 "repetition_penalty": 1.05,
-"top_p": 0.95
-"top_k": 40
-"min_p": 0.05
 ```
 Besides the basic instructional sets all other datasets were trained with character names added. Enable this at all times for an optimal experience.

 ---
 ![image/png](Pantheon.png)
 # Pantheon-RP-1.6-12b-Nemo-KTO
+Welcome to the next iteration of my Pantheon model series, in which I strive to introduce a whole collection of diverse personas that can be summoned with a simple activation phrase.
+Pantheon's purpose is two-fold, as these personalities similarly enhance the general roleplay experience, helping to encompass personality traits, accents and mannerisms that language models might otherwise find difficult to convey well.
 **KTO Edition:** This is an improved version of 1.6 in which I applied KTO preference training to further refine, deslopify and diversify the model's responses. Moving forward, future versions will have this type of additional training by default but for now the two 1.6 versions will live side by side.
 ## Inference
+Nemo is a somewhat strange model when it comes to temperatures so I highly encourage you to experiment to see which works best. Here's my current preset:
 ```
+"temperature": 0.8,
 "repetition_penalty": 1.05,
+"min_p": 0.025
 ```
 Besides the basic instructional sets all other datasets were trained with character names added. Enable this at all times for an optimal experience.
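
Since the README encourages experimenting with temperature, the updated preset can be kept as a small Python mapping and varied from there. This is only a sketch: the three values are copied from the preset in this commit, while the names `PANTHEON_PRESET` and `build_sampler_settings` are hypothetical, and the key names your inference backend expects may differ.

```python
import json

# Values copied from the preset in this commit. The key names follow the
# README as written; your backend may spell them differently (assumption).
PANTHEON_PRESET = {
    "temperature": 0.8,
    "repetition_penalty": 1.05,
    "min_p": 0.025,
}


def build_sampler_settings(temperature=None):
    """Return a copy of the preset, optionally overriding the temperature."""
    settings = dict(PANTHEON_PRESET)  # copy so the base preset stays untouched
    if temperature is not None:
        if temperature <= 0:
            raise ValueError("temperature must be positive")
        settings["temperature"] = temperature
    return settings


if __name__ == "__main__":
    # Emit one settings object per temperature to try out.
    for t in (0.5, 0.8, 1.0):
        print(json.dumps(build_sampler_settings(t)))
```

Copying the preset before overriding keeps the author's values available as a baseline while other temperatures are compared against it.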