AuriAetherwiing commited on
Commit
6522394
·
verified ·
1 Parent(s): 2a6e3e2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -12
README.md CHANGED
@@ -1,8 +1,7 @@
1
  ---
2
  base_model:
3
  - UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3
4
- - AuriAetherwiing/sunfall-g2-lora
5
- - UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3
6
  - allura-org/G2-9B-Sugarquill-v0
7
  library_name: transformers
8
  tags:
@@ -10,22 +9,26 @@ tags:
10
  - merge
11
 
12
  ---
13
- # Aletheia-5
14
 
15
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 
 
 
16
 
17
- ## Merge Details
18
- ### Merge Method
19
 
20
- This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3](https://huggingface.co/UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3) as a base.
21
 
22
- ### Models Merged
 
 
 
 
 
23
 
24
- The following models were included in the merge:
25
- * [UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3](https://huggingface.co/UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3) + [AuriAetherwiing/sunfall-g2-lora](https://huggingface.co/AuriAetherwiing/sunfall-g2-lora)
26
- * [allura-org/G2-9B-Sugarquill-v0](https://huggingface.co/allura-org/G2-9B-Sugarquill-v0)
27
 
28
- ### Configuration
29
 
30
  The following YAML configuration was used to produce this model:
31
 
 
1
  ---
2
  base_model:
3
  - UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3
4
+ - crestf411/gemma2-9B-sunfall-v0.5.2
 
5
  - allura-org/G2-9B-Sugarquill-v0
6
  library_name: transformers
7
  tags:
 
9
  - merge
10
 
11
  ---
12
+ # Gemma-2-9B Aletheia v1
13
 
14
+ A merge of Sugarquill and Sunfall. I wanted to combine Sugarquill's more novel-like writing style with something that would improve it's RP perfomance and make it more steerable, w/o adding superfluous synthetic writing patterns.
15
+ I quite like Crestfall's Sunfall models and I felt like Gemma version of Sunfall will steer the model in this direction when merged it. To keep more of Gemma-2-9B-it-SPPO-iter3's smarts, I've decided to apply Sunfall LoRA on top of it, instead of using the published Sunfall model.
16
+ I'm generally pleased with the result, this model has nice, fresh writing style, good charcard adherence and good system prompt following.
17
+ It still should work well for raw completion storywriting, as it's a trained feature in both merged models.
18
 
19
+ ### Format
 
20
 
21
+ Model responds to Gemma instruct formatting, exactly like it's base model.
22
 
23
+ ```
24
+ <bos><start_of_turn>user
25
+ {user message}<end_of_turn>
26
+ <start_of_turn>model
27
+ {response}<end_of_turn><eos>
28
+ ```
29
 
 
 
 
30
 
31
+ ### Mergekit config
32
 
33
  The following YAML configuration was used to produce this model:
34