KnutJaegersberg committed on
Commit
7297c2c
·
1 Parent(s): 54de6ba

Update README.md

Files changed (1)
  1. README.md +3 -4
README.md CHANGED
@@ -3,17 +3,16 @@ license: other
 license_name: yi-license
 license_link: LICENSE
 pipeline_tag: text-generation
+datasets:
+- totally-not-an-llm/EverythingLM-data-V3
 ---
 
 This model has been llamafied and uses a llama tokenizer. I took it from https://huggingface.co/chargoddard/Yi-34B-Llama
-
-Introduction
-The Yi series models are large language models trained from scratch by developers at 01.AI. The first public release contains two bilingual(English/Chinese) base models with the parameter sizes of 6B(Yi-6B) and 34B(Yi-34B). Both of them are trained with 4K sequence length and can be extended to 32K during inference time.
+It's fine tuned on EverythingLM dataset for 5 epochs with NEFTune.
 
 License
 The Yi series models are fully open for academic research and free commercial usage with permission via applications. All usage must adhere to the Model License Agreement 2.0. To apply for the official commercial license, please contact us ([email protected]).
 
-Hmm I think these are already the merged weights of the Deacon, not just the base model. Try prompting it as fine tuned:
 
 Prompt Example:
 ```