KnutJaegersberg
commited on
Commit
·
7297c2c
1
Parent(s):
54de6ba
Update README.md
Browse files
README.md
CHANGED
@@ -3,17 +3,16 @@ license: other
|
|
3 |
license_name: yi-license
|
4 |
license_link: LICENSE
|
5 |
pipeline_tag: text-generation
|
|
|
|
|
6 |
---
|
7 |
|
8 |
This model has been llamafied and uses a llama tokenizer. I took it from https://huggingface.co/chargoddard/Yi-34B-Llama
|
9 |
-
|
10 |
-
Introduction
|
11 |
-
The Yi series models are large language models trained from scratch by developers at 01.AI. The first public release contains two bilingual(English/Chinese) base models with the parameter sizes of 6B(Yi-6B) and 34B(Yi-34B). Both of them are trained with 4K sequence length and can be extended to 32K during inference time.
|
12 |
|
13 |
License
|
14 |
The Yi series models are fully open for academic research and free commercial usage with permission via applications. All usage must adhere to the Model License Agreement 2.0. To apply for the official commercial license, please contact us ([email protected]).
|
15 |
|
16 |
-
Hmm I think these are already the merged weights of the Deacon, not just the base model. Try prompting it as fine tuned:
|
17 |
|
18 |
Prompt Example:
|
19 |
```
|
|
|
3 |
license_name: yi-license
|
4 |
license_link: LICENSE
|
5 |
pipeline_tag: text-generation
|
6 |
+
datasets:
|
7 |
+
- totally-not-an-llm/EverythingLM-data-V3
|
8 |
---
|
9 |
|
10 |
This model has been llamafied and uses a llama tokenizer. I took it from https://huggingface.co/chargoddard/Yi-34B-Llama
|
11 |
+
It's fine tuned on EverythingLM dataset for 5 epochs with NEFTune.
|
|
|
|
|
12 |
|
13 |
License
|
14 |
The Yi series models are fully open for academic research and free commercial usage with permission via applications. All usage must adhere to the Model License Agreement 2.0. To apply for the official commercial license, please contact us ([email protected]).
|
15 |
|
|
|
16 |
|
17 |
Prompt Example:
|
18 |
```
|