KnutJaegersberg/Deacon-34B


KnutJaegersberg committed on Nov 15, 2023

Commit 7297c2c · 1 Parent(s): 54de6ba

Update README.md

Files changed (1)

README.md +3 -4

README.md CHANGED

@@ -3,17 +3,16 @@ license: other
 license_name: yi-license
 license_link: LICENSE
 pipeline_tag: text-generation
 ---
 This model has been llamafied and uses a llama tokenizer. I took it from https://huggingface.co/chargoddard/Yi-34B-Llama
-Introduction
-The Yi series models are large language models trained from scratch by developers at 01.AI. The first public release contains two bilingual(English/Chinese) base models with the parameter sizes of 6B(Yi-6B) and 34B(Yi-34B). Both of them are trained with 4K sequence length and can be extended to 32K during inference time.
 License
 The Yi series models are fully open for academic research and free commercial usage with permission via applications. All usage must adhere to the Model License Agreement 2.0. To apply for the official commercial license, please contact us ([email protected]).
-Hmm I think these are already the merged weights of the Deacon, not just the base model. Try prompting it as fine tuned:
 Prompt Example:
 ```

 license_name: yi-license
 license_link: LICENSE
 pipeline_tag: text-generation
+datasets:
+- totally-not-an-llm/EverythingLM-data-V3
 ---
 This model has been llamafied and uses a llama tokenizer. I took it from https://huggingface.co/chargoddard/Yi-34B-Llama
+It's fine tuned on EverythingLM dataset for 5 epochs with NEFTune.
 License
 The Yi series models are fully open for academic research and free commercial usage with permission via applications. All usage must adhere to the Model License Agreement 2.0. To apply for the official commercial license, please contact us ([email protected]).
 Prompt Example:
 ```