Update README.md
README.md
CHANGED
@@ -15,13 +15,14 @@ library_name: transformers
 ![image/webp](https://cdn-uploads.huggingface.co/production/uploads/630417380907b9a115c6aa9f/3hc8zt8fzKdO3qHK1p1mW.webp)
 
 Chronos Gold 12B 1.0 is a unique model that applies to domains such as
-general chatbot functionality, *roleplay*, and storywriting. The model has been observed to write up to 2250 tokens in a single sequence.
+general chatbot functionality, *roleplay*, and storywriting. The model has been observed to write up to 2250 tokens in a single sequence. The model was trained at a
+sequence length of 16384 (16k) and still retains the *apparent* 128k context length from Mistral-Nemo.
 
 The base model is `mistralai/Mistral-Nemo-Base-2407`, which was heavily modified to produce a more coherent model, comparable to much larger models.
 
 **Chronos Gold 12B-1.0** re-creates the uniqueness of the original Chronos with significantly enhanced prompt adherence (instruction following), coherence, a modern dataset, and support for a majority of "character card" formats in applications like SillyTavern.
 
-It went through an
+It went through an iterative and objective merge process, as with my previous models, and was further finetuned on a dataset curated for it.
 
 The specifics of the model will not be disclosed at this time due to dataset ownership.
 
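For reference, a minimal loading sketch with `transformers` (the card's stated `library_name`). The diff does not name the Hub repository, so the repo id below is an assumption for illustration; substitute the actual one.

```python
# Minimal usage sketch. The repo id is hypothetical (not stated in this diff);
# replace it with the model's actual Hub repository name.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "elinas/Chronos-Gold-12B-1.0"  # hypothetical repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # 12B parameters; bf16 roughly halves memory vs fp32
    device_map="auto",
)

# The card notes single responses of up to ~2250 tokens, so leave generous headroom.
prompt = "Write the opening scene of a slow-burn mystery set in a lighthouse."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=2048, do_sample=True, temperature=0.8)
print(tokenizer.decode(output[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```

Since training used a 16384-token sequence length, keeping prompt plus response within roughly that window is a reasonable default, even though the Mistral-Nemo base advertises a 128k context.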