Jotschi
/

Mistral-7B-v0.1-coco-caption-de

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Jotschi commited on Mar 22, 2024

Commit

081f106

·

1 Parent(s): d6af853

Update readme

Files changed (1) hide show

README.md +6 -2

README.md CHANGED Viewed

@@ -32,7 +32,7 @@ fall und Felsen vor dem Gebäude mit Blick auf den Fluss.
 - **Developed by:** [Jotschi](https://huggingface.co/Jotschi)
 - **License:** [Apache License](https://www.apache.org/licenses/LICENSE-2.0)
-- **Finetuned from model [optional]:** [Mistral7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1)
 ## Uses
@@ -52,7 +52,11 @@ The model was trained using PEFT 4Bit Q-LoRA with the following parameters:
 * rank: 256
 * alpha: 16
-* gradient accumulation steps: 8
 * batch size: 4
 * Input sequence length: 512
 * Learning Rate: 2.0e-5

 - **Developed by:** [Jotschi](https://huggingface.co/Jotschi)
 - **License:** [Apache License](https://www.apache.org/licenses/LICENSE-2.0)
+- **Finetuned from model:** [Mistral7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1)
 ## Uses
 * rank: 256
 * alpha: 16
+* steps: 8500
+* bf16: True
+* lr_scheduler_type: cosine
+* warmup_ratio: 0.03
+* gradient accumulation steps: 2
 * batch size: 4
 * Input sequence length: 512
 * Learning Rate: 2.0e-5