ollama pull Tohur/natsumura-storytelling-rp-llama-3.1
- tdh87/Just-stories
- tdh87/Just-stories-2

The following parameters were used in [Llama Factory](https://github.com/hiyouga/LLaMA-Factory) during training:

- per_device_train_batch_size=2
- gradient_accumulation_steps=4
- lr_scheduler_type="cosine"
- logging_steps=10
- warmup_ratio=0.1
- save_steps=1000
- learning_rate=2e-5
- num_train_epochs=3.0
- max_samples=500
- max_grad_norm=1.0
- quantization_bit=4
- loraplus_lr_ratio=16.0
- fp16=True
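For reference, these options map onto a LLaMA-Factory training YAML roughly as sketched below. This is an illustration only: the model path, dataset names, and output directory are placeholders I have filled in, not values confirmed by this README, and the LoRA/SFT stage settings are assumptions based on LLaMA-Factory's usual fine-tuning setup.

```yaml
### model (placeholder; substitute the actual base model)
model_name_or_path: meta-llama/Meta-Llama-3.1-8B-Instruct
quantization_bit: 4

### method (assumed LoRA SFT run)
stage: sft
do_train: true
finetuning_type: lora
loraplus_lr_ratio: 16.0

### dataset (placeholder names; datasets must be registered in dataset_info.json)
dataset: just_stories,just_stories_2
max_samples: 500

### output (placeholder path)
output_dir: saves/llama3.1/lora/sft
logging_steps: 10
save_steps: 1000

### train
per_device_train_batch_size: 2
gradient_accumulation_steps: 4
learning_rate: 2.0e-5
num_train_epochs: 3.0
lr_scheduler_type: cosine
warmup_ratio: 0.1
max_grad_norm: 1.0
fp16: true
```

LLaMA-Factory runs a file like this with `llamafactory-cli train <config>.yaml`.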
## Inference

I use the following settings for inference: