End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.5280
 ## Model description
@@ -44,7 +44,7 @@ The following hyperparameters were used during training:
 - total_eval_batch_size: 8
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- training_steps: 1100
 ### Training results
@@ -90,10 +90,20 @@ The following hyperparameters were used during training:
 | 2.7515        | 0.15  | 950  | 2.6994          |
 | 2.7365        | 0.16  | 975  | 2.6933          |
 | 2.7635        | 0.16  | 1000 | 2.6882          |
-| 2.7881        | 0.16  | 1025 | 2.6844          |
-| 2.7033        | 0.17  | 1050 | 2.6783          |
-| 2.7138        | 0.17  | 1075 | 2.6728          |
-| 2.643         | 0.18  | 1100 | 2.6683          |
 ### Framework versions

 This model is a fine-tuned version of [meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.4855
 ## Model description
 - total_eval_batch_size: 8
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- training_steps: 1350
 ### Training results
 | 2.7515        | 0.15  | 950  | 2.6994          |
 | 2.7365        | 0.16  | 975  | 2.6933          |
 | 2.7635        | 0.16  | 1000 | 2.6882          |
+| 2.7883        | 0.16  | 1025 | 2.6841          |
+| 2.7032        | 0.17  | 1050 | 2.6782          |
+| 2.714         | 0.17  | 1075 | 2.6728          |
+| 2.6427        | 0.18  | 1100 | 2.6684          |
+| 2.6727        | 0.18  | 1125 | 2.6644          |
+| 2.7536        | 0.18  | 1150 | 2.6593          |
+| 2.7379        | 0.19  | 1175 | 2.6547          |
+| 2.5601        | 0.19  | 1200 | 2.6500          |
+| 2.6281        | 0.2   | 1225 | 2.6461          |
+| 2.6526        | 0.2   | 1250 | 2.6421          |
+| 2.724         | 0.2   | 1275 | 2.6388          |
+| 2.6527        | 0.21  | 1300 | 2.6347          |
+| 2.5999        | 0.21  | 1325 | 2.6305          |
+| 2.525         | 0.22  | 1350 | 2.6270          |
 ### Framework versions

model-00001-of-00003.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cf8307e5197d314fe4cda7e6fdbad5dea87b51b05d508284821d124247f02a26
 size 4938985352

 version https://git-lfs.github.com/spec/v1
+oid sha256:629f62061531135f9dbb47cc43678a8bfffaa7d41343d1e83c60696643d8e9e5
 size 4938985352

model-00002-of-00003.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2c185ab205448ec2ee849c93c06489b9b140e129e82e0dfeb285bbc520117ac4
 size 4947390880

 version https://git-lfs.github.com/spec/v1
+oid sha256:b8d44f6919844c2978a337a236f5b0fd18b8881d6f9d91d8b4e9163306cc664d
 size 4947390880

model-00003-of-00003.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:22e5a2b5d67140a5ff9cbbef4f9b05f1733192d3980d536a53b75a5695183fe7
 size 3590488816

 version https://git-lfs.github.com/spec/v1
+oid sha256:d04347c5c33b551d63099291d3f74e90b2752ae99c7a59b347d6f766c68b22a9
 size 3590488816