End of training

Files changed (6)

README.md CHANGED

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/Phi-3-mini-128k-instruct](https://huggingface.co/microsoft/Phi-3-mini-128k-instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.1499
 ## Model description
@@ -53,11 +53,11 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 2.0996        | 0.4   | 2    | 2.2203          |
-| 2.1578        | 0.8   | 4    | 2.1806          |
-| 1.9713        | 1.2   | 6    | 2.1613          |
-| 2.067         | 1.6   | 8    | 2.1525          |
-| 1.8717        | 2.0   | 10   | 2.1499          |
 ### Framework versions

 This model is a fine-tuned version of [microsoft/Phi-3-mini-128k-instruct](https://huggingface.co/microsoft/Phi-3-mini-128k-instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.5556
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 2.4798        | 0.4   | 2    | 2.6784          |
+| 2.5949        | 0.8   | 4    | 2.6066          |
+| 2.3319        | 1.2   | 6    | 2.5751          |
+| 2.5069        | 1.6   | 8    | 2.5597          |
+| 2.2803        | 2.0   | 10   | 2.5556          |
 ### Framework versions
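A quick way to read the README.md hunks above is to compare the two validation-loss columns step by step. The sketch below only redoes that arithmetic on the numbers copied from the removed and added rows. The variable names are illustrative. The diff does not say whether both runs used the same data or settings, so the gap should be read as a difference between two reported numbers, not as a controlled comparison.

```python
# Validation losses copied from the removed (-) and added (+) table rows above.
steps = [2, 4, 6, 8, 10]
old_val_loss = [2.2203, 2.1806, 2.1613, 2.1525, 2.1499]
new_val_loss = [2.6784, 2.6066, 2.5751, 2.5597, 2.5556]

# Per-step gap between the new and the old run.
deltas = [new - old for old, new in zip(old_val_loss, new_val_loss)]
for step, delta in zip(steps, deltas):
    print(f"step {step:>2}: {delta:+.4f}")

# Change in the final reported evaluation loss, absolute and relative.
absolute_change = new_val_loss[-1] - old_val_loss[-1]
relative_change = absolute_change / old_val_loss[-1]
print(f"final eval loss: {absolute_change:+.4f} ({relative_change:+.2%})")
```

In both runs the validation loss decreases at every logged step, and the new run reports a higher value at each of them.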

adapter_config.json CHANGED

@@ -22,8 +22,8 @@
   "target_modules": [
     "v_proj",
     "q_proj",
-    "o_proj",
-    "k_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "target_modules": [
     "v_proj",
     "q_proj",
+    "k_proj",
+    "o_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,
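The adapter_config.json hunk above changes only the order of the last two entries in `target_modules`. The check below confirms that the old and new lists name the same four modules. It assumes that the order of `target_modules` carries no meaning, which is how such a list is normally used (membership matching on module names) but is not stated in the diff itself.

```python
# target_modules as shown on the old (-) and new (+) side of the hunk above.
old_target_modules = ["v_proj", "q_proj", "o_proj", "k_proj"]
new_target_modules = ["v_proj", "q_proj", "k_proj", "o_proj"]

# Same members, different serialization order.
same_members = set(old_target_modules) == set(new_target_modules)
same_order = old_target_modules == new_target_modules

print(f"same members: {same_members}")
print(f"same order:   {same_order}")
```

If the assumption holds, this hunk is a serialization difference and not a change to which projection layers the adapter targets.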

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:899d386e4d3e5c7392b2b6e38e49b2e01b932405f9bcaaa565cfa653c07322ec
 size 12591456

 version https://git-lfs.github.com/spec/v1
+oid sha256:192c22047f638b7bd795006bcb9052d96a113e2118b11dcb925514d672a00438
 size 12591456
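The adapter_model.safetensors entry above is a Git LFS pointer, a small text file of `key value` lines, so the diff shows only the pointer and not the weights. A minimal sketch of reading the two pointers follows; `parse_lfs_pointer` is a hypothetical helper written for this note, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text):
    """Split a Git LFS pointer into a dict of its 'key value' lines."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


# Pointer contents copied from the old (-) and new (+) side of the hunk above.
old_pointer = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:899d386e4d3e5c7392b2b6e38e49b2e01b932405f9bcaaa565cfa653c07322ec
size 12591456
""")
new_pointer = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:192c22047f638b7bd795006bcb9052d96a113e2118b11dcb925514d672a00438
size 12591456
""")

content_changed = old_pointer["oid"] != new_pointer["oid"]
size_unchanged = old_pointer["size"] == new_pointer["size"]
print(f"content changed: {content_changed}, size unchanged: {size_unchanged}")
```

A different `oid` with an identical `size` means the file contents changed while its byte length did not, which is what one would expect from retrained weights stored in tensors of the same shapes.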

runs/Jun01_10-27-47_2303b76e0112/events.out.tfevents.1717237668.2303b76e0112.20343.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:228a9c6ecf5c84881390e2a46c812380b7b3eb4889ea7adc881ddfab93851a16
+size 11881

tokenizer.json CHANGED

@@ -1,6 +1,11 @@
 {
   "version": "1.0",
-  "truncation": null,
   "padding": null,
   "added_tokens": [
     {

 {
   "version": "1.0",
+  "truncation": {
+    "direction": "Right",
+    "max_length": 128,
+    "strategy": "LongestFirst",
+    "stride": 0
+  },
   "padding": null,
   "added_tokens": [
     {
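The tokenizer.json hunk above replaces `"truncation": null` with a block that truncates on the right to `max_length` 128 using the `LongestFirst` strategy and a `stride` of 0. The sketch below illustrates what those settings mean on plain lists of token ids. It is not the tokenizer library's implementation: `truncate_longest_first` is a hypothetical helper, special tokens are ignored, and the tie-breaking rule for equally long sequences is an arbitrary choice made here.

```python
def truncate_longest_first(first, second=None, max_length=128):
    """Drop tokens from the right of the longer sequence until the total fits.

    Illustrative only: ignores special tokens and uses an arbitrary
    tie-break (the first sequence is trimmed when lengths are equal).
    """
    first = list(first)
    second = list(second) if second is not None else []
    while len(first) + len(second) > max_length:
        if len(first) >= len(second):
            first.pop()   # direction "Right": remove from the end
        else:
            second.pop()
    return first, second


# A single sequence of 200 ids keeps only its first 128 ids.
single, _ = truncate_longest_first(range(200))
print(len(single), single[0], single[-1])

# A pair of 100 + 60 ids is trimmed from the longer side until the total is 128.
a, b = truncate_longest_first(range(100), range(60))
print(len(a), len(b))
```

With `stride` set to 0, no overlapping overflow windows would be produced, so anything beyond the limit is simply dropped in this sketch.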

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:08ab11c6985f9b623509d624b9dceb862ebec5d2c2b4d2782dfd3ddf62255fdc
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:8fa946ac499622d1737b53c65feedf16d367bf2ffeb4dcffff1f8a38fb2683e5
 size 5112