cheonboy/shawgpt-ft

Files changed (3)

README.md CHANGED

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.8999
+- Loss: 1.7326
 ## Model description
@@ -51,22 +51,22 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 4.5943        | 0.9231 | 3    | 3.9667          |
-| 4.052         | 1.8462 | 6    | 3.4447          |
-| 3.4816        | 2.7692 | 9    | 3.0003          |
-| 2.2728        | 4.0    | 13   | 2.5840          |
-| 2.7079        | 4.9231 | 16   | 2.3495          |
-| 2.4026        | 5.8462 | 19   | 2.1672          |
-| 2.1944        | 6.7692 | 22   | 2.0317          |
-| 1.5774        | 8.0    | 26   | 1.9874          |
-| 2.0433        | 8.9231 | 29   | 1.9125          |
-| 1.3935        | 9.2308 | 30   | 1.8999          |
+| 4.5934        | 0.9231 | 3    | 3.9723          |
+| 4.0607        | 1.8462 | 6    | 3.4680          |
+| 3.4937        | 2.7692 | 9    | 3.0132          |
+| 2.2669        | 4.0    | 13   | 2.5644          |
+| 2.6584        | 4.9231 | 16   | 2.2894          |
+| 2.312         | 5.8462 | 19   | 2.0650          |
+| 2.0331        | 6.7692 | 22   | 1.8955          |
+| 1.4248        | 8.0    | 26   | 1.7767          |
+| 1.813         | 8.9231 | 29   | 1.7372          |
+| 1.2648        | 9.2308 | 30   | 1.7326          |
 ### Framework versions
 - PEFT 0.11.1
 - Transformers 4.41.2
-- Pytorch 2.1.0+cu121
+- Pytorch 2.3.0+cu121
 - Datasets 2.20.0
 - Tokenizers 0.19.1
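
To put the change in evaluation loss into more familiar terms, the sketch below converts the two final validation losses from the README diff into perplexity. It assumes the reported loss is a mean per-token cross-entropy in nats, which is typical for causal language model fine-tuning but is not stated in the README. The variable and function names are illustrative only.

```python
import math

# Final validation losses taken from the README diff above.
old_loss = 1.8999  # value removed by this commit
new_loss = 1.7326  # value added by this commit


def perplexity(loss: float) -> float:
    """Perplexity, ASSUMING `loss` is mean per-token cross-entropy in nats."""
    return math.exp(loss)


old_ppl = perplexity(old_loss)
new_ppl = perplexity(new_loss)

loss_delta = old_loss - new_loss            # absolute drop in loss
relative_ppl_drop = 1.0 - new_ppl / old_ppl  # fractional drop in perplexity

print(f"loss: {old_loss:.4f} -> {new_loss:.4f} (delta {loss_delta:.4f})")
print(f"perplexity: {old_ppl:.2f} -> {new_ppl:.2f} ({relative_ppl_drop:.1%} lower)")
```

Both runs are evaluated at the same steps and epochs, so the comparison is like for like. The README does not say whether the evaluation set was the same in both runs.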

runs/Jul11_04-31-39_7f32845c3f27/events.out.tfevents.1720672300.7f32845c3f27.1311.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:e4b49d60694c941bfdb00bd1330648b042b6dc48044e5e77a9c7ace17f1fe885
+size 10532
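
The three added lines are a Git LFS pointer, not the TensorBoard event file itself: the repository stores only the spec version, the content hash, and the byte size. A minimal sketch of reading such a pointer follows. `parse_lfs_pointer` is a hypothetical helper written for this example, not part of any Git LFS tooling, and it expects the pointer text with the leading diff `+` markers already stripped.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the space-separated key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each line is "<key> <value>"; split on the first space only.
        key, _, value = line.partition(" ")
        fields[key] = value

    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# Pointer content copied from the diff above, without the "+" markers.
pointer_text = """version https://git-lfs.github.com/spec/v1
oid sha256:e4b49d60694c941bfdb00bd1330648b042b6dc48044e5e77a9c7ace17f1fe885
size 10532
"""

pointer = parse_lfs_pointer(pointer_text)
print(pointer["hash_algo"], pointer["size"])
```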

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:06beb265eb91bdd418a5b7221dd2f8a53edb0aa066e3b4fddd52c1f13055b03d
+oid sha256:abf2686134fe1dbde703ba4d6e2021cb7031b95303e9c8ba60b4821dc6abe926
 size 5112