satyanshu404 commited on
Commit
4d775ff
·
verified ·
1 Parent(s): 28da41a

End of training

Browse files
Files changed (3) hide show
  1. README.md +70 -0
  2. generation_config.json +7 -0
  3. model.safetensors +1 -1
README.md ADDED
@@ -0,0 +1,70 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - generated_from_trainer
4
+ metrics:
5
+ - rouge
6
+ model-index:
7
+ - name: gpt2-finetuned-justification-v3
8
+ results: []
9
+ ---
10
+
11
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
+ should probably proofread and complete it, then remove this comment. -->
13
+
14
+ # gpt2-finetuned-justification-v3
15
+
16
+ This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
17
+ It achieves the following results on the evaluation set:
18
+ - Loss: 0.2415
19
+ - Rouge1: 30.8957
20
+ - Rouge2: 13.5597
21
+ - Rougel: 22.4384
22
+ - Rougelsum: 28.2668
23
+
24
+ ## Model description
25
+
26
+ More information needed
27
+
28
+ ## Intended uses & limitations
29
+
30
+ More information needed
31
+
32
+ ## Training and evaluation data
33
+
34
+ More information needed
35
+
36
+ ## Training procedure
37
+
38
+ ### Training hyperparameters
39
+
40
+ The following hyperparameters were used during training:
41
+ - learning_rate: 5e-05
42
+ - train_batch_size: 2
43
+ - eval_batch_size: 2
44
+ - seed: 42
45
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
+ - lr_scheduler_type: linear
47
+ - num_epochs: 10
48
+
49
+ ### Training results
50
+
51
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
52
+ |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
53
+ | No log | 1.0 | 338 | 0.1980 | 30.0775 | 13.8145 | 22.3863 | 28.0341 |
54
+ | 0.226 | 2.0 | 676 | 0.1972 | 28.9676 | 13.7684 | 21.8084 | 26.6768 |
55
+ | 0.1594 | 3.0 | 1014 | 0.2007 | 29.8576 | 13.3727 | 22.1581 | 27.5726 |
56
+ | 0.1594 | 4.0 | 1352 | 0.2071 | 32.2090 | 13.7848 | 22.8787 | 29.0171 |
57
+ | 0.1259 | 5.0 | 1690 | 0.2146 | 28.5240 | 13.5821 | 21.4908 | 26.2550 |
58
+ | 0.1046 | 6.0 | 2028 | 0.2211 | 26.1623 | 13.1641 | 21.5936 | 25.0346 |
59
+ | 0.1046 | 7.0 | 2366 | 0.2294 | 28.7169 | 13.4858 | 21.1068 | 26.1213 |
60
+ | 0.0894 | 8.0 | 2704 | 0.2355 | 30.8957 | 13.5597 | 22.4384 | 28.2668 |
61
+ | 0.0785 | 9.0 | 3042 | 0.2398 | 30.8957 | 13.5597 | 22.4384 | 28.2668 |
62
+ | 0.0785 | 10.0 | 3380 | 0.2415 | 30.8957 | 13.5597 | 22.4384 | 28.2668 |
63
+
64
+
65
+ ### Framework versions
66
+
67
+ - Transformers 4.36.2
68
+ - Pytorch 2.2.2+cu121
69
+ - Datasets 2.16.0
70
+ - Tokenizers 0.15.2
generation_config.json ADDED
@@ -0,0 +1,7 @@
 
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token_id": 50256,
3
+ "decoder_start_token_id": 50256,
4
+ "eos_token_id": 50256,
5
+ "pad_token_id": 50256,
6
+ "transformers_version": "4.36.2"
7
+ }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8c5148ea2433780b8c2e470ffe9fd88a0652a95de98647252df8a72948961216
3
  size 1205925664
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0b146191b256077b0a0d8ddc6b253e0dd09b34c7458fd64ab69615bc65662035
3
  size 1205925664