sk-2302 committed on
Commit 0f3ea97 · 1 Parent(s): cbf941e

End of training

README.md CHANGED
@@ -22,7 +22,7 @@ model-index:
  metrics:
  - name: Rouge1
  type: rouge
- value: 42.6528
+ value: 42.6
  ---
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,12 +32,12 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on the samsum dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.6754
- - Rouge1: 42.6528
- - Rouge2: 18.3481
- - Rougel: 35.2388
- - Rougelsum: 38.9352
- - Gen Len: 16.8474
+ - Loss: 1.6729
+ - Rouge1: 42.6
+ - Rouge2: 18.7153
+ - Rougel: 35.4138
+ - Rougelsum: 38.8543
+ - Gen Len: 16.9170
 
  ## Model description
 
@@ -57,8 +57,8 @@ More information needed
 
  The following hyperparameters were used during training:
  - learning_rate: 5e-05
- - train_batch_size: 52
- - eval_batch_size: 52
+ - train_batch_size: 32
+ - eval_batch_size: 32
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
@@ -69,11 +69,15 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
- | 1.8824 | 0.35 | 100 | 1.7015 | 42.4522 | 18.3268 | 35.0509 | 38.8425 | 16.6532 |
- | 1.8578 | 0.7 | 200 | 1.6878 | 41.9777 | 18.2479 | 34.9247 | 38.5155 | 16.7216 |
- | 1.835 | 1.06 | 300 | 1.6823 | 42.727 | 18.6215 | 35.3788 | 39.0045 | 16.9048 |
- | 1.8144 | 1.41 | 400 | 1.6786 | 42.6033 | 18.4035 | 35.2884 | 38.9211 | 16.6618 |
- | 1.8094 | 1.76 | 500 | 1.6754 | 42.6528 | 18.3481 | 35.2388 | 38.9352 | 16.8474 |
+ | 1.8863 | 0.22 | 100 | 1.7049 | 42.0859 | 18.0002 | 34.7349 | 38.3446 | 16.5788 |
+ | 1.8463 | 0.43 | 200 | 1.6947 | 42.4056 | 18.3005 | 34.9821 | 38.8013 | 17.3614 |
+ | 1.8548 | 0.65 | 300 | 1.6792 | 42.585 | 18.5643 | 35.2235 | 38.8298 | 17.1514 |
+ | 1.8358 | 0.87 | 400 | 1.6772 | 42.1544 | 18.2303 | 34.8971 | 38.3609 | 16.5873 |
+ | 1.8129 | 1.08 | 500 | 1.6729 | 42.6 | 18.7153 | 35.4138 | 38.8543 | 16.9170 |
+ | 1.8068 | 1.3 | 600 | 1.6709 | 42.5217 | 18.3285 | 35.1455 | 38.5954 | 16.9451 |
+ | 1.7973 | 1.52 | 700 | 1.6687 | 42.8667 | 18.624 | 35.3429 | 38.9322 | 16.7546 |
+ | 1.7979 | 1.74 | 800 | 1.6668 | 42.919 | 18.7388 | 35.4528 | 39.0561 | 16.8791 |
+ | 1.7899 | 1.95 | 900 | 1.6670 | 43.0931 | 18.741 | 35.5047 | 39.2321 | 16.9109 |
 
 
  ### Framework versions
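The Epoch column in both versions of the training table follows from the step count and the batch size. The sketch below shows that arithmetic under two assumptions that this commit does not state: the samsum training split holds 14,732 examples, and training ran on a single device without gradient accumulation. The helper name `epoch_at_step` is hypothetical.

```python
import math

# Assumption: size of the samsum training split (not stated in this commit).
TRAIN_EXAMPLES = 14_732


def epoch_at_step(step: int, batch_size: int, n_examples: int = TRAIN_EXAMPLES) -> float:
    """Return the fractional epoch reached after `step` optimizer steps.

    Assumes one device and no gradient accumulation, so one step
    consumes exactly `batch_size` examples.
    """
    steps_per_epoch = math.ceil(n_examples / batch_size)
    return round(step / steps_per_epoch, 2)


if __name__ == "__main__":
    # Old run: train_batch_size 52, evaluated every 100 steps up to step 500.
    print(52, [epoch_at_step(s, 52) for s in range(100, 600, 100)])
    # New run: train_batch_size 32, evaluated every 100 steps up to step 900.
    print(32, [epoch_at_step(s, 32) for s in range(100, 1000, 100)])
```

Under those assumptions, the smaller batch size means more optimizer steps per epoch, which is why the new table has nine evaluation rows over roughly the same two epochs where the old one had five.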
logs/events.out.tfevents.1703012981.1dd062b0e6e2.43.2 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:e27c24ec87f2664d023f0685925ddd753ffc76aa31e8367433227dad9b7c88cc
- size 8785
+ oid sha256:1b58cea332b58ab9c397d285444d91bd576cfb4cb480e93320b1dd7328bbea51
+ size 11867
logs/events.out.tfevents.1703013989.1dd062b0e6e2.43.3 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:8773f0397561a92f47a1ae028e3bf0e13378fc75b861bf181c16614aa8abbbf9
+ size 613
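The two log files in this commit are stored as Git LFS pointers, so the diff shows only a `version`, `oid`, and `size` line for each rather than the TensorBoard event data itself. A minimal sketch of reading those fields follows; `parse_lfs_pointer` is a hypothetical helper written for illustration, not part of Git LFS or any Hugging Face library.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each pointer line is "<key> <value>", separated by one space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# Pointer content of the newly added event file, copied from the diff above.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:8773f0397561a92f47a1ae028e3bf0e13378fc75b861bf181c16614aa8abbbf9
size 613
"""

if __name__ == "__main__":
    print(parse_lfs_pointer(POINTER))
```

The `size` field is the byte size of the tracked file, so the change from 8785 to 11867 in the first log reflects a larger event file, consistent with the additional evaluation steps recorded in the README table.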