benjaminzwhite
commited on
doc: update README.md
Browse files- add ROUGE scores
- add qualitative subjective impressions of the summary quality
README.md
CHANGED
@@ -5,6 +5,13 @@ tags:
|
|
5 |
model-index:
|
6 |
- name: pegasus-samsum-nlp-with-transformers-ch06
|
7 |
results: []
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
8 |
---
|
9 |
|
10 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
@@ -12,21 +19,30 @@ should probably proofread and complete it, then remove this comment. -->
|
|
12 |
|
13 |
# pegasus-samsum-nlp-with-transformers-ch06
|
14 |
|
15 |
-
This model is a fine-tuned version of [google/pegasus-cnn_dailymail](https://huggingface.co/google/pegasus-cnn_dailymail) on
|
|
|
16 |
It achieves the following results on the evaluation set:
|
17 |
- Loss: 1.4839
|
18 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
19 |
## Model description
|
20 |
|
21 |
-
|
22 |
|
23 |
## Intended uses & limitations
|
24 |
|
25 |
-
|
26 |
|
27 |
## Training and evaluation data
|
28 |
|
29 |
-
|
30 |
|
31 |
## Training procedure
|
32 |
|
@@ -56,4 +72,4 @@ The following hyperparameters were used during training:
|
|
56 |
- Transformers 4.39.3
|
57 |
- Pytorch 2.1.2
|
58 |
- Datasets 2.18.0
|
59 |
-
- Tokenizers 0.15.2
|
|
|
5 |
model-index:
|
6 |
- name: pegasus-samsum-nlp-with-transformers-ch06
|
7 |
results: []
|
8 |
+
datasets:
|
9 |
+
- samsum
|
10 |
+
language:
|
11 |
+
- en
|
12 |
+
metrics:
|
13 |
+
- rouge
|
14 |
+
pipeline_tag: summarization
|
15 |
---
|
16 |
|
17 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
|
19 |
|
20 |
# pegasus-samsum-nlp-with-transformers-ch06
|
21 |
|
22 |
+
This model is a fine-tuned version of [google/pegasus-cnn_dailymail](https://huggingface.co/google/pegasus-cnn_dailymail) on the [SAMSum](https://huggingface.co/datasets/samsum) dataset.
|
23 |
+
|
24 |
It achieves the following results on the evaluation set:
|
25 |
- Loss: 1.4839
|
26 |
|
27 |
+
It achieves the following ROUGE scores on the test set:
|
28 |
+
- rouge1: 0.555556
|
29 |
+
- rouge2: 0.230769
|
30 |
+
- rougeL: 0.518519
|
31 |
+
- rougeLsum: 0.518519
|
32 |
+
|
33 |
+
**Quick human evaluation of summarization quality:** the results are generally good, after visual inspection of the summaries generated on test set conversations. However it seems some entities/attributions are incorrect (saw an example where model confuses peoples' roles in multi-person chat)
|
34 |
+
|
35 |
## Model description
|
36 |
|
37 |
+
PEGASUS doc can be found here: [https://huggingface.co/docs/transformers/model_doc/pegasus](https://huggingface.co/docs/transformers/model_doc/pegasus)
|
38 |
|
39 |
## Intended uses & limitations
|
40 |
|
41 |
+
This model was trained while studying the NLP With Transformers book; it is not intended to be used for any real applications.
|
42 |
|
43 |
## Training and evaluation data
|
44 |
|
45 |
+
The finetuning data is the SAMSum dataset only.
|
46 |
|
47 |
## Training procedure
|
48 |
|
|
|
72 |
- Transformers 4.39.3
|
73 |
- Pytorch 2.1.2
|
74 |
- Datasets 2.18.0
|
75 |
+
- Tokenizers 0.15.2
|