Update README.md
README.md
CHANGED
@@ -1,8 +1,25 @@
|
|
1 |
---
|
2 |
datasets:
|
3 |
- ccdv/arxiv-summarization
|
4 |
-
|
5 |
-
-
6 |
---
|
7 |
## distilbart-sum-arxiv
|
8 |
This model is a fine-tuned version of [sshleifer/distilbart-xsum-12-6](https://huggingface.co/sshleifer/distilbart-xsum-12-6) on a subset of the [ccdv/arxiv-summarization dataset](https://huggingface.co/datasets/ccdv/arxiv-summarization). It achieves the following results on the evaluation set:
|
@@ -13,7 +30,7 @@ This model is a fine-tuned version of [sshleifer/distilbart-xsum-12-6](https://h
|
|
13 |
* RougeLSum: 24.260
|
14 |
|
15 |
## Model description
|
16 |
-
This model is a distilled version of BART with 306M parameters (vs. 406M for the BART model).
|
17 |
|
18 |
## Intended uses & limitations
|
19 |
Since this model has been trained on scientific papers, it may perform poorly when attempting to summarize other types of content.
|
|
|
1 |
---
|
2 |
datasets:
|
3 |
- ccdv/arxiv-summarization
|
4 |
+
model-index:
|
5 |
+
- name: distilbart-sum-arxiv
|
6 |
+
results:
|
7 |
+
- task:
|
8 |
+
type: summarization
|
9 |
+
name: Summarization
|
10 |
+
dataset:
|
11 |
+
name: arxiv-summarization
|
12 |
+
type: arxiv-summarization
|
13 |
+
metrics:
|
14 |
+
- type: rouge-1
|
15 |
+
value: 42.1856
|
16 |
+
name: Validation ROUGE-1
|
17 |
+
- type: rouge-2
|
18 |
+
value: 15.4815
|
19 |
+
name: Validation ROUGE-2
|
20 |
+
- type: rouge-l
|
21 |
+
value: 24.4409
|
22 |
+
name: Validation ROUGE-L
|
23 |
---
|
24 |
## distilbart-sum-arxiv
|
25 |
This model is a fine-tuned version of [sshleifer/distilbart-xsum-12-6](https://huggingface.co/sshleifer/distilbart-xsum-12-6) on a subset of the [ccdv/arxiv-summarization dataset](https://huggingface.co/datasets/ccdv/arxiv-summarization). It achieves the following results on the evaluation set:
|
|
|
30 |
* RougeLSum: 24.260
|
31 |
|
32 |
## Model description
|
33 |
+
This model is a distilled version of BART with 306M parameters (vs. 406M for the BART model) and is 1.68 times faster than BART at inference. It has been trained on 60,000 samples and has an input limit of 1024 tokens.
|
34 |
|
35 |
## Intended uses & limitations
|
36 |
Since this model has been trained on scientific papers, it may perform poorly when attempting to summarize other types of content.
|
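The model description above mentions a limit of 1024 tokens, so a full paper usually has to be truncated or split before summarization. The following is a minimal sketch of one way to split a long text into windows that fit under that limit. The helper name `split_into_windows` and the whitespace tokenizer are assumptions made for illustration; they are not part of this repository, and the model's real subword tokenizer will typically produce more tokens per word than whitespace splitting does.

```python
from typing import Callable, List

# Input limit stated in the model description.
MAX_INPUT_TOKENS = 1024


def split_into_windows(
    text: str,
    max_tokens: int = MAX_INPUT_TOKENS,
    tokenize: Callable[[str], List[str]] = str.split,
) -> List[str]:
    """Split text into consecutive chunks of at most max_tokens tokens.

    The default whitespace tokenizer is only a stand-in (an assumption);
    pass the model's own tokenizer for an accurate token count.
    """
    if max_tokens <= 0:
        raise ValueError("max_tokens must be positive")
    tokens = tokenize(text)
    # Step through the token list in non-overlapping windows.
    return [
        " ".join(tokens[start:start + max_tokens])
        for start in range(0, len(tokens), max_tokens)
    ]


if __name__ == "__main__":
    paper = "word " * 2500  # stand-in for a long article body
    for i, chunk in enumerate(split_into_windows(paper)):
        print(i, len(chunk.split()))
```

Each window could then be summarized separately and the partial summaries concatenated. Whether that is preferable to simply truncating the input to the first window depends on the paper, and the card does not say which approach was used during training.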