AdamCodd commited on
Commit
0e18368
·
1 Parent(s): 28a4fda

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +20 -3
README.md CHANGED
@@ -1,8 +1,25 @@
1
  ---
2
  datasets:
3
  - ccdv/arxiv-summarization
4
- metrics:
5
- - rouge
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
6
  ---
7
  ## distilbart-sum-arxiv
8
  This model is a fine-tuned version of [sshleifer/distilbart-xsum-12-6](https://huggingface.co/sshleifer/distilbart-cnn-12-6) on a subset of the [ccdv/arxiv-summarization dataset](https://huggingface.co/datasets/ccdv/arxiv-summarization). It achieves the following results on the evaluation set:
@@ -13,7 +30,7 @@ This model is a fine-tuned version of [sshleifer/distilbart-xsum-12-6](https://h
13
  * RougeLSum: 24.260
14
 
15
  ## Model description
16
- This model is a distilled version of BART with 306M parameters (vs. 406 for the BART model) prioritizing speed with a 1.68x faster inference rate. It has been trained on 60_000 samples and has a limitation of 1024 tokens.
17
 
18
  ## Intended uses & limitations
19
  Since this model has been trained on scientific papers, it may perform poorly when attempting to summarize other types of content.
 
1
  ---
2
  datasets:
3
  - ccdv/arxiv-summarization
4
+ model-index:
5
+ - name: distilbart-sum-arxiv
6
+ results:
7
+ - task:
8
+ type: summarization
9
+ name: Summarization
10
+ dataset:
11
+ name: arxiv-summarization
12
+ type: arxiv-summarization
13
+ metrics:
14
+ - type: rouge-1
15
+ value: 42.1856
16
+ name: Validation ROUGE-1
17
+ - type: rouge-2
18
+ value: 15.4815
19
+ name: Validation ROUGE-2
20
+ - type: rouge-l
21
+ value: 24.4409
22
+ name: Validation ROUGE-L
23
  ---
24
  ## distilbart-sum-arxiv
25
  This model is a fine-tuned version of [sshleifer/distilbart-xsum-12-6](https://huggingface.co/sshleifer/distilbart-cnn-12-6) on a subset of the [ccdv/arxiv-summarization dataset](https://huggingface.co/datasets/ccdv/arxiv-summarization). It achieves the following results on the evaluation set:
 
30
  * RougeLSum: 24.260
31
 
32
  ## Model description
33
+ This model is a distilled version of BART with 306M parameters (vs. 406 for the BART model), but it is 1.68 times faster than BART at inference. It has been trained on 60_000 samples and has a limitation of 1024 tokens.
34
 
35
  ## Intended uses & limitations
36
  Since this model has been trained on scientific papers, it may perform poorly when attempting to summarize other types of content.