AdamCodd/distilbart-sum-arxiv

AdamCodd committed on Nov 6, 2023

Commit 0e18368 · 1 Parent(s): 28a4fda

Update README.md

Files changed (1)

README.md +20 -3

@@ -1,8 +1,25 @@
 ---
 datasets:
 - ccdv/arxiv-summarization
-metrics:
-- rouge
 ---
 ## distilbart-sum-arxiv
 This model is a fine-tuned version of [sshleifer/distilbart-xsum-12-6](https://huggingface.co/sshleifer/distilbart-cnn-12-6) on a subset of the [ccdv/arxiv-summarization dataset](https://huggingface.co/datasets/ccdv/arxiv-summarization). It achieves the following results on the evaluation set:
@@ -13,7 +30,7 @@ This model is a fine-tuned version of [sshleifer/distilbart-xsum-12-6](https://h
 * RougeLSum: 24.260
 ## Model description
-This model is a distilled version of BART with 306M parameters (vs. 406 for the BART model) prioritizing speed with a 1.68x faster inference rate. It has been trained on 60_000 samples and has a limitation of 1024 tokens.
 ## Intended uses & limitations
 Since this model has been trained on scientific papers, it may perform poorly when attempting to summarize other types of content.

 ---
 datasets:
 - ccdv/arxiv-summarization
+model-index:
+- name: distilbart-sum-arxiv
+  results:
+  - task:
+      type: summarization
+      name: Summarization
+    dataset:
+      name: arxiv-summarization
+      type: arxiv-summarization
+    metrics:
+    - type: rouge-1
+      value: 42.1856
+      name: Validation ROUGE-1
+    - type: rouge-2
+      value: 15.4815
+      name: Validation ROUGE-2
+    - type: rouge-l
+      value: 24.4409
+      name: Validation ROUGE-L
 ---
 ## distilbart-sum-arxiv
 This model is a fine-tuned version of [sshleifer/distilbart-xsum-12-6](https://huggingface.co/sshleifer/distilbart-cnn-12-6) on a subset of the [ccdv/arxiv-summarization dataset](https://huggingface.co/datasets/ccdv/arxiv-summarization). It achieves the following results on the evaluation set:
 * RougeLSum: 24.260
 ## Model description
+This model is a distilled version of BART with 306M parameters (vs. 406 for the BART model), but it is 1.68 times faster than BART at inference. It has been trained on 60_000 samples and has a limitation of 1024 tokens.
 ## Intended uses & limitations
 Since this model has been trained on scientific papers, it may perform poorly when attempting to summarize other types of content.
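
The model description above mentions a 1024-token input limit, so a caller will usually want truncation enabled when summarizing full papers. The following is a minimal sketch of how the model might be used through the `transformers` summarization pipeline. It is not part of this commit. The model ID is assumed from the repository path on this page, and the helper names `load_summarizer` and `summarize` and the default length values are illustrative choices, not settings recommended by the author.

```python
# Assumed from the repository path shown on this page; not stated in the README itself.
MODEL_ID = "AdamCodd/distilbart-sum-arxiv"

# Input limit stated in the model description.
MAX_INPUT_TOKENS = 1024


def load_summarizer(model_id: str = MODEL_ID):
    """Build a summarization pipeline (hypothetical helper).

    The import is done lazily so this module can be loaded without
    transformers installed; weights are downloaded on first use.
    """
    from transformers import pipeline

    return pipeline("summarization", model=model_id)


def summarize(text: str, summarizer, max_length: int = 128, min_length: int = 30) -> str:
    """Summarize one document with the given pipeline-like callable.

    truncation=True asks the tokenizer to cut inputs that exceed the
    model's maximum input length instead of raising an error.
    max_length and min_length here are illustrative defaults only.
    """
    result = summarizer(
        text,
        truncation=True,
        max_length=max_length,
        min_length=min_length,
    )
    return result[0]["summary_text"]
```

A call would look like `summarize(paper_text, load_summarizer())`. Because the model was trained on scientific papers, output quality on other kinds of text may be poor, as noted under "Intended uses & limitations".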