alex-atelo
committed on
Update README.md
README.md CHANGED
@@ -4,8 +4,13 @@ base_model: google/mt5-small
 tags:
 - summarization
 - generated_from_trainer
+language:
+- en
+- es
 metrics:
 - rouge
+datasets:
+- csebuetnlp/xlsum
 model-index:
 - name: mt5-small-finetuned-xlsum-en-es
   results: []
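The new `language` and `datasets` metadata point at the English and Spanish configs of XL-Sum. Below is a minimal sketch of pulling those two subsets with the `datasets` library; the `train` split and the 30-word cutoff (one way to read the "shorter targets" reduction described in the next hunk) are illustrative assumptions, not details from the card:

```python
# Sketch only: load the two XL-Sum configs named in the metadata.
from datasets import load_dataset, concatenate_datasets

en = load_dataset("csebuetnlp/xlsum", "english", split="train")
es = load_dataset("csebuetnlp/xlsum", "spanish", split="train")

# "Shorter targets": keep examples with brief reference summaries.
# The 30-word threshold is an assumed illustration, not from the card.
def short_target(example):
    return len(example["summary"].split()) <= 30

reduced = concatenate_datasets([en.filter(short_target), es.filter(short_target)])
print(reduced)
```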
@@ -16,7 +21,10 @@ should probably proofread and complete it, then remove this comment. -->
 
 # mt5-small-finetuned-xlsum-en-es
 
-This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the
+This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the csebuetnlp/xlsum dataset.
+
+A reduced version of the English/Spanish subsets was used, focusing on shorter targets.
+
 It achieves the following results on the evaluation set:
 - Loss: 2.9483
 - Rouge1: 19.42
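Since this documents a summarization fine-tune, a usage sketch along these lines should apply. The repo id below is an assumption pieced together from the committer's namespace and the model-index name; adjust it if the checkpoint lives elsewhere:

```python
# Minimal inference sketch; the repo id is assumed, not confirmed by the card.
from transformers import pipeline

summarizer = pipeline(
    "summarization",
    model="alex-atelo/mt5-small-finetuned-xlsum-en-es",
)
article = "Un artículo de noticias en inglés o español..."
print(summarizer(article, max_length=64)[0]["summary_text"])
```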
@@ -31,6 +39,10 @@ More information needed
 
 ## Intended uses & limitations
 
+The model may produce false information when summarizing.
+
+This is an early draft and is not intended for production use; use it at your own risk.
+
 More information needed
 
 ## Training and evaluation data
@@ -65,3 +77,31 @@ The following hyperparameters were used during training:
 - Pytorch 2.2.1+cu121
 - Datasets 2.18.0
 - Tokenizers 0.15.2
+
+
+## Citation
+
+<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
+
+**BibTeX:**
+
+```
+@inproceedings{hasan-etal-2021-xl,
+    title = "{XL}-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages",
+    author = "Hasan, Tahmid and
+      Bhattacharjee, Abhik and
+      Islam, Md. Saiful and
+      Mubasshir, Kazi and
+      Li, Yuan-Fang and
+      Kang, Yong-Bin and
+      Rahman, M. Sohel and
+      Shahriyar, Rifat",
+    booktitle = "Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021",
+    month = aug,
+    year = "2021",
+    address = "Online",
+    publisher = "Association for Computational Linguistics",
+    url = "https://aclanthology.org/2021.findings-acl.413",
+    pages = "4693--4703",
+}
+```
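For completeness, the Rouge1 score reported above can be reproduced in style (though not in exact value, since the eval subset and generation settings are not stated) with the `evaluate` library:

```python
# Sketch: score generated summaries against references with ROUGE.
# The card's Rouge1 of 19.42 came from an unspecified eval setup, so
# treat this as the scoring method only, not a reproduction recipe.
import evaluate

rouge = evaluate.load("rouge")
scores = rouge.compute(
    predictions=["the cat sat on the mat"],
    references=["a cat was sitting on the mat"],
)
print(scores["rouge1"])
```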