Sami92 committed
Commit a02ab97 · verified · 1 Parent(s): 5b5bb24

Update README.md

Files changed (1): README.md (+11 / -34)
README.md CHANGED
@@ -63,17 +63,10 @@ for entity in entities:

### Training Data

- <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->

- [More Information Needed]
-
- ### Training Procedure
-
- <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
-
- #### Preprocessing [optional]
-
- [More Information Needed]


#### Training Hyperparameters
@@ -85,34 +78,18 @@ for entity in entities:


- ## Evaluation
-
- <!-- This section describes the evaluation protocols and provides the results. -->
-
- ### Testing Data, Factors & Metrics
-
- #### Testing Data
-
- <!-- This should link to a Dataset Card if possible. -->
-
- [More Information Needed]
-
-
- [More Information Needed]
-
#### Metrics

- <!-- These are the evaluation metrics being used, ideally with a description of why. -->
-
- [More Information Needed]
-
- ### Results
-
- [More Information Needed]


## Model Card Authors [optional]
-
@misc{your_model_name,
  author = {Nenno, Sai},
  title = {Public Entity Recognition Model},
@@ -122,4 +99,4 @@ for entity in entities:
  url = {https://huggingface.co/Sami92/XLM-PER-B}
}

- ´
 
### Training Data

+ The model was first fine-tuned on a weakly annotated dataset of German newspaper articles (total = 267,786) and German Wikipedia articles (total = 4,348).
+ The weak annotation was based on the [database of public speakers](https://github.com/Leibniz-HBI/DBoeS-data/).
+ In a second step, the model was fine-tuned on a manually annotated dataset of 3,090 sentences from similar sources. The test split of this dataset was used for evaluation.


#### Training Hyperparameters
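The weak annotation described above amounts to dictionary-based labelling against the DBoeS list of public speakers. The exact matching rules are not documented in this card, so the snippet below is only a minimal sketch of the general idea; the example names, the `PER` tag, and the `weak_label` helper are hypothetical and not the model's actual label set or pipeline.

```python
# Minimal sketch of dictionary-based weak labelling (illustrative only).
# SPEAKERS stands in for entries from the DBoeS public-speakers data;
# the real label set and matching rules used for XLM-PER-B may differ.
SPEAKERS = {"Olaf Scholz": "PER", "Annalena Baerbock": "PER"}

def weak_label(tokens):
    """Assign BIO tags by exact token-level matching of known names."""
    labels = ["O"] * len(tokens)
    for name, tag in SPEAKERS.items():
        name_tokens = name.split()
        n = len(name_tokens)
        for i in range(len(tokens) - n + 1):
            if tokens[i:i + n] == name_tokens:
                labels[i] = f"B-{tag}"
                for j in range(i + 1, i + n):
                    labels[j] = f"I-{tag}"
    return labels

tokens = "Olaf Scholz sprach gestern in Berlin .".split()
print(list(zip(tokens, weak_label(tokens))))
# [('Olaf', 'B-PER'), ('Scholz', 'I-PER'), ('sprach', 'O'), ...]
```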
 
#### Metrics

+ - type: f1
+   value: 0.80
+ - type: recall
+   value: 0.78
+ - type: precision
+   value: 0.84


## Model Card Authors [optional]
+ ```bibtex
@misc{your_model_name,
  author = {Nenno, Sai},
  title = {Public Entity Recognition Model},

  url = {https://huggingface.co/Sami92/XLM-PER-B}
}

+ ```
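The metrics above were obtained on the test split of the manually annotated data; the card does not state whether they are token-level or entity-level, or how they were averaged. For reference, a common way to compute such scores for NER is entity-level evaluation with seqeval, sketched here with placeholder tag sequences rather than the actual test data.

```python
# Entity-level precision/recall/F1 with seqeval, a common choice for NER.
# The tag sequences below are toy placeholders, not the model's test split.
from seqeval.metrics import precision_score, recall_score, f1_score

y_true = [["B-PER", "I-PER", "O", "O"], ["O", "B-ORG", "O"]]
y_pred = [["B-PER", "I-PER", "O", "O"], ["O", "O", "O"]]

print("precision:", precision_score(y_true, y_pred))
print("recall:   ", recall_score(y_true, y_pred))
print("f1:       ", f1_score(y_true, y_pred))
```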