howanching-clara committed
Commit 7184cc0 · verified · 1 Parent(s): e1d7dc5

Update README.md

Files changed (1)
  1. README.md +12 -2
README.md CHANGED
@@ -22,11 +22,21 @@ It achieves the following results on the evaluation set:
 
  ## Model description
 
- More information needed
+ The model is fine-tuned on academic publications in Linguistics to classify the sentences of a publication into 4 classes.
+
+ The 4 classes:
+ 0: out of scope - material of low significance, e.g. page numbers or noise from OCR/PDF-to-text conversion
+ 1: main text - the body text of the publication, to be used for downstream tasks
+ 2: examples - figure captions, quotes, or excerpts
+ 3: references - the publication's reference list, excluding in-text citations
 
  ## Intended uses & limitations
 
- More information needed
+ Intended uses:
+ - to extract the main text from academic texts for downstream tasks
+
+ Limitations:
+ - training and evaluation data are limited to English-language academic texts in Linguistics
 
  ## Training and evaluation data
 
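
The intended use described in this change (keeping only class 1, main text, for downstream tasks) could be sketched roughly as follows. This is a minimal illustration, not the author's code: the class ids come from the README text above, while `ID2LABEL`, `extract_main_text`, and `toy_predict` are hypothetical names, and `toy_predict` is a rule-based stand-in for the fine-tuned model, whose identifier and loading code are not given in this commit.

```python
from typing import Callable, Iterable, List

# Class ids as listed in the README's model description.
ID2LABEL = {
    0: "out of scope",
    1: "main text",
    2: "examples",
    3: "references",
}

MAIN_TEXT_ID = 1


def extract_main_text(
    sentences: Iterable[str],
    predict: Callable[[str], int],
) -> List[str]:
    """Keep only the sentences that `predict` assigns to class 1 (main text)."""
    return [s for s in sentences if predict(s) == MAIN_TEXT_ID]


def toy_predict(sentence: str) -> int:
    """Hypothetical stand-in for the fine-tuned model; NOT the real classifier."""
    text = sentence.strip()
    if text.isdigit():
        return 0  # bare page number
    if text.startswith("Figure"):
        return 2  # figure caption
    if text.startswith("["):
        return 3  # reference-list entry
    return 1  # everything else is treated as main text


# Made-up input sentences, one per class.
sentences = [
    "17",
    "Vowel harmony is common in the languages surveyed.",
    "Figure 2: Distribution of vowel length.",
    "[3] Author, A. (year). Placeholder reference entry.",
]
main_text = extract_main_text(sentences, toy_predict)
```

In practice `toy_predict` would be replaced by a call to the actual classifier that returns the predicted class id for each sentence; the filtering step stays the same.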