howanching-clara
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -22,11 +22,21 @@ It achieves the following results on the evaluation set:
|
|
22 |
|
23 |
## Model description
|
24 |
|
25 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
26 |
|
27 |
## Intended uses & limitations
|
28 |
|
29 |
-
|
|
|
|
|
|
|
|
|
30 |
|
31 |
## Training and evaluation data
|
32 |
|
|
|
22 |
|
23 |
## Model description
|
24 |
|
25 |
+
The model is fine-tuned with academic publications in Linguistics, to classify sentences in publications into 4 classes.
|
26 |
+
|
27 |
+
The 4 classes:
|
28 |
+
0: out of scope - materials that are of low significance, eg. page number, noise from OCR/pdf-to-text convertion
|
29 |
+
1: main text - texts that are the main texts of the publication, to be used for down-stream tasks
|
30 |
+
2: examples - texts that are captions of the figures, or quotes or excerpts
|
31 |
+
3: references - references of the publication, excluding in-text citations
|
32 |
|
33 |
## Intended uses & limitations
|
34 |
|
35 |
+
Intended uses:
|
36 |
+
- to extract main text in academic texts for down-stream tasks
|
37 |
+
|
38 |
+
Limitations:
|
39 |
+
- training and evaluation data is limited to English, and academic texts in Linguistics
|
40 |
|
41 |
## Training and evaluation data
|
42 |
|