File size: 1,929 Bytes
e8c18e6
 
 
 
 
 
 
 
e90aef8
 
 
 
 
 
b1cf6ea
 
 
722e89b
 
86e8585
722e89b
 
 
 
 
 
 
e2b1534
 
722e89b
 
e2b1534
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
722e89b
86e8585
 
 
722e89b
86e8585
722e89b
86e8585
722e89b
86e8585
722e89b
 
 
86e8585
 
722e89b
86e8585
722e89b
b1cf6ea
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
---
license: mit
datasets:
- turkish-nlp-suite/turkish-wikiNER
language:
- tr
library_name: transformers
pipeline_tag: token-classification
metrics:
- bertscore
tags:
- chemistry
- music
- finance
- ner
- bert
- turkish
---

**Turkish NER dataset from Wikipedia sentences. 20.000 sentences are sampled and re-annotated from Kuzgunlar NER dataset.**

Data split:

-18.000 train
-1000 test
-1000 dev



Labels:

•	CARDINAL
•	DATE
•	EVENT
•	FAC
•	GPE
•	LANGUAGE
•	LAW
•	LOC
•	MONEY
•	NORP
•	ORDINAL
•	ORG
•	PERCENT
•	PERSON
•	PRODUCT
•	QUANTITY
•	TIME
•	TITLE
•	WORK_OF_ART

Example:

![image/png](/static-proxy?url=https%3A%2F%2Fcdn-uploads.huggingface.co%2Fproduction%2Fuploads%2F63edf9885e6cf35f9b89b0ff%2FS9DMiCryIfQQ8ok6JZSqs.png%3C%2Fspan%3E)

**Model Evaluation**


The validation process of the model was performed on the test dataset. During the evaluation:

• The model was put into evaluation mode.

• Loss and accuracy were calculated.

• A classification report was created using the Seqeval library. It shows the performance of the model for each label in detail.
![image/png](/static-proxy?url=https%3A%2F%2Fcdn-uploads.huggingface.co%2Fproduction%2Fuploads%2F63edf9885e6cf35f9b89b0ff%2FMFS3u-huX33mjwPlxWA2l.png%3C%2Fspan%3E)

**Results and Performance**

The accuracy and loss values ​​obtained in the training and validation stages of the model are reported, and the classification report and F1 score, precision and recall values ​​of each label are given. The performance of the model reached high accuracy rates in the Turkish NER task.

It has shown the effectiveness of the BERT model for named entity recognition tasks in the Turkish language. The methods used in the training and evaluation processes increased the overall performance of the model and ensured that the difficulties related to the language model were overcome.
![image/png](/static-proxy?url=https%3A%2F%2Fcdn-uploads.huggingface.co%2Fproduction%2Fuploads%2F63edf9885e6cf35f9b89b0ff%2Flsy8OWloF__F-cRoIuCJg.png%3C%2Fspan%3E)%3C!-- HTML_TAG_END -->