nlztrk committed on
Commit b0cf554 · 1 Parent(s): 915f414

Create README.md

README.md ADDED
**Train-Test Set:** "teknofest_train_final.csv"

**Model:** "dbmdz/bert-base-turkish-128k-uncased"

**Preprocessing**
- Characters are lowercased
- Punctuation marks are removed

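The two preprocessing steps listed above could be sketched as follows. This is only an illustration: the helper name `preprocess`, the Turkish-aware handling of `I`/`İ`, and the use of ASCII `string.punctuation` are assumptions, since the README does not say how the steps were implemented.

```python
import string

# Assumption: Turkish-aware lowercasing. Python's default str.lower() maps
# "I" to "i" and "İ" to "i" plus a combining dot, so both are mapped first.
_TR_LOWER = str.maketrans({"I": "ı", "İ": "i"})
# Assumption: "punctuation" means the ASCII set in string.punctuation.
_STRIP_PUNCT = str.maketrans("", "", string.punctuation)


def preprocess(text: str) -> str:
    """Lowercase the text and remove punctuation (hypothetical helper)."""
    text = text.translate(_TR_LOWER).lower()
    text = text.translate(_STRIP_PUNCT)
    # Collapse any whitespace runs left behind by removed punctuation.
    return " ".join(text.split())
```

Whatever variant is used, the same function would need to be applied at inference time so that inputs match what the uncased model saw during training.
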
## Tokenizer Parameters
```
max_length=64
padding=True
truncation=True
```

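To make the effect of these three parameters concrete, here is a small pure-Python sketch of their semantics on lists of token ids. In Hugging Face tokenizers, `padding=True` pads to the longest sequence in the batch (not to `max_length`), and `truncation=True` cuts sequences at `max_length`. The function `pad_and_truncate` and the `pad_id` value are hypothetical, and the sketch ignores special tokens, which a real tokenizer keeps when truncating.

```python
from typing import List


def pad_and_truncate(
    batch: List[List[int]], max_length: int = 64, pad_id: int = 0
) -> List[List[int]]:
    """Mimic truncation=True and padding=True on raw token-id lists."""
    # truncation=True: no sequence may exceed max_length tokens.
    truncated = [ids[:max_length] for ids in batch]
    # padding=True: pad every sequence to the longest one in this batch.
    longest = max((len(ids) for ids in truncated), default=0)
    return [ids + [pad_id] * (longest - len(ids)) for ids in truncated]


# With the transformers library, the corresponding call would look roughly like:
#   tokenizer(texts, max_length=64, padding=True, truncation=True)
```

A batch of short texts therefore stays short, while any text longer than 64 tokens is cut to 64.
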
## Training Parameters
- **Epoch:** 3
- **Learning Rate:** 7e-5
- **Batch-Size:** 64
- **Tokenizer Length:** 64
- **Loss:** BCE
- **Online Hard Example Mining:** On
- **Class-Weighting:** On (^0.3)
- **Early Stopping:** Off
- **Stratified Batch Sampling:** On
- **Gradient Accumulation:** Off
- **LR Scheduler:** Cosine-with-Warmup
- **Warmup Ratio:** 0.1
- **Weight Decay:** 0.01
- **LLRD:** 0.95
- **Label Smoothing:** 0.05
- **Gradient Clipping:** 1.0
- **MLM Pre-Training:** Off

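The README does not define the `^0.3` in the class-weighting entry. One plausible reading, shown here purely as an assumption, is that balanced inverse-frequency weights are raised to the power 0.3 to soften them. The helper `class_weights` is hypothetical, and the class counts are taken from the support column of the CV10 report in this README.

```python
from typing import Dict


def class_weights(counts: Dict[str, int], power: float = 0.3) -> Dict[str, float]:
    """Softened inverse-frequency weights (assumed meaning of '^0.3')."""
    total = sum(counts.values())
    n_classes = len(counts)
    # total / (n_classes * n) is the usual "balanced" weight; the exponent
    # pulls every weight toward 1.0 so rare classes are not over-emphasized.
    return {
        label: (total / (n_classes * n)) ** power for label, n in counts.items()
    }


# Class counts from the support column of the CV10 report.
counts = {
    "INSULT": 2393,
    "OTHER": 3528,
    "PROFANITY": 2376,
    "RACIST": 2033,
    "SEXIST": 2081,
}
weights = class_weights(counts)
```

With a power below 1, the most frequent class (OTHER) still receives the smallest weight and the rarest (RACIST) the largest, but the spread between them is much narrower than with plain inverse frequency.
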
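The learning-rate entries (base rate 7e-5, LLRD 0.95, cosine schedule with a 0.1 warmup ratio) can be combined as in the sketch below. It assumes LLRD stands for layer-wise learning rate decay applied from the top encoder layer downward, with 12 encoder layers as in a BERT-base model. The function names and the step counts in the usage are hypothetical. The schedule factor follows the usual linear-warmup, half-cosine-decay shape.

```python
import math
from typing import List


def layerwise_lrs(
    base_lr: float = 7e-5, decay: float = 0.95, num_layers: int = 12
) -> List[float]:
    """Per-depth learning rates: index 0 = embeddings, last = top layer."""
    # Assumption: the top encoder layer keeps base_lr and each layer
    # below it is scaled by one more factor of `decay`.
    return [base_lr * decay ** (num_layers - depth) for depth in range(num_layers + 1)]


def cosine_warmup_factor(step: int, total_steps: int, warmup_ratio: float = 0.1) -> float:
    """Multiplier applied to every learning rate at a given step."""
    warmup_steps = int(total_steps * warmup_ratio)
    if step < warmup_steps:
        # Linear warmup from 0 to 1.
        return step / max(1, warmup_steps)
    # Half-cosine decay from 1 to 0 over the remaining steps.
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return max(0.0, 0.5 * (1.0 + math.cos(math.pi * progress)))


# Usage: effective rate of the embeddings halfway through a hypothetical run.
lrs = layerwise_lrs()
embedding_lr_mid_run = lrs[0] * cosine_warmup_factor(step=500, total_steps=1000)
```

In a real training loop, each depth would typically become its own optimizer parameter group, and the factor would be applied by the scheduler rather than by hand.
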
## CV10 Results
```
              precision    recall  f1-score   support

      INSULT     0.9098    0.9143    0.9120      2393
       OTHER     0.9596    0.9481    0.9538      3528
   PROFANITY     0.9599    0.9575    0.9587      2376
      RACIST     0.9551    0.9636    0.9594      2033
      SEXIST     0.9552    0.9635    0.9593      2081

    accuracy                         0.9485     12411
   macro avg     0.9479    0.9494    0.9486     12411
weighted avg     0.9486    0.9485    0.9485     12411
```
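
As a reading aid for the summary rows of the report, the sketch below recomputes the macro and weighted averages from the per-class rows. The per-class numbers are copied from the report. The variable and function names are hypothetical, and because the inputs are already rounded to four decimals, the recomputed values are only expected to agree with the report after rounding.

```python
# (precision, recall, f1-score, support) per class, copied from the report.
report = {
    "INSULT": (0.9098, 0.9143, 0.9120, 2393),
    "OTHER": (0.9596, 0.9481, 0.9538, 3528),
    "PROFANITY": (0.9599, 0.9575, 0.9587, 2376),
    "RACIST": (0.9551, 0.9636, 0.9594, 2033),
    "SEXIST": (0.9552, 0.9635, 0.9593, 2081),
}

total_support = sum(row[3] for row in report.values())


def macro_avg(col: int) -> float:
    """Unweighted mean over classes: every class counts equally."""
    return sum(row[col] for row in report.values()) / len(report)


def weighted_avg(col: int) -> float:
    """Mean over classes weighted by support (number of samples)."""
    return sum(row[col] * row[3] for row in report.values()) / total_support


macro_f1 = macro_avg(2)
weighted_f1 = weighted_avg(2)
```

The macro average treats all five classes equally, while the weighted average gives the larger OTHER class more influence, which is why the two rows can differ slightly.
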