patrickramos committed on
Commit
e18e0ad
·
1 Parent(s): 9276abf

Update model card

  1. README.md +58 -0
README.md CHANGED
@@ -1,3 +1,61 @@
1
  ---
2
  license: cc-by-sa-3.0
3
+ language:
4
+ - ja
5
+ tags:
6
+ - emotion-analysis
7
+ datasets:
8
+ - wrime
9
  ---
10
+
11
+ # WRIME-fine-tuned BERT base Japanese
12
+
13
+ This model is a [Japanese BERT<sub>BASE</sub>](https://huggingface.co/cl-tohoku/bert-base-japanese-v2) fine-tuned on the [WRIME](https://github.com/ids-cv/wrime) dataset. It was trained as part of the paper ["Emotion Analysis of Writers and Readers of Japanese Tweets on Vaccinations"](https://aclanthology.org/2022.wassa-1.10/). Fine-tuning code is available at this [repo](https://github.com/PatrickJohnRamos/BERT-Japan-vaccination).
14
+
15
+ # Intended uses and limitations
16
+
17
+ This model can be used to predict intensity scores for eight emotions for both writers and readers. Please refer to the `Fine-tuning data` section for the list of emotions.
18
+
19
+ Because the model was fine-tuned on a regression task, it can predict scores outside the range of the scores in the fine-tuning data (`score < 0` or `score > 3`).
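If out-of-range predictions are a problem downstream, one option is to clamp them to the label range. The snippet below is a minimal sketch of that idea, assuming the 0 to 3 range listed in the `Fine-tuning data` section; the function name `clip_scores` and the sample values are hypothetical and are not part of the released code.

```python
def clip_scores(scores, low=0.0, high=3.0):
    """Clamp each predicted intensity into the closed range [low, high]."""
    return [min(max(s, low), high) for s in scores]


# Hypothetical raw regression outputs, not real model predictions.
raw = [-0.2, 0.7, 3.4, 1.5]
clipped = clip_scores(raw)
print(clipped)
```

Clamping changes only the values that fall outside the range, so in-range predictions keep their original resolution.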
20
+
21
+ # Model architecture, tokenization, and pretraining
22
+
23
+ The base model that was fine-tuned is the Japanese BERT<sub>BASE</sub> `cl-tohoku/bert-base-japanese-v2`. Please refer to its [model card](https://huggingface.co/cl-tohoku/bert-base-japanese-v2) for details regarding the model architecture, tokenization, pretraining data, and pretraining procedure.
24
+
25
+ # Fine-tuning data
26
+
27
+ The model is fine-tuned on [WRIME](https://github.com/ids-cv/wrime), a dataset of Japanese Tweets annotated with writer and reader emotion intensities. We use version 1 of the dataset. Each Tweet is accompanied by one set of writer emotion intensities (from the author of the Tweet) and three sets of reader emotion intensities (from three annotators). The emotions follow Plutchik's eight basic emotions, namely:
28
+
29
+ * joy
30
+ * sadness
31
+ * anticipation
32
+ * surprise
33
+ * anger
34
+ * fear
35
+ * disgust
36
+ * trust
37
+
38
+ These emotion intensities follow a four-point scale:
39
+
40
+ | Emotion intensity | Emotion presence |
41
+ |---|---|
42
+ | 0 | no |
43
+ | 1 | weak |
44
+ | 2 | medium |
45
+ | 3 | strong |
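To read a continuous prediction in terms of this scale, a score can be clamped and rounded to the nearest intensity. The following is a sketch under that assumption; round-half-up is an illustrative choice, and the names `PRESENCE` and `to_presence` are hypothetical.

```python
# Labels taken from the four-point scale table above.
PRESENCE = {0: "no", 1: "weak", 2: "medium", 3: "strong"}


def to_presence(score):
    """Map a continuous score to the nearest label on the 0-3 scale."""
    clamped = min(max(score, 0.0), 3.0)   # keep the score inside the scale
    nearest = int(clamped + 0.5)          # round half up (an assumption)
    return PRESENCE[nearest]


for s in (-0.3, 0.4, 1.5, 2.49, 3.8):
    print(s, to_presence(s))
```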
46
+
47
+ # Fine-tuning
48
+
49
+ The model is fine-tuned to directly regress, from each Tweet, the emotion intensities of the writer and the emotion intensities of the readers averaged over the three annotators. This gives 16 outputs (8 emotions each for the writer and the readers).
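The construction of the 16 regression targets can be sketched as follows. This is an illustration only: the ordering of the outputs (writer first, then readers, with emotions in the order listed in `Fine-tuning data`) is an assumption, and `build_target` and the sample annotations are hypothetical.

```python
EMOTIONS = ["joy", "sadness", "anticipation", "surprise",
            "anger", "fear", "disgust", "trust"]


def build_target(writer, readers):
    """Return 16 targets: 8 writer intensities, then 8 averaged reader intensities.

    writer  -- dict mapping emotion name to intensity
    readers -- list of such dicts, one per reader annotator
    """
    writer_part = [float(writer[e]) for e in EMOTIONS]
    reader_part = [sum(r[e] for r in readers) / len(readers) for e in EMOTIONS]
    return writer_part + reader_part


# Hypothetical annotations for a single Tweet (not taken from WRIME).
writer = dict.fromkeys(EMOTIONS, 0)
writer["joy"] = 2
readers = [dict.fromkeys(EMOTIONS, 0) for _ in range(3)]
readers[0]["joy"], readers[1]["joy"], readers[2]["joy"] = 1, 2, 3

target = build_target(writer, readers)
print(len(target), target)
```

Averaging the readers turns their integer labels into fractional targets, which is one reason a regression objective fits this setup.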
50
+
51
+ The fine-tuning procedure was inspired by common BERT fine-tuning procedures. The model was fine-tuned on WRIME for 3 epochs using the AdamW optimizer with a learning rate of 2e-5, β<sub>1</sub>=0.9, β<sub>2</sub>=0.999, weight decay of 0.01, linear learning-rate decay, a warmup ratio of 0.01, and a batch size of 32. Training was conducted on an NVIDIA Tesla K80 and finished in 3 hours.
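For readers unfamiliar with warmup followed by linear decay, the learning-rate schedule described above can be sketched in plain Python. This is not the authors' training code: the total step count is a hypothetical value, the rounding of the warmup length is an assumption, and `linear_warmup_decay_lr` is an illustrative name.

```python
def linear_warmup_decay_lr(step, total_steps, peak_lr=2e-5, warmup_ratio=0.01):
    """Learning rate at `step`: linear ramp to peak_lr, then linear decay to 0."""
    warmup_steps = round(total_steps * warmup_ratio)  # rounding is an assumption
    if step < warmup_steps:
        return peak_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return peak_lr * remaining / max(1, total_steps - warmup_steps)


TOTAL_STEPS = 1000  # hypothetical; depends on dataset size, batch size, epochs
for step in (0, 5, 10, 500, 1000):
    print(step, linear_warmup_decay_lr(step, TOTAL_STEPS))
```

With a warmup ratio of 0.01, the rate reaches its peak after the first 1% of steps and then falls linearly to zero at the final step.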
52
+
53
+ # Evaluation results
54
+
55
+ Below are the mean squared errors (MSEs) of the fine-tuned model on the test split of WRIME.
56
+
57
+ | Annotator | Joy | Sadness | Anticipation | Surprise | Anger | Fear | Disgust | Trust | Overall |
58
+ |---|---|---|---|---|---|---|---|---|---|
59
+ | Writer | 0.658 | 0.688 | 0.746 | 0.542 | 0.486 | 0.462 | 0.664 | 0.400 | 0.581 |
60
+ | Reader | 0.192 | 0.178 | 0.211 | 0.139 | 0.032 | 0.147 | 0.123 | 0.029 | 0.131 |
61
+ | Both | 0.425 | 0.433 | 0.479 | 0.341 | 0.259 | 0.304 | 0.394 | 0.214 | 0.356 |
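As a reading aid, the sketch below shows how an MSE is computed and checks how the `Overall` column relates to the per-emotion columns. The reported numbers are consistent with `Overall` being the unweighted mean of the eight per-emotion MSEs; that is an observation from the table, not something stated by the authors. The helper `mse` and its toy inputs are hypothetical.

```python
def mse(predictions, targets):
    """Mean squared error between two equal-length sequences."""
    return sum((p - t) ** 2 for p, t in zip(predictions, targets)) / len(targets)


# Toy example with made-up numbers: errors of 1 and 0 give an MSE of 0.5.
print(mse([1.0, 2.0], [0.0, 2.0]))

# Per-emotion MSEs copied from the Writer and Reader rows of the table.
writer_mse = [0.658, 0.688, 0.746, 0.542, 0.486, 0.462, 0.664, 0.400]
reader_mse = [0.192, 0.178, 0.211, 0.139, 0.032, 0.147, 0.123, 0.029]

writer_overall = sum(writer_mse) / len(writer_mse)
reader_overall = sum(reader_mse) / len(reader_mse)
both_overall = (writer_overall + reader_overall) / 2

print(writer_overall, reader_overall, both_overall)
```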