|
--- |
|
language: en |
|
tags: |
|
- text-classification |
|
- emotion classification |
|
- emotion recognition |
|
- cross-domain emotion recognition |
|
datasets: |
|
- go_emotions |
|
- sem_eval_2018_task_1 |
|
- xed_en_fi |
|
- daily_dialog |
|
license: mit |
|
widget: |
|
- text: All your work was lost when the computer crashed.</s><s>Oh my god. I spent a whole week on that. |
|
--- |
|
|
|
# EmoMosaic-large |
|
EmoMosaic-large is a well-competent model designed for classifying emotions in text, demonstrating strong performance across multiple domains. In fact, it outperforms recent state-of-the-art models on |
|
SemEval-2018 Task 1: Affect in Tweets and GoEmotions datasets. The model was developed as a part of my master's thesis. |
|
|
|
## Author |
|
|
|
Author: Bc. Vít Tlustoš ([email protected]) |
|
|
|
Supervisor: doc. Malik Aamir Saeed Ph.D |
|
|
|
## Thesis Text |
|
Once the thesis has been defended, the text will be accessible at https://www.vut.cz/studenti/zav-prace/detail/153407. You are welcome to read it. Should you have any questions, please don't hesitate to contact me via the provided email. |
|
|
|
## Demo Application |
|
As a part of the solution, we developed a Gradio application and deployed it on the Hugging Face Spaces platform. Once the thesis is made public, you can access it at: https://huggingface.co/spaces/vtlustos/EmoMosaic-space. This allows anyone to experiment with the models easily without requiring any technical skills or setup. |
|
|
|
## Models |
|
To utilize these models within your application, first install all the necessary dependencies. |
|
|
|
```bash |
|
pip install torch |
|
pip install transformers |
|
pip install datasets |
|
``` |
|
|
|
To utilize these models within your application, integrate following code and format your samples as `context</s><s>sentence`. The `context` is optional and represents sentences preceding the sentence to be classified, while `sentence` refers to the actual sentence undergoing classification. This example demonstrates how to use the `EmoMosaic-base` model. If you prefer to use its larger counterpart, replace `vtlustos/EmoMosaic-base` with `vtlustos/EmoMosaic-large`. |
|
|
|
```python |
|
import torch |
|
from transformers import RobertaTokenizer |
|
from transformers import RobertaForSequenceClassification |
|
|
|
# 1. initialize the model |
|
tokenizer = RobertaTokenizer.from_pretrained( |
|
"vtlustos/EmoMosaic-base" |
|
) |
|
model = RobertaForSequenceClassification.from_pretrained( |
|
"vtlustos/EmoMosaic-base" |
|
).to('cuda:0') |
|
|
|
# 2. tokenize the sentences |
|
tokens = tokenizer( |
|
[ |
|
"All your work was lost when the computer crashed.</s><s>Oh my god. I spent a whole week on that." |
|
], |
|
truncation=True, |
|
padding=True, |
|
return_tensors = "pt" |
|
) |
|
|
|
# 3. make the prediction |
|
with torch.no_grad(): |
|
logits = model( |
|
tokens["input_ids"].to('cuda:0'), |
|
tokens["attention_mask"].to('cuda:0') |
|
).logits |
|
|
|
# 4. convert to probabilities |
|
preds = torch.sigmoid(logits) |
|
|
|
print(preds) |
|
``` |
|
|
|
After executing the code, you will receive a tensor with dimensions `[S,E]`, where `S` represents the number of samples and `E` denotes the number of emotions. To associate individual probabilities with their respective emotions, use to the dictionary provided below: |
|
|
|
```python |
|
ix2label = { |
|
"0": "admiration", |
|
"1": "amusement", |
|
"2": "anger", |
|
"3": "annoyance", |
|
"4": "anticipation", |
|
"5": "approval", |
|
"6": "caring", |
|
"7": "confusion", |
|
"8": "curiosity", |
|
"9": "desire", |
|
"10": "disappointment", |
|
"11": "disapproval", |
|
"12": "disgust", |
|
"13": "embarrassment", |
|
"14": "excitement", |
|
"15": "fear", |
|
"16": "gratitude", |
|
"17": "grief", |
|
"18": "happiness", |
|
"19": "joy", |
|
"20": "love", |
|
"21": "nervousness", |
|
"22": "optimism", |
|
"23": "pessimism", |
|
"24": "pride", |
|
"25": "realization", |
|
"26": "relief", |
|
"27": "remorse", |
|
"28": "sadness", |
|
"29": "surprise", |
|
"30": "trust" |
|
} |
|
``` |
|
|
|
## Results |
|
Here we present a brief overview of the results. For an in-depth analysis and discussion, please refer to the text of the thesis. The analysis covers model training, comparisons with other methods, assessments of performance at the level of individual categories, calibration, and qualitative evaluations across various scenarios. |
|
|
|
### SemEval-2018 Task 1: Affect in Tweets |
|
| Model | Accuracy | P (macro) | R (macro) | F1 (macro) | P (micro) | R (micro) | F1 (micro) | |
|
|------------------|----------|-----------|-----------|------------|-----------|-----------|------------| |
|
| EmoMosaic-base | 20.65 | 54.96 | 62.58 | 58.44 | 64.63 | 73.62 | 68.83 | |
|
| EmoMosaic-large | 22.49 | 57.97 | 64.12 | 60.72 | 67.44 | 75.27 | 71.14 | |
|
|
|
*Note: P and R denote precision and recall, respectively. Results are shown for our top-performing models measured on the test set of the SemEval-2018 Task 1: Affect in Tweets dataset.* |
|
|
|
### GoEmotions |
|
| Model | Accuracy | P (macro) | R (macro) | F1 (macro) | P (micro) | R (micro) | F1 (micro) | |
|
|------------------|----------|-----------|-----------|------------|-----------|-----------|------------| |
|
| EmoMosaic-base | 46.47 | 51.41 | 57.81 | 53.72 | 52.70 | 62.53 | 57.19 | |
|
| EmoMosaic-large | 46.67| 51.35 | 58.34 | 53.93 | 52.86 | 63.39 | 57.65 | |
|
|
|
*Note: P and R denote precision and recall, respectively. Results are shown for our two top-performing models measured on the test set of the GoEmotions dataset.* |
|
|
|
### XED |
|
| Model | Accuracy | P (macro) | R (macro) | F1 (macro) | P (micro) | R (micro) | F1 (micro) | |
|
|------------------|----------|-----------|-----------|------------|-----------|-----------|------------| |
|
| EmoMosaic-base | 51.78 | 48.47 | 63.00 | 54.67 | 48.62 | 63.86 | 55.21 | |
|
| EmoMosaic-large | 52.59 | 50.35 | 66.54 | 57.19 | 50.43 | 67.43 | 57.70 | |
|
|
|
*Note: P and R denote precision and recall, respectively. Results are shown for our two top-performing models measured on the test set of the XED dataset.* |
|
|
|
### DailyDialog |
|
| Model | Accuracy | P (macro) | R (macro) | F1 (macro) | P (micro) | R (micro) | F1 (micro) | |
|
|------------------|----------|-----------|-----------|------------|-----------|-----------|------------| |
|
| EmoMosaic-base | 84.85 | 46.34 | 49.60 | 46.94 | 53.44 | 64.81 | 58.57 | |
|
| EmoMosaic-large | 85.05 | 47.20 | 53.80 | 49.65 | 54.24 | 68.77 | 60.65 | |
|
|
|
*Note: P and R denote precision and recall, respectively. Results are shown for our two top-performing models measured on the test set of the DailyDialog dataset.* |
|
|
|
### Per-Emotion Performance |
|
|
|
#### EmoMosaic-base |
|
|
|
| Emotion | Precision | Recall | F1 | |
|
|----------------|-----------|--------|-------| |
|
| admiration | 63.82 | 80.16 | 71.06 | |
|
| amusement | 74.11 | 94.32 | 83.00 | |
|
| anger | 63.46 | 74.08 | 68.36 | |
|
| annoyance | 35.15 | 44.37 | 39.23 | |
|
| anticipation | 39.09 | 55.15 | 45.75 | |
|
| approval | 43.40 | 45.87 | 44.60 | |
|
| caring | 45.67 | 42.96 | 44.27 | |
|
| confusion | 36.10 | 56.86 | 44.16 | |
|
| curiosity | 48.48 | 67.25 | 56.34 | |
|
| desire | 53.09 | 51.81 | 52.44 | |
|
| disappointment | 35.57 | 35.10 | 35.33 | |
|
| disapproval | 40.00 | 49.44 | 44.22 | |
|
| disgust | 62.05 | 71.31 | 66.36 | |
|
| embarrassment | 57.69 | 40.54 | 47.62 | |
|
| excitement | 37.40 | 44.66 | 40.71 | |
|
| fear | 61.93 | 68.69 | 65.13 | |
|
| gratitude | 93.29 | 90.91 | 92.09 | |
|
| grief | 66.67 | 66.67 | 66.67 | |
|
| happiness | 58.10 | 70.76 | 63.81 | |
|
| joy | 73.43 | 81.18 | 77.11 | |
|
| love | 64.95 | 73.74 | 69.07 | |
|
| nervousness | 33.33 | 43.48 | 37.74 | |
|
| optimism | 64.33 | 76.00 | 69.68 | |
|
| pessimism | 42.31 | 52.80 | 46.98 | |
|
| pride | 66.67 | 37.50 | 48.00 | |
|
| realization | 32.71 | 24.14 | 27.78 | |
|
| relief | 55.56 | 45.45 | 50.00 | |
|
| remorse | 55.56 | 89.29 | 68.49 | |
|
| sadness | 58.65 | 70.14 | 63.88 | |
|
| surprise | 40.02 | 51.29 | 44.96 | |
|
| trust | 35.33 | 47.01 | 40.34 | |
|
|
|
#### EmoMosaic-large |
|
| Emotion | Precision | Recall | F1 | |
|
|----------------|-----------|--------|-------| |
|
| admiration | 65.25 | 79.37 | 71.62 | |
|
| amusement | 73.87 | 93.18 | 82.41 | |
|
| anger | 64.29 | 76.00 | 69.66 | |
|
| annoyance | 33.81 | 44.06 | 38.26 | |
|
| anticipation | 42.10 | 57.99 | 48.78 | |
|
| approval | 42.66 | 44.73 | 43.67 | |
|
| caring | 40.26 | 45.93 | 42.91 | |
|
| confusion | 38.76 | 52.94 | 44.75 | |
|
| curiosity | 48.40 | 74.65 | 58.73 | |
|
| desire | 65.08 | 49.40 | 56.16 | |
|
| disappointment | 34.36 | 37.09 | 35.67 | |
|
| disapproval | 39.14 | 47.94 | 43.10 | |
|
| disgust | 63.62 | 72.30 | 67.68 | |
|
| embarrassment | 58.33 | 37.84 | 45.90 | |
|
| excitement | 39.82 | 43.69 | 41.67 | |
|
| fear | 64.22 | 71.24 | 67.55 | |
|
| gratitude | 91.01 | 92.05 | 91.53 | |
|
| grief | 66.67 | 66.67 | 66.67 | |
|
| happiness | 58.21 | 75.23 | 65.63 | |
|
| joy | 74.55 | 83.53 | 78.78 | |
|
| love | 64.13 | 76.13 | 69.62 | |
|
| nervousness | 42.86 | 39.13 | 40.91 | |
|
| optimism | 66.98 | 79.38 | 72.66 | |
|
| pessimism | 43.66 | 47.73 | 45.61 | |
|
| pride | 63.64 | 43.75 | 51.85 | |
|
| realization | 34.29 | 24.83 | 28.80 | |
|
| relief | 33.33 | 36.36 | 34.78 | |
|
| remorse | 57.78 | 92.86 | 71.23 | |
|
| sadness | 61.08 | 72.67 | 66.37 | |
|
| surprise | 44.02 | 55.67 | 49.16 | |
|
| trust | 40.59 | 48.26 | 44.09 | |