Michael Stewart
commited on
Commit
·
5f8a5e8
1
Parent(s):
f2e17d6
initial commit
Browse files- README.md +15 -0
- dev.tsv +0 -0
- loss.tsv +2 -0
- pytorch_model.bin +3 -0
- test.tsv +0 -0
- training.log +103 -0
- weights.txt +0 -0
README.md
ADDED
@@ -0,0 +1,15 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
tags:
|
3 |
+
- flair
|
4 |
+
- text-classification
|
5 |
+
- text-classification-model
|
6 |
+
language: en
|
7 |
+
datasets:
|
8 |
+
- mwo_re
|
9 |
+
widget:
|
10 |
+
- text: "pump broken Item Observation pump is broken"
|
11 |
+
---
|
12 |
+
|
13 |
+
## MWO NER Test
|
14 |
+
|
15 |
+
A flair-based RE model for MWOs. There are three classes: `HAS_ACTIVITY`, `HAS_OBSERVATION`, and `APPEARS_WITH`.
|
dev.tsv
ADDED
The diff for this file is too large to render.
See raw diff
|
|
loss.tsv
ADDED
@@ -0,0 +1,2 @@
|
|
|
|
|
|
|
1 |
+
EPOCH TIMESTAMP BAD_EPOCHS LEARNING_RATE TRAIN_LOSS DEV_LOSS DEV_PRECISION DEV_RECALL DEV_F1 DEV_ACCURACY
|
2 |
+
1 20:40:28 0 0.1000 0.01682120544167471 0.0031255579087883234 0.9592 0.8773 0.9164 0.9417
|
pytorch_model.bin
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:92c2e416ce656932663f8fed2a7db8c6fa1bd203a92582289365669557595713
|
3 |
+
size 906903603
|
test.tsv
ADDED
The diff for this file is too large to render.
See raw diff
|
|
training.log
ADDED
@@ -0,0 +1,103 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
2022-11-21 20:37:39,123 ----------------------------------------------------------------------------------------------------
|
2 |
+
2022-11-21 20:37:39,125 Model: "TextClassifier(
|
3 |
+
(decoder): Linear(in_features=256, out_features=13, bias=True)
|
4 |
+
(dropout): Dropout(p=0.0, inplace=False)
|
5 |
+
(locked_dropout): LockedDropout(p=0.0)
|
6 |
+
(word_dropout): WordDropout(p=0.0)
|
7 |
+
(loss_function): CrossEntropyLoss()
|
8 |
+
(document_embeddings): DocumentRNNEmbeddings(
|
9 |
+
(embeddings): StackedEmbeddings(
|
10 |
+
(list_embedding_0): PooledFlairEmbeddings(
|
11 |
+
(context_embeddings): FlairEmbeddings(
|
12 |
+
(lm): LanguageModel(
|
13 |
+
(drop): Dropout(p=0.25, inplace=False)
|
14 |
+
(encoder): Embedding(275, 100)
|
15 |
+
(rnn): LSTM(100, 2048)
|
16 |
+
(decoder): Linear(in_features=2048, out_features=275, bias=True)
|
17 |
+
)
|
18 |
+
)
|
19 |
+
)
|
20 |
+
(list_embedding_1): PooledFlairEmbeddings(
|
21 |
+
(context_embeddings): FlairEmbeddings(
|
22 |
+
(lm): LanguageModel(
|
23 |
+
(drop): Dropout(p=0.25, inplace=False)
|
24 |
+
(encoder): Embedding(275, 100)
|
25 |
+
(rnn): LSTM(100, 2048)
|
26 |
+
(decoder): Linear(in_features=2048, out_features=275, bias=True)
|
27 |
+
)
|
28 |
+
)
|
29 |
+
)
|
30 |
+
)
|
31 |
+
(word_reprojection_map): Linear(in_features=8192, out_features=8192, bias=True)
|
32 |
+
(rnn): GRU(8192, 256, batch_first=True)
|
33 |
+
(dropout): Dropout(p=0.5, inplace=False)
|
34 |
+
)
|
35 |
+
(weights): None
|
36 |
+
(weight_tensor) None
|
37 |
+
)"
|
38 |
+
2022-11-21 20:37:39,126 ----------------------------------------------------------------------------------------------------
|
39 |
+
2022-11-21 20:37:39,127 Corpus: "Corpus: 24804 train + 3310 dev + 3234 test sentences"
|
40 |
+
2022-11-21 20:37:39,128 ----------------------------------------------------------------------------------------------------
|
41 |
+
2022-11-21 20:37:39,128 Parameters:
|
42 |
+
2022-11-21 20:37:39,129 - learning_rate: "0.100000"
|
43 |
+
2022-11-21 20:37:39,130 - mini_batch_size: "32"
|
44 |
+
2022-11-21 20:37:39,131 - patience: "3"
|
45 |
+
2022-11-21 20:37:39,132 - anneal_factor: "0.5"
|
46 |
+
2022-11-21 20:37:39,133 - max_epochs: "1"
|
47 |
+
2022-11-21 20:37:39,134 - shuffle: "True"
|
48 |
+
2022-11-21 20:37:39,135 - train_with_dev: "False"
|
49 |
+
2022-11-21 20:37:39,135 - batch_growth_annealing: "False"
|
50 |
+
2022-11-21 20:37:39,136 ----------------------------------------------------------------------------------------------------
|
51 |
+
2022-11-21 20:37:39,137 Model training base path: "models\re_models\flair"
|
52 |
+
2022-11-21 20:37:39,138 ----------------------------------------------------------------------------------------------------
|
53 |
+
2022-11-21 20:37:39,139 Device: cuda:0
|
54 |
+
2022-11-21 20:37:39,140 ----------------------------------------------------------------------------------------------------
|
55 |
+
2022-11-21 20:37:39,140 Embeddings storage mode: gpu
|
56 |
+
2022-11-21 20:37:39,141 ----------------------------------------------------------------------------------------------------
|
57 |
+
2022-11-21 20:37:39,143 train mode resetting embeddings
|
58 |
+
2022-11-21 20:37:39,143 train mode resetting embeddings
|
59 |
+
2022-11-21 20:37:53,636 epoch 1 - iter 77/776 - loss 0.05875709 - samples/sec: 174.18 - lr: 0.100000
|
60 |
+
2022-11-21 20:38:09,078 epoch 1 - iter 154/776 - loss 0.04883092 - samples/sec: 164.38 - lr: 0.100000
|
61 |
+
2022-11-21 20:38:24,116 epoch 1 - iter 231/776 - loss 0.04067766 - samples/sec: 168.98 - lr: 0.100000
|
62 |
+
2022-11-21 20:38:39,196 epoch 1 - iter 308/776 - loss 0.03364710 - samples/sec: 167.23 - lr: 0.100000
|
63 |
+
2022-11-21 20:38:54,197 epoch 1 - iter 385/776 - loss 0.02853559 - samples/sec: 169.50 - lr: 0.100000
|
64 |
+
2022-11-21 20:39:09,990 epoch 1 - iter 462/776 - loss 0.02485532 - samples/sec: 159.51 - lr: 0.100000
|
65 |
+
2022-11-21 20:39:24,857 epoch 1 - iter 539/776 - loss 0.02212722 - samples/sec: 171.02 - lr: 0.100000
|
66 |
+
2022-11-21 20:39:39,629 epoch 1 - iter 616/776 - loss 0.02006071 - samples/sec: 170.70 - lr: 0.100000
|
67 |
+
2022-11-21 20:39:55,776 epoch 1 - iter 693/776 - loss 0.01834604 - samples/sec: 155.97 - lr: 0.100000
|
68 |
+
2022-11-21 20:40:10,464 epoch 1 - iter 770/776 - loss 0.01691758 - samples/sec: 171.81 - lr: 0.100000
|
69 |
+
2022-11-21 20:40:11,521 ----------------------------------------------------------------------------------------------------
|
70 |
+
2022-11-21 20:40:11,523 EPOCH 1 done: loss 0.0168 - lr 0.100000
|
71 |
+
2022-11-21 20:40:28,143 Evaluating as a multi-label problem: False
|
72 |
+
2022-11-21 20:40:28,163 DEV : loss 0.0031255579087883234 - f1-score (micro avg) 0.9164
|
73 |
+
2022-11-21 20:40:28,894 BAD EPOCHS (no improvement): 0
|
74 |
+
2022-11-21 20:40:28,896 saving best model
|
75 |
+
2022-11-21 20:40:31,928 ----------------------------------------------------------------------------------------------------
|
76 |
+
2022-11-21 20:40:31,929 loading file models\re_models\flair\best-model.pt
|
77 |
+
2022-11-21 20:40:49,408 Evaluating as a multi-label problem: False
|
78 |
+
2022-11-21 20:40:49,427 0.9587 0.8521 0.9023 0.9332
|
79 |
+
2022-11-21 20:40:49,428
|
80 |
+
Results:
|
81 |
+
- F-score (micro) 0.9023
|
82 |
+
- F-score (macro) 0.9294
|
83 |
+
- Accuracy 0.9332
|
84 |
+
|
85 |
+
By class:
|
86 |
+
precision recall f1-score support
|
87 |
+
|
88 |
+
HAS_OBSERVATION 1.0000 1.0000 1.0000 313
|
89 |
+
HAS_ACTIVITY 1.0000 1.0000 1.0000 279
|
90 |
+
HAS_LOCATION 1.0000 1.0000 1.0000 238
|
91 |
+
APPEARS_WITH 0.5114 0.2064 0.2941 218
|
92 |
+
HAS_CONSUMABLE 1.0000 1.0000 1.0000 59
|
93 |
+
HAS_AGENT 1.0000 1.0000 1.0000 21
|
94 |
+
HAS_SPECIFIER 1.0000 1.0000 1.0000 15
|
95 |
+
HAS_ATTRIBUTE 1.0000 1.0000 1.0000 12
|
96 |
+
HAS_CARDINALITY 1.0000 1.0000 1.0000 10
|
97 |
+
HAS_TIME 1.0000 1.0000 1.0000 5
|
98 |
+
|
99 |
+
micro avg 0.9587 0.8521 0.9023 1170
|
100 |
+
macro avg 0.9511 0.9206 0.9294 1170
|
101 |
+
weighted avg 0.9090 0.8521 0.8685 1170
|
102 |
+
|
103 |
+
2022-11-21 20:40:49,429 ----------------------------------------------------------------------------------------------------
|
weights.txt
ADDED
File without changes
|