Michael Stewart committed on
Commit 5f8a5e8 · 1 Parent(s): f2e17d6

initial commit

Files changed (7)
  1. README.md +15 -0
  2. dev.tsv +0 -0
  3. loss.tsv +2 -0
  4. pytorch_model.bin +3 -0
  5. test.tsv +0 -0
  6. training.log +103 -0
  7. weights.txt +0 -0
README.md ADDED
@@ -0,0 +1,15 @@
+ ---
+ tags:
+ - flair
+ - text-classification
+ - text-classification-model
+ language: en
+ datasets:
+ - mwo_re
+ widget:
+ - text: "pump broken Item Observation pump is broken"
+ ---
+
+ ## MWO NER Test
+
+ A flair-based RE model for MWOs. There are three classes: `HAS_ACTIVITY`, `HAS_OBSERVATION`, and `APPEARS_WITH`.
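The widget text suggests that each input joins a head span, a tail span, their two entity types, and the full work-order text, in that order. That layout is inferred from the single widget example and is not confirmed by the README. A minimal Python sketch under that assumption follows; `build_relation_input`, `classify_relation`, and the `model_path` argument are hypothetical names introduced here for illustration.

```python
def build_relation_input(head: str, tail: str, head_type: str,
                         tail_type: str, text: str) -> str:
    """Join the fields in the order seen in the widget example.

    Assumption: the order (head, tail, head type, tail type, text) is
    inferred from one example and may not match the real preprocessing.
    """
    return " ".join([head, tail, head_type, tail_type, text])


def classify_relation(model_path: str, example: str):
    """Load a flair TextClassifier and return the labels for one input.

    `model_path` is a placeholder for wherever the trained model lives.
    flair is imported lazily so the helper above works without it.
    """
    from flair.data import Sentence
    from flair.models import TextClassifier

    classifier = TextClassifier.load(model_path)
    sentence = Sentence(example)
    classifier.predict(sentence)
    return sentence.labels


example = build_relation_input(
    "pump", "broken", "Item", "Observation", "pump is broken"
)
print(example)
```

Calling `classify_relation` with a local copy of the model file should then return flair label objects whose values are relation names such as `HAS_OBSERVATION`.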
dev.tsv ADDED
The diff for this file is too large to render. See raw diff
 
loss.tsv ADDED
@@ -0,0 +1,2 @@
+ EPOCH TIMESTAMP BAD_EPOCHS LEARNING_RATE TRAIN_LOSS DEV_LOSS DEV_PRECISION DEV_RECALL DEV_F1 DEV_ACCURACY
+ 1 20:40:28 0 0.1000 0.01682120544167471 0.0031255579087883234 0.9592 0.8773 0.9164 0.9417
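As a quick sanity check on the row above, the dev F1 can be recomputed from the logged precision and recall as their harmonic mean. The sketch below copies the two lines of `loss.tsv` as strings and splits on whitespace, which handles both the tab-separated file and the space-separated rendering shown here; the variable names are illustrative only.

```python
header = ("EPOCH TIMESTAMP BAD_EPOCHS LEARNING_RATE TRAIN_LOSS DEV_LOSS "
          "DEV_PRECISION DEV_RECALL DEV_F1 DEV_ACCURACY")
row = ("1 20:40:28 0 0.1000 0.01682120544167471 0.0031255579087883234 "
       "0.9592 0.8773 0.9164 0.9417")

# Pair each column name with its value for the single logged epoch.
record = dict(zip(header.split(), row.split()))

precision = float(record["DEV_PRECISION"])
recall = float(record["DEV_RECALL"])

# F1 is the harmonic mean of precision and recall.
recomputed_f1 = 2 * precision * recall / (precision + recall)

print(round(recomputed_f1, 4), record["DEV_F1"])
```

The recomputed value agrees with the logged `DEV_F1` to four decimal places, the precision at which the inputs were rounded.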
pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:92c2e416ce656932663f8fed2a7db8c6fa1bd203a92582289365669557595713
+ size 906903603
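The three lines above are a Git LFS pointer rather than the model weights themselves: each line is a key followed by a space and a value. A small sketch of reading such a pointer is shown below; `parse_lfs_pointer` is a hypothetical helper written for this note, not part of any Git LFS tooling.

```python
pointer_text = """version https://git-lfs.github.com/spec/v1
oid sha256:92c2e416ce656932663f8fed2a7db8c6fa1bd203a92582289365669557595713
size 906903603
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split each non-empty 'key value' line of a pointer file into a dict."""
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


pointer = parse_lfs_pointer(pointer_text)

# The oid is '<hash algorithm>:<hex digest>'.
algorithm, _, digest = pointer["oid"].partition(":")

# The size field is the byte count of the real file.
size_mib = int(pointer["size"]) / 2**20

print(algorithm, len(digest), f"{size_mib:.1f} MiB")
```

The size field works out to roughly 865 MiB for `pytorch_model.bin`.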
test.tsv ADDED
The diff for this file is too large to render. See raw diff
 
training.log ADDED
@@ -0,0 +1,103 @@
+ 2022-11-21 20:37:39,123 ----------------------------------------------------------------------------------------------------
+ 2022-11-21 20:37:39,125 Model: "TextClassifier(
+   (decoder): Linear(in_features=256, out_features=13, bias=True)
+   (dropout): Dropout(p=0.0, inplace=False)
+   (locked_dropout): LockedDropout(p=0.0)
+   (word_dropout): WordDropout(p=0.0)
+   (loss_function): CrossEntropyLoss()
+   (document_embeddings): DocumentRNNEmbeddings(
+     (embeddings): StackedEmbeddings(
+       (list_embedding_0): PooledFlairEmbeddings(
+         (context_embeddings): FlairEmbeddings(
+           (lm): LanguageModel(
+             (drop): Dropout(p=0.25, inplace=False)
+             (encoder): Embedding(275, 100)
+             (rnn): LSTM(100, 2048)
+             (decoder): Linear(in_features=2048, out_features=275, bias=True)
+           )
+         )
+       )
+       (list_embedding_1): PooledFlairEmbeddings(
+         (context_embeddings): FlairEmbeddings(
+           (lm): LanguageModel(
+             (drop): Dropout(p=0.25, inplace=False)
+             (encoder): Embedding(275, 100)
+             (rnn): LSTM(100, 2048)
+             (decoder): Linear(in_features=2048, out_features=275, bias=True)
+           )
+         )
+       )
+     )
+     (word_reprojection_map): Linear(in_features=8192, out_features=8192, bias=True)
+     (rnn): GRU(8192, 256, batch_first=True)
+     (dropout): Dropout(p=0.5, inplace=False)
+   )
+   (weights): None
+   (weight_tensor) None
+ )"
+ 2022-11-21 20:37:39,126 ----------------------------------------------------------------------------------------------------
+ 2022-11-21 20:37:39,127 Corpus: "Corpus: 24804 train + 3310 dev + 3234 test sentences"
+ 2022-11-21 20:37:39,128 ----------------------------------------------------------------------------------------------------
+ 2022-11-21 20:37:39,128 Parameters:
+ 2022-11-21 20:37:39,129 - learning_rate: "0.100000"
+ 2022-11-21 20:37:39,130 - mini_batch_size: "32"
+ 2022-11-21 20:37:39,131 - patience: "3"
+ 2022-11-21 20:37:39,132 - anneal_factor: "0.5"
+ 2022-11-21 20:37:39,133 - max_epochs: "1"
+ 2022-11-21 20:37:39,134 - shuffle: "True"
+ 2022-11-21 20:37:39,135 - train_with_dev: "False"
+ 2022-11-21 20:37:39,135 - batch_growth_annealing: "False"
+ 2022-11-21 20:37:39,136 ----------------------------------------------------------------------------------------------------
+ 2022-11-21 20:37:39,137 Model training base path: "models\re_models\flair"
+ 2022-11-21 20:37:39,138 ----------------------------------------------------------------------------------------------------
+ 2022-11-21 20:37:39,139 Device: cuda:0
+ 2022-11-21 20:37:39,140 ----------------------------------------------------------------------------------------------------
+ 2022-11-21 20:37:39,140 Embeddings storage mode: gpu
+ 2022-11-21 20:37:39,141 ----------------------------------------------------------------------------------------------------
+ 2022-11-21 20:37:39,143 train mode resetting embeddings
+ 2022-11-21 20:37:39,143 train mode resetting embeddings
+ 2022-11-21 20:37:53,636 epoch 1 - iter 77/776 - loss 0.05875709 - samples/sec: 174.18 - lr: 0.100000
+ 2022-11-21 20:38:09,078 epoch 1 - iter 154/776 - loss 0.04883092 - samples/sec: 164.38 - lr: 0.100000
+ 2022-11-21 20:38:24,116 epoch 1 - iter 231/776 - loss 0.04067766 - samples/sec: 168.98 - lr: 0.100000
+ 2022-11-21 20:38:39,196 epoch 1 - iter 308/776 - loss 0.03364710 - samples/sec: 167.23 - lr: 0.100000
+ 2022-11-21 20:38:54,197 epoch 1 - iter 385/776 - loss 0.02853559 - samples/sec: 169.50 - lr: 0.100000
+ 2022-11-21 20:39:09,990 epoch 1 - iter 462/776 - loss 0.02485532 - samples/sec: 159.51 - lr: 0.100000
+ 2022-11-21 20:39:24,857 epoch 1 - iter 539/776 - loss 0.02212722 - samples/sec: 171.02 - lr: 0.100000
+ 2022-11-21 20:39:39,629 epoch 1 - iter 616/776 - loss 0.02006071 - samples/sec: 170.70 - lr: 0.100000
+ 2022-11-21 20:39:55,776 epoch 1 - iter 693/776 - loss 0.01834604 - samples/sec: 155.97 - lr: 0.100000
+ 2022-11-21 20:40:10,464 epoch 1 - iter 770/776 - loss 0.01691758 - samples/sec: 171.81 - lr: 0.100000
+ 2022-11-21 20:40:11,521 ----------------------------------------------------------------------------------------------------
+ 2022-11-21 20:40:11,523 EPOCH 1 done: loss 0.0168 - lr 0.100000
+ 2022-11-21 20:40:28,143 Evaluating as a multi-label problem: False
+ 2022-11-21 20:40:28,163 DEV : loss 0.0031255579087883234 - f1-score (micro avg) 0.9164
+ 2022-11-21 20:40:28,894 BAD EPOCHS (no improvement): 0
+ 2022-11-21 20:40:28,896 saving best model
+ 2022-11-21 20:40:31,928 ----------------------------------------------------------------------------------------------------
+ 2022-11-21 20:40:31,929 loading file models\re_models\flair\best-model.pt
+ 2022-11-21 20:40:49,408 Evaluating as a multi-label problem: False
+ 2022-11-21 20:40:49,427 0.9587 0.8521 0.9023 0.9332
+ 2022-11-21 20:40:49,428
+ Results:
+ - F-score (micro) 0.9023
+ - F-score (macro) 0.9294
+ - Accuracy 0.9332
+
+ By class:
+                 precision    recall  f1-score   support
+
+ HAS_OBSERVATION    1.0000    1.0000    1.0000       313
+    HAS_ACTIVITY    1.0000    1.0000    1.0000       279
+    HAS_LOCATION    1.0000    1.0000    1.0000       238
+    APPEARS_WITH    0.5114    0.2064    0.2941       218
+  HAS_CONSUMABLE    1.0000    1.0000    1.0000        59
+       HAS_AGENT    1.0000    1.0000    1.0000        21
+   HAS_SPECIFIER    1.0000    1.0000    1.0000        15
+   HAS_ATTRIBUTE    1.0000    1.0000    1.0000        12
+ HAS_CARDINALITY    1.0000    1.0000    1.0000        10
+        HAS_TIME    1.0000    1.0000    1.0000         5
+
+       micro avg    0.9587    0.8521    0.9023      1170
+       macro avg    0.9511    0.9206    0.9294      1170
+    weighted avg    0.9090    0.8521    0.8685      1170
+
+ 2022-11-21 20:40:49,429 ----------------------------------------------------------------------------------------------------
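The summary rows of the test report can be cross-checked against the per-class rows. The sketch below recomputes the macro and support-weighted F1 from the ten class rows and also checks that the 776 iterations per epoch are consistent with 24804 training sentences at a mini-batch size of 32. The per-class values are copied from the log at four-decimal precision, so agreement is only expected to about that precision; the function and variable names are illustrative.

```python
import math

# (precision, recall, f1, support) per class, copied from the log above.
report = {
    "HAS_OBSERVATION": (1.0, 1.0, 1.0, 313),
    "HAS_ACTIVITY": (1.0, 1.0, 1.0, 279),
    "HAS_LOCATION": (1.0, 1.0, 1.0, 238),
    "APPEARS_WITH": (0.5114, 0.2064, 0.2941, 218),
    "HAS_CONSUMABLE": (1.0, 1.0, 1.0, 59),
    "HAS_AGENT": (1.0, 1.0, 1.0, 21),
    "HAS_SPECIFIER": (1.0, 1.0, 1.0, 15),
    "HAS_ATTRIBUTE": (1.0, 1.0, 1.0, 12),
    "HAS_CARDINALITY": (1.0, 1.0, 1.0, 10),
    "HAS_TIME": (1.0, 1.0, 1.0, 5),
}
F1, SUPPORT = 2, 3  # column indices within each tuple


def macro_average(rows, column):
    """Unweighted mean of one metric across classes."""
    return sum(row[column] for row in rows) / len(rows)


def weighted_average(rows, column):
    """Mean of one metric across classes, weighted by class support."""
    total = sum(row[SUPPORT] for row in rows)
    return sum(row[column] * row[SUPPORT] for row in rows) / total


rows = list(report.values())
total_support = sum(row[SUPPORT] for row in rows)
macro_f1 = macro_average(rows, F1)
weighted_f1 = weighted_average(rows, F1)

# Batches per epoch when the final partial batch is kept.
batches_per_epoch = math.ceil(24804 / 32)

print(total_support, round(macro_f1, 4), round(weighted_f1, 4), batches_per_epoch)
```

Because nine of the ten classes score 1.0000, the macro average is pulled down only by `APPEARS_WITH`, while the weighted average drops further since that class accounts for 218 of the 1170 test instances.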
weights.txt ADDED
File without changes