Commit 4f3e370 (verified) by gokulsrinivasagan · 1 Parent(s): 5dbb255

Model save

README.md ADDED
@@ -0,0 +1,78 @@
+ ---
+ library_name: transformers
+ base_model: gokulsrinivasagan/bert_base_lda_20_v1_book
+ tags:
+ - generated_from_trainer
+ metrics:
+ - spearmanr
+ model-index:
+ - name: bert_base_lda_20_v1_book_stsb
+   results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # bert_base_lda_20_v1_book_stsb
+
+ This model is a fine-tuned version of [gokulsrinivasagan/bert_base_lda_20_v1_book](https://huggingface.co/gokulsrinivasagan/bert_base_lda_20_v1_book) on an unknown dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.6833
+ - Pearson: 0.8393
+ - Spearmanr: 0.8380
+ - Combined Score: 0.8387
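
The card does not say how Combined Score is computed. The reported values are consistent with the arithmetic mean of Pearson and Spearmanr, so the sketch below checks that reading against three rows copied from the card. The function name `combined_score` is hypothetical, and the averaging rule is an assumption inferred from the numbers, not something the commit states.

```python
def combined_score(pearson: float, spearman: float) -> float:
    """Arithmetic mean of the two correlation coefficients (assumed definition)."""
    return (pearson + spearman) / 2.0


# (pearson, spearman, reported combined score), copied from the card.
reported = [
    (0.8393, 0.8380, 0.8387),  # final evaluation results
    (0.1765, 0.1748, 0.1756),  # epoch 1
    (0.8416, 0.8398, 0.8407),  # epoch 16
]

for pearson, spearman, combined in reported:
    # The card rounds every value to four decimals, so allow for
    # a rounding difference in the last printed digit.
    assert abs(combined_score(pearson, spearman) - combined) < 1e-4
```

The tolerance is needed because the card prints rounded inputs; the mean of two rounded values can differ from the rounded mean in the fourth decimal place.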
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 5e-05
+ - train_batch_size: 256
+ - eval_batch_size: 256
+ - seed: 10
+ - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+ - lr_scheduler_type: linear
+ - num_epochs: 50
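
As a rough guide to reproducing this configuration, the hyperparameters listed above can be written as keyword arguments for `transformers.TrainingArguments`. This is a sketch, not the author's training script: the variable name `training_kwargs` is hypothetical, and mapping `train_batch_size: 256` to a per-device batch size assumes a single device, which the card does not confirm.

```python
# Keyword arguments mirroring the hyperparameter list in the card.
training_kwargs = {
    "learning_rate": 5e-5,
    # Assumption: one device, so the card's total batch size equals
    # the per-device batch size.
    "per_device_train_batch_size": 256,
    "per_device_eval_batch_size": 256,
    "seed": 10,
    "optim": "adamw_torch",
    "adam_beta1": 0.9,
    "adam_beta2": 0.999,
    "adam_epsilon": 1e-8,
    "lr_scheduler_type": "linear",
    "num_train_epochs": 50,
}

# With transformers installed, the dictionary can be unpacked directly.
# "out" is a placeholder output directory, not a path from the commit.
# from transformers import TrainingArguments
# args = TrainingArguments(output_dir="out", **training_kwargs)
```

Keeping the values in a plain dictionary makes them easy to log or compare against the card before any training objects are constructed.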
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Pearson | Spearmanr | Combined Score |
+ |:-------------:|:-----:|:----:|:---------------:|:-------:|:---------:|:--------------:|
+ | 2.8738 | 1.0 | 23 | 2.4670 | 0.1765 | 0.1748 | 0.1756 |
+ | 1.4719 | 2.0 | 46 | 1.0280 | 0.7397 | 0.7404 | 0.7401 |
+ | 0.9801 | 3.0 | 69 | 0.8276 | 0.7956 | 0.7954 | 0.7955 |
+ | 0.783 | 4.0 | 92 | 0.7431 | 0.8197 | 0.8193 | 0.8195 |
+ | 0.5677 | 5.0 | 115 | 0.9075 | 0.8135 | 0.8152 | 0.8144 |
+ | 0.4407 | 6.0 | 138 | 0.7474 | 0.8267 | 0.8272 | 0.8269 |
+ | 0.3821 | 7.0 | 161 | 0.6753 | 0.8391 | 0.8371 | 0.8381 |
+ | 0.3036 | 8.0 | 184 | 0.8726 | 0.8246 | 0.8260 | 0.8253 |
+ | 0.269 | 9.0 | 207 | 0.7331 | 0.8311 | 0.8293 | 0.8302 |
+ | 0.2191 | 10.0 | 230 | 0.7562 | 0.8383 | 0.8368 | 0.8375 |
+ | 0.1854 | 11.0 | 253 | 0.7022 | 0.8365 | 0.8343 | 0.8354 |
+ | 0.1718 | 12.0 | 276 | 0.6650 | 0.8407 | 0.8382 | 0.8394 |
+ | 0.1685 | 13.0 | 299 | 0.7270 | 0.8350 | 0.8333 | 0.8342 |
+ | 0.1368 | 14.0 | 322 | 0.7532 | 0.8392 | 0.8376 | 0.8384 |
+ | 0.1351 | 15.0 | 345 | 0.8710 | 0.8379 | 0.8379 | 0.8379 |
+ | 0.1459 | 16.0 | 368 | 0.7801 | 0.8416 | 0.8398 | 0.8407 |
+ | 0.106 | 17.0 | 391 | 0.6833 | 0.8393 | 0.8380 | 0.8387 |
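
The headline evaluation numbers in the card match the last row of this table (epoch 17), and the log ends well short of the configured 50 epochs; the card does not explain why. The sketch below re-reads the table to find which epochs had the lowest validation loss and the highest Combined Score, and confirms the step count per epoch. The tuple layout and variable names are hypothetical; the values are copied from the table.

```python
# (epoch, step, validation_loss, combined_score), copied from the table.
rows = [
    (1, 23, 2.4670, 0.1756), (2, 46, 1.0280, 0.7401), (3, 69, 0.8276, 0.7955),
    (4, 92, 0.7431, 0.8195), (5, 115, 0.9075, 0.8144), (6, 138, 0.7474, 0.8269),
    (7, 161, 0.6753, 0.8381), (8, 184, 0.8726, 0.8253), (9, 207, 0.7331, 0.8302),
    (10, 230, 0.7562, 0.8375), (11, 253, 0.7022, 0.8354), (12, 276, 0.6650, 0.8394),
    (13, 299, 0.7270, 0.8342), (14, 322, 0.7532, 0.8384), (15, 345, 0.8710, 0.8379),
    (16, 368, 0.7801, 0.8407), (17, 391, 0.6833, 0.8387),
]

# Epoch with the lowest validation loss and epoch with the highest combined score.
best_by_loss = min(rows, key=lambda r: r[2])
best_by_score = max(rows, key=lambda r: r[3])

# Every epoch advances the step counter by the same amount.
steps_per_epoch = {step // epoch for epoch, step, _, _ in rows}

print("lowest validation loss:", best_by_loss)
print("highest combined score:", best_by_score)
print("steps per epoch:", steps_per_epoch)
```

By these numbers the lowest validation loss occurs at epoch 12 and the highest Combined Score at epoch 16, so the reported final checkpoint is close to, but not identical with, the best row on either criterion.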
+
+
+ ### Framework versions
+
+ - Transformers 4.46.3
+ - Pytorch 2.2.1+cu118
+ - Datasets 2.17.0
+ - Tokenizers 0.20.3
logs/events.out.tfevents.1733846209.ki-g0008.520107.30 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:28ed39b021aaa891ab801327347c3b6908f7e984d7191d082a6f3a91be3205b4
- size 14080
+ oid sha256:c69b5ab72680c60f892826d2eb51a19313e3aabd96b50c906387fd0eea855b47
+ size 16366
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:8e829e83053f96b7d90f4245689129e66dfd00fea732c923ea9ba103cc22b0fe
+ oid sha256:43caa0055b474f92037d4b568520498bf0424342843fac7972dfd877c8e01e6a
  size 437950172