gokulsrinivasagan committed on
Commit 75a1772 · verified · 1 Parent(s): 8622264

Model save

README.md ADDED
@@ -0,0 +1,76 @@
+ ---
+ library_name: transformers
+ base_model: gokulsrinivasagan/bert_base_lda_100_v1
+ tags:
+ - generated_from_trainer
+ metrics:
+ - spearmanr
+ model-index:
+ - name: bert_base_lda_100_v1_stsb
+   results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # bert_base_lda_100_v1_stsb
+
+ This model is a fine-tuned version of [gokulsrinivasagan/bert_base_lda_100_v1](https://huggingface.co/gokulsrinivasagan/bert_base_lda_100_v1) on an unknown dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 2.4802
+ - Pearson: nan
+ - Spearmanr: nan
+ - Combined Score: nan
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 0.001
+ - train_batch_size: 256
+ - eval_batch_size: 256
+ - seed: 10
+ - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+ - lr_scheduler_type: linear
+ - num_epochs: 50
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Pearson | Spearmanr | Combined Score |
+ |:-------------:|:-----:|:----:|:---------------:|:-------:|:---------:|:--------------:|
+ | 5.4876 | 1.0 | 23 | 2.5971 | nan | nan | nan |
+ | 2.2047 | 2.0 | 46 | 2.3759 | nan | nan | nan |
+ | 2.2017 | 3.0 | 69 | 2.4512 | nan | nan | nan |
+ | 2.1807 | 4.0 | 92 | 2.4512 | nan | nan | nan |
+ | 2.1807 | 5.0 | 115 | 2.5112 | nan | nan | nan |
+ | 2.196 | 6.0 | 138 | 2.3448 | nan | nan | nan |
+ | 2.1902 | 7.0 | 161 | 2.7164 | nan | nan | nan |
+ | 2.1899 | 8.0 | 184 | 2.6349 | nan | nan | nan |
+ | 2.1962 | 9.0 | 207 | 2.3354 | nan | nan | nan |
+ | 2.1802 | 10.0 | 230 | 2.3028 | nan | nan | nan |
+ | 2.1945 | 11.0 | 253 | 2.7164 | nan | nan | nan |
+ | 2.1932 | 12.0 | 276 | 2.7380 | nan | nan | nan |
+ | 2.206 | 13.0 | 299 | 2.7380 | nan | nan | nan |
+ | 2.1965 | 14.0 | 322 | 2.6546 | nan | nan | nan |
+ | 2.1794 | 15.0 | 345 | 2.4802 | nan | nan | nan |
+
+
+ ### Framework versions
+
+ - Transformers 4.46.3
+ - Pytorch 2.2.1+cu118
+ - Datasets 2.17.0
+ - Tokenizers 0.20.3
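The README above reports `nan` for Pearson, Spearmanr, and Combined Score at every epoch while the loss stays finite. The commit does not say why. One common cause, offered here only as an assumption, is a regression head that predicts the same value for every input: a correlation coefficient divides by the standard deviation of the predictions, which is then zero. The sketch below illustrates that mechanism with a hand-written `pearson` helper and made-up labels and predictions (all names and numbers are hypothetical, not taken from this run).

```python
import math


def pearson(xs, ys):
    """Pearson correlation; returns nan when either input has zero variance."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    denom = math.sqrt(var_x * var_y)
    if denom == 0.0:
        # 0/0: the correlation is undefined for a constant sequence.
        return float("nan")
    return cov / denom


# Hypothetical similarity labels and two hypothetical prediction sets.
labels = [0.0, 1.5, 3.0, 4.5, 5.0]
collapsed = [2.5] * len(labels)      # model outputs one constant value
varied = [0.2, 1.4, 2.9, 4.1, 4.8]   # model outputs track the labels

r_collapsed = pearson(collapsed, labels)
r_varied = pearson(varied, labels)

# Mean squared error stays finite even when the correlation is undefined.
mse_collapsed = sum((p - y) ** 2 for p, y in zip(collapsed, labels)) / len(labels)

print("collapsed predictions:", r_collapsed, "MSE:", mse_collapsed)
print("varied predictions:   ", r_varied)
```

If this is what happened, inspecting the spread of the raw predictions on the evaluation set would confirm it; a lower learning rate than the listed 0.001 is a typical thing to try, though nothing in this commit verifies that.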
logs/events.out.tfevents.1732650509.ki-g0008.2137866.12 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:0fa5c4b714ae8a8bced9b7fca6bfbdaa7223cb39ef4d4d4555ab44287d0e9376
- size 10841
+ oid sha256:cda7636a4426a7ec34fe21dc3f918403f599d4297767fbee52b97ab4c84bd901
+ size 15059
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:3426344417c14113adab55536b3bb884d31c8986fc44c9928b3dbcf5c7e6e15b
+ oid sha256:b858ba2f138e2e0d5357a8e3eb81ca96861b93c1a354b5766437f117d6b30d3e
  size 437950172
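The two changed files above are Git LFS pointers: each records a spec version, a SHA-256 `oid`, and a byte `size` rather than the file contents. The weights changed (new `oid`) while the size stayed at 437950172 bytes. A downloaded file can be checked against such a pointer by comparing its length and SHA-256 digest. The sketch below shows one way to do that; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the payload is a stand-in rather than a real artifact from this repository.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


def matches_pointer(pointer_text, data):
    """Return True if `data` has the size and SHA-256 digest the pointer records."""
    fields = parse_lfs_pointer(pointer_text)
    algo, _, expected_hex = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo}")
    return (
        len(data) == int(fields["size"])
        and hashlib.sha256(data).hexdigest() == expected_hex
    )


# Pointer for the updated event log, copied from the diff above.
pointer_from_diff = (
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:cda7636a4426a7ec34fe21dc3f918403f599d4297767fbee52b97ab4c84bd901\n"
    "size 15059\n"
)
fields = parse_lfs_pointer(pointer_from_diff)

# Stand-in payload with a pointer built for it, to exercise the check.
payload = b"example payload"
demo_pointer = (
    "version https://git-lfs.github.com/spec/v1\n"
    f"oid sha256:{hashlib.sha256(payload).hexdigest()}\n"
    f"size {len(payload)}\n"
)
ok = matches_pointer(demo_pointer, payload)
tampered = matches_pointer(demo_pointer, payload + b"x")
```

For a file as large as `model.safetensors`, reading it in chunks and feeding each chunk to `hashlib.sha256().update()` avoids holding the whole file in memory.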