gokulsrinivasagan
/

distilbert_lda_100_v1_sst2

@@ -1,28 +1,13 @@
 ---
 library_name: transformers
-language:
-- en
 base_model: gokulsrinivasagan/distilbert_lda_100_v1
 tags:
 - generated_from_trainer
-datasets:
-- glue
 metrics:
 - accuracy
 model-index:
 - name: distilbert_lda_100_v1_sst2
-  results:
-  - task:
-      name: Text Classification
-      type: text-classification
-    dataset:
-      name: GLUE SST2
-      type: glue
-      args: sst2
-    metrics:
-    - name: Accuracy
-      type: accuracy
-      value: 0.5091743119266054
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -30,10 +15,10 @@ should probably proofread and complete it, then remove this comment. -->
 # distilbert_lda_100_v1_sst2
-This model is a fine-tuned version of [gokulsrinivasagan/distilbert_lda_100_v1](https://huggingface.co/gokulsrinivasagan/distilbert_lda_100_v1) on the GLUE SST2 dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.6953
-- Accuracy: 0.5092
 ## Model description
@@ -52,7 +37,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.001
 - train_batch_size: 256
 - eval_batch_size: 256
 - seed: 10
@@ -64,12 +49,12 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 0.7081        | 1.0   | 264  | 0.6953          | 0.5092   |
-| 0.6871        | 2.0   | 528  | 0.6971          | 0.5092   |
-| 0.6868        | 3.0   | 792  | 0.6972          | 0.5092   |
-| 0.6867        | 4.0   | 1056 | 0.6974          | 0.5092   |
-| 0.6868        | 5.0   | 1320 | 0.6956          | 0.5092   |
-| 0.6866        | 6.0   | 1584 | 0.6971          | 0.5092   |
 ### Framework versions

 ---
 library_name: transformers
 base_model: gokulsrinivasagan/distilbert_lda_100_v1
 tags:
 - generated_from_trainer
 metrics:
 - accuracy
 model-index:
 - name: distilbert_lda_100_v1_sst2
+  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # distilbert_lda_100_v1_sst2
+This model is a fine-tuned version of [gokulsrinivasagan/distilbert_lda_100_v1](https://huggingface.co/gokulsrinivasagan/distilbert_lda_100_v1) on an unknown dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.6953
+- Accuracy: 0.8234
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 5e-05
 - train_batch_size: 256
 - eval_batch_size: 256
 - seed: 10
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
+| 0.3896        | 1.0   | 264  | 0.3907          | 0.8268   |
+| 0.2205        | 2.0   | 528  | 0.4041          | 0.8349   |
+| 0.1574        | 3.0   | 792  | 0.5309          | 0.8165   |
+| 0.1151        | 4.0   | 1056 | 0.5299          | 0.8211   |
+| 0.0891        | 5.0   | 1320 | 0.5801          | 0.8372   |
+| 0.0677        | 6.0   | 1584 | 0.6953          | 0.8234   |
 ### Framework versions

logs/events.out.tfevents.1733320591.ki-g0008.1206436.28 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:966d2a9fafd28c8fc9693d8d6a210398889a3289358e1187dd795a1d487fd307
-size 7759

 version https://git-lfs.github.com/spec/v1
+oid sha256:ae5389a6d47ab997f73998d950b798085f1f84f765962a55edbfb256de7c884e
+size 8647

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:83d0ca735e57372e10ae36a8ece980000adab00abc6386553c286c25cb202c72
 size 267832560

 version https://git-lfs.github.com/spec/v1
+oid sha256:c55d9c92249c501d6c55c412f5f17a0f6b1827066fa467d23f0afb57bd326e97
 size 267832560