gokulsrinivasagan/distilbert_lda_100_v1_mnli

@@ -1,28 +1,13 @@
 ---
 library_name: transformers
-language:
-- en
 base_model: gokulsrinivasagan/distilbert_lda_100_v1
 tags:
 - generated_from_trainer
-datasets:
-- glue
 metrics:
 - accuracy
 model-index:
 - name: distilbert_lda_100_v1_mnli
-  results:
-  - task:
-      name: Text Classification
-      type: text-classification
-    dataset:
-      name: GLUE MNLI
-      type: glue
-      args: mnli
-    metrics:
-    - name: Accuracy
-      type: accuracy
-      value: 0.3522172497965826
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -30,10 +15,10 @@ should probably proofread and complete it, then remove this comment. -->
 # distilbert_lda_100_v1_mnli
-This model is a fine-tuned version of [gokulsrinivasagan/distilbert_lda_100_v1](https://huggingface.co/gokulsrinivasagan/distilbert_lda_100_v1) on the GLUE MNLI dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.0962
-- Accuracy: 0.3522
 ## Model description
@@ -52,7 +37,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.001
 - train_batch_size: 256
 - eval_batch_size: 256
 - seed: 10
@@ -64,13 +49,14 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step  | Validation Loss | Accuracy |
 |:-------------:|:-----:|:-----:|:---------------:|:--------:|
-| 1.1037        | 1.0   | 1534  | 1.0993          | 0.3274   |
-| 1.0986        | 2.0   | 3068  | 1.0962          | 0.3545   |
-| 1.0988        | 3.0   | 4602  | 1.0989          | 0.3274   |
-| 1.0985        | 4.0   | 6136  | 1.1016          | 0.3182   |
-| 1.0985        | 5.0   | 7670  | 1.0989          | 0.3545   |
-| 1.0987        | 6.0   | 9204  | 1.0989          | 0.3545   |
-| 1.0984        | 7.0   | 10738 | 1.0994          | 0.3182   |
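
The removed rows above all sit near 1.0986, which is ln 3, the cross-entropy of a uniform guess over three classes. MNLI has three labels, so this is consistent with a run that never moved off chance-level predictions. The same commit changes `learning_rate` from 0.001 to 5e-05, though the card does not state that this was the cause. A minimal sketch of the check, assuming a three-class softmax head (the variable and function names are illustrative, not taken from the training code):

```python
import math

# Assumption: three output classes, as in MNLI
# (entailment, neutral, contradiction).
NUM_CLASSES = 3


def uniform_cross_entropy(num_classes: int) -> float:
    """Cross-entropy of predicting probability 1/num_classes for every class."""
    return math.log(num_classes)


chance_loss = uniform_cross_entropy(NUM_CLASSES)

# Training losses copied from the removed rows of the results table.
old_train_losses = [1.1037, 1.0986, 1.0988, 1.0985, 1.0985, 1.0987, 1.0984]

# Largest distance between any logged loss and the chance-level loss.
max_gap = max(abs(loss - chance_loss) for loss in old_train_losses)

print(f"chance-level loss: {chance_loss:.4f}")
print(f"largest gap from chance: {max_gap:.4f}")
```

The accuracies in the removed rows (0.3182 to 0.3545) are also close to one third, which points the same way.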
 ### Framework versions

 ---
 library_name: transformers
 base_model: gokulsrinivasagan/distilbert_lda_100_v1
 tags:
 - generated_from_trainer
 metrics:
 - accuracy
 model-index:
 - name: distilbert_lda_100_v1_mnli
+  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # distilbert_lda_100_v1_mnli
+This model is a fine-tuned version of [gokulsrinivasagan/distilbert_lda_100_v1](https://huggingface.co/gokulsrinivasagan/distilbert_lda_100_v1) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.9470
+- Accuracy: 0.7383
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 5e-05
 - train_batch_size: 256
 - eval_batch_size: 256
 - seed: 10
 | Training Loss | Epoch | Step  | Validation Loss | Accuracy |
 |:-------------:|:-----:|:-----:|:---------------:|:--------:|
+| 0.8015        | 1.0   | 1534  | 0.7065          | 0.6998   |
+| 0.6395        | 2.0   | 3068  | 0.6530          | 0.7296   |
+| 0.5434        | 3.0   | 4602  | 0.6438          | 0.7388   |
+| 0.459         | 4.0   | 6136  | 0.6610          | 0.7388   |
+| 0.3802        | 5.0   | 7670  | 0.7116          | 0.7474   |
+| 0.3083        | 6.0   | 9204  | 0.7747          | 0.7442   |
+| 0.2483        | 7.0   | 10738 | 0.8570          | 0.7382   |
+| 0.202         | 8.0   | 12272 | 0.9470          | 0.7383   |
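
The card's headline numbers (Loss 0.9470, Accuracy 0.7383) match the final row of the added table rather than its best row. A minimal sketch for picking the best epoch from the logged values, assuming the table is the only source of metrics (the tuple layout is an illustrative choice, not the Trainer's log format):

```python
# (epoch, training_loss, validation_loss, accuracy), copied from the added rows.
rows = [
    (1, 0.8015, 0.7065, 0.6998),
    (2, 0.6395, 0.6530, 0.7296),
    (3, 0.5434, 0.6438, 0.7388),
    (4, 0.4590, 0.6610, 0.7388),
    (5, 0.3802, 0.7116, 0.7474),
    (6, 0.3083, 0.7747, 0.7442),
    (7, 0.2483, 0.8570, 0.7382),
    (8, 0.2020, 0.9470, 0.7383),
]

# Epoch with the lowest validation loss.
best_by_loss = min(rows, key=lambda r: r[2])
# Epoch with the highest validation accuracy.
best_by_acc = max(rows, key=lambda r: r[3])
# Last logged epoch, which is what the card reports.
final = rows[-1]

print(f"lowest validation loss: epoch {best_by_loss[0]} ({best_by_loss[2]:.4f})")
print(f"highest accuracy:       epoch {best_by_acc[0]} ({best_by_acc[3]:.4f})")
print(f"reported (final):       epoch {final[0]} ({final[2]:.4f}, {final[3]:.4f})")
```

Validation loss rises after epoch 3 while training loss keeps falling, a pattern usually read as overfitting. Which checkpoint the uploaded weights correspond to is not stated in the card.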
 ### Framework versions

logs/events.out.tfevents.1733321410.ki-g0008.1206436.34 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:053d66b1c0c5acfb79d889485c14f0767393e70d9aeea6465736f93ad9c857f8
-size 8932

 version https://git-lfs.github.com/spec/v1
+oid sha256:e4115243047d063777ebe3c284980f8fffb561b20986d7629a42706bbdfb96f0
+size 9820

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e6ce96928ab5d26765ae186b4d5b982ef14dfac144046418929bb4e78c6d83cb
 size 267835644

 version https://git-lfs.github.com/spec/v1
+oid sha256:1ef3778a05062173bb3d33058c2a2e36a98d810c2df03f44c89e213b19898a70
 size 267835644
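
Both changed files are Git LFS pointers: three `key value` lines giving the spec version, the SHA-256 of the real file, and its size in bytes. A minimal sketch that parses the two `model.safetensors` pointers from this diff and compares them (`parse_lfs_pointer` is a hypothetical helper written for this example, not part of any Git LFS tooling):

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its version, hash, and size fields."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each pointer line is "<key> <value>".
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


old_pointer = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:e6ce96928ab5d26765ae186b4d5b982ef14dfac144046418929bb4e78c6d83cb
size 267835644
""")

new_pointer = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:1ef3778a05062173bb3d33058c2a2e36a98d810c2df03f44c89e213b19898a70
size 267835644
""")

same_size = old_pointer["size"] == new_pointer["size"]
same_content = old_pointer["digest"] == new_pointer["digest"]
print(f"same size: {same_size}, same content: {same_content}")
```

An unchanged size with a different digest is what one would expect when weights are replaced by a retrained model of the same architecture.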