Porameht
/

whisper-small-th

@@ -7,9 +7,24 @@ tags:
 - generated_from_trainer
 datasets:
 - mozilla-foundation/common_voice_17_0
 model-index:
 - name: whisper-small-th
-  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -17,9 +32,14 @@ should probably proofread and complete it, then remove this comment. -->
 [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/service-engineering/fine_tune_whisper_th/runs/c58tla8j)
 [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/service-engineering/fine_tune_whisper_th/runs/bmgk0qse)
 # whisper-small-th
 This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 17.0 dataset.
 ## Model description
@@ -50,6 +70,16 @@ The following hyperparameters were used during training:
 - training_steps: 4000
 - mixed_precision_training: Native AMP
 ### Framework versions
 - Transformers 4.41.0

 - generated_from_trainer
 datasets:
 - mozilla-foundation/common_voice_17_0
+metrics:
+- wer
 model-index:
 - name: whisper-small-th
+  results:
+  - task:
+      name: Automatic Speech Recognition
+      type: automatic-speech-recognition
+    dataset:
+      name: Common Voice 17.0
+      type: mozilla-foundation/common_voice_17_0
+      config: th
+      split: None
+      args: 'config: th, split: test'
+    metrics:
+    - name: Wer
+      type: wer
+      value: 64.85347250100362
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/service-engineering/fine_tune_whisper_th/runs/c58tla8j)
 [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/service-engineering/fine_tune_whisper_th/runs/bmgk0qse)
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/service-engineering/fine_tune_whisper_th/runs/bmgk0qse)
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/service-engineering/fine_tune_whisper_th/runs/ddw0ira7)
 # whisper-small-th
 This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 17.0 dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.1596
+- Wer: 64.8535
 ## Model description
 - training_steps: 4000
 - mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch  | Step | Validation Loss | Wer     |
+|:-------------:|:------:|:----:|:---------------:|:-------:|
+| 0.2535        | 0.7294 | 1000 | 0.2177          | 73.9061 |
+| 0.1453        | 1.4588 | 2000 | 0.1778          | 69.6909 |
+| 0.0923        | 2.1882 | 3000 | 0.1648          | 65.8303 |
+| 0.0781        | 2.9176 | 4000 | 0.1596          | 64.8535 |
 ### Framework versions
 - Transformers 4.41.0