ixxan
/

whisper-small-common-voice-ug

Automatic Speech Recognition

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

ixxan commited on Nov 25, 2024

Commit

7f26ff9

·

verified ·

1 Parent(s): cf250b7

Update README.md

Files changed (1) hide show

README.md +8 -4

README.md CHANGED Viewed

@@ -11,7 +11,7 @@ datasets:
 metrics:
 - wer
 model-index:
-- name: Whisper Small Uyghur Common Voice 15 (Subset ~5.5hr train data)
   results:
   - task:
       name: Automatic Speech Recognition
@@ -28,10 +28,14 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# Whisper Small Uyghur Common Voice 15
-This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 15 dataset.
-It achieves the following results on the evaluation set:
 - Loss: 0.5105
 - Wer Ortho: 41.6377
 - Wer: 34.9961

 metrics:
 - wer
 model-index:
+- name: Whisper Small Fine-tuned with Uyghur Common Voice
   results:
   - task:
       name: Automatic Speech Recognition
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# Whisper Small Fine-tuned with Uyghur Common Voice (Subset ~5.5hrs train data)
+This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Uyghur Common Voice dataset.
+As a proof-of-concept, only 3264 recordings (~5.5 hrs of audio) were used for training, and 937 recordings (~1.5 hrs of audio) were used for validation.
+You may find the full dataset for Uyghur and other languages here: https://commonvoice.mozilla.org/en/datasets.
+This model achieves the following results on the evaluation set:
 - Loss: 0.5105
 - Wer Ortho: 41.6377
 - Wer: 34.9961