ixxan committed on
Commit 7f26ff9 · verified · 1 Parent(s): cf250b7

Update README.md

  1. README.md +8 -4
README.md CHANGED
@@ -11,7 +11,7 @@ datasets:
  metrics:
  - wer
  model-index:
- - name: Whisper Small Uyghur Common Voice 15 (Subset ~5.5hr train data)
+ - name: Whisper Small Fine-tuned with Uyghur Common Voice
  results:
  - task:
  name: Automatic Speech Recognition
@@ -28,10 +28,14 @@ model-index:
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->

- # Whisper Small Uyghur Common Voice 15
+ # Whisper Small Fine-tuned with Uyghur Common Voice (Subset ~5.5hrs train data)

- This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 15 dataset.
- It achieves the following results on the evaluation set:
+ This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Uyghur Common Voice dataset.
+
+ As a proof-of-concept, only 3264 recordings (~5.5 hrs of audio) were used for training, and 937 recordings (~1.5 hrs of audio) were used for validation.
+ You may find the full dataset for Uyghur and other languages here: https://commonvoice.mozilla.org/en/datasets.
+
+ This model achieves the following results on the evaluation set:
  - Loss: 0.5105
  - Wer Ortho: 41.6377
  - Wer: 34.9961
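
Reviewer note: the card reports two word error rates without saying how they differ. A minimal sketch follows, assuming "Wer Ortho" is computed on the raw transcripts and "Wer" after text normalization; the card itself does not confirm this. The `normalize` function and the sample sentences are invented for illustration and are not the normalizer or data used in training. Note that the regex below would also strip combining marks, which may not be appropriate for Uyghur text.

```python
import re


def word_errors(reference, hypothesis):
    """Return (word-level edit distance, number of reference words)."""
    ref = reference.split()
    hyp = hypothesis.split()
    # Levenshtein distance over words, keeping only one row at a time.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1], len(ref)


def wer(references, hypotheses):
    """Corpus-level WER in percent: total word errors / total reference words."""
    errors = total = 0
    for ref, hyp in zip(references, hypotheses):
        e, n = word_errors(ref, hyp)
        errors += e
        total += n
    return 100.0 * errors / total


def normalize(text):
    # Hypothetical normalizer (an assumption): lowercase, drop anything
    # that is neither a word character nor whitespace.
    return re.sub(r"[^\w\s]", "", text.lower())


# Invented sample data, for illustration only.
refs = ["Hello, world!", "good morning"]
hyps = ["hello world", "good morning"]

wer_ortho = wer(refs, hyps)
wer_norm = wer([normalize(r) for r in refs], [normalize(h) for h in hyps])
print(f"Wer Ortho: {wer_ortho:.4f}  Wer: {wer_norm:.4f}")
```

Under that assumption, the gap between the two reported figures (41.6377 vs 34.9961) would reflect errors in casing and punctuation rather than in the recognized words themselves.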