DipsankarSinha/wav2vec2-large-xls-r-300m-amharic-demo-colab

@@ -1,18 +1,18 @@
 ---
 base_model: facebook/wav2vec2-xls-r-300m
 datasets:
 - common_voice_16_1
-license: apache-2.0
 metrics:
 - wer
-tags:
-- generated_from_trainer
 model-index:
 - name: wav2vec2-large-xls-r-300m-amharic-demo-colab
   results:
   - task:
-      type: automatic-speech-recognition
       name: Automatic Speech Recognition
     dataset:
       name: common_voice_16_1
       type: common_voice_16_1
@@ -20,9 +20,9 @@ model-index:
       split: test
       args: am
     metrics:
-    - type: wer
-      value: 0.8992661774516344
-      name: Wer
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the common_voice_16_1 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.6489
-- Wer: 0.8993
 ## Model description
@@ -53,33 +53,33 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 0.0003
-- train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 - gradient_accumulation_steps: 2
-- total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 100
-- num_epochs: 30
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Wer    |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
-| 12.8229       | 2.5   | 100  | 4.1682          | 1.0    |
-| 4.1232        | 5.0   | 200  | 4.0821          | 1.0    |
-| 4.0475        | 7.5   | 300  | 4.0087          | 1.0    |
-| 3.9841        | 10.0  | 400  | 3.9677          | 1.0    |
-| 3.9469        | 12.5  | 500  | 3.9503          | 1.0    |
-| 3.7544        | 15.0  | 600  | 3.3452          | 1.0    |
-| 2.1016        | 17.5  | 700  | 1.8871          | 0.9800 |
-| 0.9969        | 20.0  | 800  | 1.7061          | 0.9813 |
-| 0.6112        | 22.5  | 900  | 1.6420          | 0.9513 |
-| 0.4384        | 25.0  | 1000 | 1.6287          | 0.9466 |
-| 0.3355        | 27.5  | 1100 | 1.6593          | 0.9273 |
-| 0.293         | 30.0  | 1200 | 1.6489          | 0.8993 |
 ### Framework versions

 ---
+license: apache-2.0
 base_model: facebook/wav2vec2-xls-r-300m
+tags:
+- generated_from_trainer
 datasets:
 - common_voice_16_1
 metrics:
 - wer
 model-index:
 - name: wav2vec2-large-xls-r-300m-amharic-demo-colab
   results:
   - task:
       name: Automatic Speech Recognition
+      type: automatic-speech-recognition
     dataset:
       name: common_voice_16_1
       type: common_voice_16_1
       split: test
       args: am
     metrics:
+    - name: Wer
+      type: wer
+      value: 0.8639092728485657
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the common_voice_16_1 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.6333
+- Wer: 0.8639
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 0.0003
+- train_batch_size: 16
 - eval_batch_size: 8
 - seed: 42
 - gradient_accumulation_steps: 2
+- total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 100
+- num_epochs: 60
 - mixed_precision_training: Native AMP
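
The batch-size figures in the list above can be sanity-checked with a little arithmetic. The sketch below is illustrative only: it assumes a single training device, and the helper names `effective_batch_size` and `steps_per_epoch` are hypothetical, not part of the Trainer API. It uses the hyperparameters from both sides of this change and the first logged row of each results table.

```python
def effective_batch_size(per_device_batch: int, grad_accum_steps: int, num_devices: int = 1) -> int:
    """Samples consumed per optimizer step (hypothetical helper)."""
    return per_device_batch * grad_accum_steps * num_devices


def steps_per_epoch(step: int, epoch: float) -> float:
    """Optimizer steps per epoch implied by one (step, epoch) row of the results table."""
    return step / epoch


# Values copied from the card:
# (train_batch_size, gradient_accumulation_steps, first logged step, epoch at that step)
runs = {
    "previous": (8, 2, 100, 2.5),
    "updated": (16, 2, 100, 5.0),
}

summary = {}
for name, (batch, accum, step, epoch) in runs.items():
    total = effective_batch_size(batch, accum)  # assumption: one device
    per_epoch = steps_per_epoch(step, epoch)
    # Rough estimate only: the last batch of an epoch may be smaller than `total`,
    # and the epoch values in the table are rounded.
    approx_samples = total * per_epoch
    summary[name] = (total, per_epoch, approx_samples)
    print(f"{name}: total_train_batch_size={total}, "
          f"steps/epoch={per_epoch:g}, ~samples/epoch={approx_samples:g}")
```

Under these assumptions both runs take 1200 optimizer steps in total. Doubling `train_batch_size` halves the steps per epoch, which is why `num_epochs` goes from 30 to 60 while the step column of the results table stays the same.
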
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Wer    |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
+| 12.6948       | 5.0   | 100  | 4.1621          | 1.0    |
+| 4.1026        | 10.0  | 200  | 4.0365          | 1.0    |
+| 4.0037        | 15.0  | 300  | 3.9726          | 1.0007 |
+| 3.9485        | 20.0  | 400  | 3.9524          | 1.0007 |
+| 3.4635        | 25.0  | 500  | 2.4384          | 0.9980 |
+| 1.1709        | 30.0  | 600  | 1.6987          | 0.9453 |
+| 0.4955        | 35.0  | 700  | 1.5927          | 0.9073 |
+| 0.3163        | 40.0  | 800  | 1.6750          | 0.8833 |
+| 0.2372        | 45.0  | 900  | 1.6683          | 0.8813 |
+| 0.1896        | 50.0  | 1000 | 1.6555          | 0.8779 |
+| 0.1619        | 55.0  | 1100 | 1.6312          | 0.8819 |
+| 0.1473        | 60.0  | 1200 | 1.6333          | 0.8639 |
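
The Wer column above briefly exceeds 1.0 (1.0007 at steps 300 and 400). That is possible because word error rate counts insertions as well as substitutions and deletions, divided by the number of reference words. The following is a minimal pure-Python sketch of that definition, not the metric implementation used to produce this card; the function name `word_error_rate` and the example strings are made up for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the word-level edit distance between the reference
    # prefix processed so far and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical examples, not taken from the evaluation set.
perfect = word_error_rate("a b c", "a b c")
half = word_error_rate("a b c d", "a x c")    # one substitution, one deletion
above_one = word_error_rate("a b", "a x b y z")  # three insertions, two reference words
```

Here `above_one` evaluates to 1.5, which shows how a model that emits extra tokens early in training can score slightly worse than the 1.0 obtained from empty predictions.
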
 ### Framework versions