airesearch
/

wav2vec2-large-xlsr-53-th

Automatic Speech Recognition

hf-asr-leaderboard

robust-speech-event

xlsr-fine-tuning

Inference Endpoints

Model card Files Files and versions Community

cstorm125 commited on Jan 20, 2022

Commit

71ded5c

·

1 Parent(s): fc25f0f

Update README.md

Files changed (1) hide show

README.md +16 -0

README.md CHANGED Viewed

@@ -18,6 +18,22 @@ Finetuning `wav2vec2-large-xlsr-53` on Thai [Common Voice 7.0](https://commonvoi
 We finetune [wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) based on [Fine-tuning Wav2Vec2 for English ASR](https://colab.research.google.com/github/patrickvonplaten/notebooks/blob/master/Fine_tuning_Wav2Vec2_for_English_ASR.ipynb) using Thai examples of [Common Voice Corpus 7.0](https://commonvoice.mozilla.org/en/datasets). The notebooks and scripts can be found in [vistec-ai/wav2vec2-large-xlsr-53-th](https://github.com/vistec-ai/wav2vec2-large-xlsr-53-th). The pretrained model and processor can be found at [airesearch/wav2vec2-large-xlsr-53-th](https://huggingface.co/airesearch/wav2vec2-large-xlsr-53-th).
 ## Usage
 ```

 We finetune [wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) based on [Fine-tuning Wav2Vec2 for English ASR](https://colab.research.google.com/github/patrickvonplaten/notebooks/blob/master/Fine_tuning_Wav2Vec2_for_English_ASR.ipynb) using Thai examples of [Common Voice Corpus 7.0](https://commonvoice.mozilla.org/en/datasets). The notebooks and scripts can be found in [vistec-ai/wav2vec2-large-xlsr-53-th](https://github.com/vistec-ai/wav2vec2-large-xlsr-53-th). The pretrained model and processor can be found at [airesearch/wav2vec2-large-xlsr-53-th](https://huggingface.co/airesearch/wav2vec2-large-xlsr-53-th).
+## `robust-speech-event`
+Add `syllable_tokenize`, `word_tokenize` ([PyThaiNLP](https://github.com/PyThaiNLP/pythainlp)) and [deepcut](https://github.com/rkcosmos/deepcut) tokenizers to `eval.py` from [robust-speech-event](https://github.com/huggingface/transformers/tree/master/examples/research_projects/robust-speech-event#evaluation)
+```
+> python eval.py --model_id ./ --dataset mozilla-foundation/common_voice_7_0 --config th --split test --log_outputs --thai_tokenizer newmm/syllable/deepcut/cer
+```
+### Eval results on Common Voice 7 "test":
+|                                 | WER PyThaiNLP 2.3.1 | WER deepcut | SER     | CER     |
+|---------------------------------|---------------------|-------------|---------|---------|
+| Only Tokenization               | 0.9524%             | 2.5316%     | 1.2346% | 0.1623% |
+| Cleaning rules and Tokenization | TBD                 | TBD         | TBD     | TBD     |
 ## Usage
 ```