Propicto
/

asr-wav2vec2-commonvoice-15-fr

Automatic Speech Recognition

Model card Files Files and versions Community

cecilemacaire commited on 6 days ago

Commit

3251ec3

·

verified ·

1 Parent(s): 43120cf

Update README.md

Files changed (1) hide show

README.md +5 -5

README.md CHANGED Viewed

@@ -28,7 +28,7 @@ The fine-tuned model achieves the following performance :
 |:-------------:|:--------------:|:--------------:| :--------:|:--------:|
 | 2023-09-08 | 9.14  | 11.21  | 4xV100 32GB | 30 |
-## Model Details
 The ASR system is composed of:
 - the **Tokenizer** (char) that transforms the input text into a sequence of characters ("cat" into ["c", "a", "t"]) and trained with the train transcriptions (train.tsv).
@@ -37,7 +37,7 @@ The final acoustic representation is given to the CTC greedy decode.
 We used recordings sampled at 16kHz (single channel).
-## How to transcribe a file with the model
 ### Install and import speechbrain
@@ -67,7 +67,7 @@ def main():
     save_transcript(transcript, audio, "out.txt")
 ```
-## Training Details
 ### Training Data
@@ -104,7 +104,7 @@ With 4xV100 32GB, the training took ~ 81 hours.
   }
 ```
-## Information
 - **Developed by:** Cécile Macaire
 - **Funded by [optional]:** GENCI-IDRIS (Grant 2023-AD011013625R1)
@@ -113,7 +113,7 @@ PROPICTO ANR-20-CE93-0005
 - **License:** Apache-2.0
 - **Finetuned from model:** LeBenchmark/wav2vec2-FR-7K-large
-## Citation
 ```bibtex
 @inproceedings{macaire24_interspeech,

 |:-------------:|:--------------:|:--------------:| :--------:|:--------:|
 | 2023-09-08 | 9.14  | 11.21  | 4xV100 32GB | 30 |
+## 📝 Model Details
 The ASR system is composed of:
 - the **Tokenizer** (char) that transforms the input text into a sequence of characters ("cat" into ["c", "a", "t"]) and trained with the train transcriptions (train.tsv).
 We used recordings sampled at 16kHz (single channel).
+## 💻 How to transcribe a file with the model
 ### Install and import speechbrain
     save_transcript(transcript, audio, "out.txt")
 ```
+## ⚙️ Training Details
 ### Training Data
   }
 ```
+## 💡 Information
 - **Developed by:** Cécile Macaire
 - **Funded by [optional]:** GENCI-IDRIS (Grant 2023-AD011013625R1)
 - **License:** Apache-2.0
 - **Finetuned from model:** LeBenchmark/wav2vec2-FR-7K-large
+## 📌 Citation
 ```bibtex
 @inproceedings{macaire24_interspeech,