thesven committed on
Commit a93f4ae · 1 Parent(s): cb643a8

Create README.md

  1. README.md +146 -0
README.md ADDED
@@ -0,0 +1,146 @@
+ ---
+ datasets:
+ - thesven/bengali-ai-train-set-tiny
+ language:
+ - bn
+ metrics:
+ - wer
+ ---
+ # thesven/whisper-tiny-bn-thesven
+
+ ## Model Description
+
+ This model is `openai/whisper-tiny` fine-tuned on the OOD-Speech Bengali dataset to improve Bengali speech recognition, particularly on out-of-distribution samples.
+
+ ## Model Details
+
+ - **Original Model:** [openai/whisper-tiny on Hugging Face](https://huggingface.co/openai/whisper-tiny)
+ - **Fine-tuned Model:** [thesven/whisper-tiny-bn-thesven on Hugging Face](https://huggingface.co/thesven/whisper-tiny-bn-thesven)
+ - **Model Architecture:** Whisper ASR
+
+ ## Intended Use
+
+ This model is intended for automatic speech recognition (ASR) in Bengali, particularly in scenarios that involve out-of-distribution samples.
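
The intended use above can be sketched with the Hugging Face `transformers` ASR pipeline. This is a minimal sketch, not a documented usage recipe for this checkpoint: it assumes that `transformers`, a PyTorch backend, and `ffmpeg` are installed, and that the model repository ships the tokenizer and feature-extractor files alongside the weights. The helper name `transcribe` and the file `sample.wav` are hypothetical.

```python
MODEL_ID = "thesven/whisper-tiny-bn-thesven"  # repository name taken from this model card


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe one audio file and return the recognized text.

    Hypothetical helper: assumes the repository contains everything the
    pipeline needs (weights, tokenizer, feature extractor).
    """
    # Imported lazily so this module can be imported without transformers installed.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id)
    result = asr(audio_path)  # a file path is decoded with ffmpeg by the pipeline
    return result["text"]
```

Calling `transcribe("sample.wav")` on a local Bengali recording would download the checkpoint on first use and return the transcription as a string.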
+
+ ## Training Data
+
+ The model was fine-tuned on the OOD-Speech Bengali dataset:
+
+ - **Training Samples:** 10,000
+ - **Validation Samples:** 1,000
+ - **Dataset Source:** [OOD-Speech: A Large Bengali Speech Recognition Dataset for Out-of-Distribution Benchmarking](https://arxiv.org/abs/2305.09688)
+
+ ## Evaluation Data
+
+ [Provide details about the evaluation dataset used, if different from the validation set of the training data.]
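
The metadata of this card lists `wer` (word error rate) as the metric. The card does not say which implementation was used, so the following is only an illustrative sketch of the standard definition: the word-level edit distance (substitutions, deletions, insertions) between reference and hypothesis, divided by the number of reference words. The function name `word_error_rate` is hypothetical, and whitespace tokenization is an assumption.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# One substituted word out of four reference words.
print(word_error_rate("আমি বাংলায় গান গাই", "আমি বাংলা গান গাই"))  # 0.25
```

A lower value is better, and the value can exceed 1.0 when the hypothesis contains many inserted words.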
+
+ ## Fine-tuning Procedure
+
+ The model was fine-tuned on the OOD-Speech Bengali dataset with [specific hyperparameters, optimizer, learning rate, epochs, etc. if available].
+
+ ## Licensing
+
+ This model is released under the MIT License. You are free to use, modify, distribute, or sell it under the terms of the license.
+
+ ```
+ MIT License
+
+ Copyright (c) 2023 thesven
+
+ Permission is hereby granted, free of charge, to any person obtaining a copy
+ of this software and associated documentation files (the "Software"), to deal
+ in the Software without restriction, including without limitation the rights
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+ copies of the Software, and to permit persons to whom the Software is
+ furnished to do so, subject to the following conditions:
+
+ The above copyright notice and this permission notice shall be included in all
+ copies or substantial portions of the Software.
+
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+ SOFTWARE.
+ ```
+
+ ## Citation
+
+ ```
+ @article{OOD-Speech2023,
+   title={OOD-Speech: A Large Bengali Speech Recognition Dataset for Out-of-Distribution Benchmarking},
+   author={Authors of the OOD-Speech paper},
+   journal={arXiv preprint arXiv:2305.09688},
+   year={2023}
+ }
+ ```