Update README.md
README.md
CHANGED
@@ -53,7 +53,52 @@ The test set was similarly used for validation.

 ## Training procedure

-The script used to perform the training is included in the files of this space:
+The script used to perform the training `run_speech_recognition_seq2seq_streaming.py` is included in the files of this space with the following arguments:
+
+```
+--model_name_or_path "openai/whisper-small"
+--model_revision "main"
+--do_train True
+--do_eval True
+--use_auth_token False
+--freeze_encoder True
+--model_index_name "Whisper Small - Greek (el)"
+--dataset_name "mozilla-foundation/common_voice_11_0"
+--dataset_config_name "el"
+--audio_column_name "audio"
+--text_column_name "sentence"
+--max_duration_in_seconds 30
+--train_split_name "train+validation"
+--eval_split_name "test"
+--do_lower_case False
+--do_remove_punctuation False
+--do_normalize_eval True
+--language "greek"
+--task "translate"
+--shuffle_buffer_size 500
+--output_dir "./data/finetuningRuns/whisper-sm-el-frzEnc-xlate"
+--per_device_train_batch_size 16
+--gradient_accumulation_steps 4
+--learning_rate 1e-5
+--warmup_steps 500
+--max_steps 5000
+--gradient_checkpointing True
+--fp16 True
+--evaluation_strategy "steps"
+--per_device_eval_batch_size 8
+--predict_with_generate True
+--generation_max_length 225
+--save_steps 1000
+--eval_steps 1000
+--logging_steps 25
+--report_to "tensorboard"
+--load_best_model_at_end True
+--metric_for_best_model "wer"
+--greater_is_better False
+--push_to_hub False
+--overwrite_output_dir True
+
+```

 ### Training hyperparameters

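As a reading aid for the arguments added in this diff, here is a minimal Python sketch of the quantities they imply. The numeric values are copied from the argument list; `num_devices = 1` is an assumption, since the README does not say how many devices were used, and the variable names are chosen only for illustration.

```python
# Values copied from the training arguments added in this diff.
per_device_train_batch_size = 16
gradient_accumulation_steps = 4
max_steps = 5000
warmup_steps = 500
eval_steps = 1000

# Assumption (not stated in the README): training ran on a single device.
num_devices = 1

# Examples contributing to each optimizer update.
effective_batch_size = (
    per_device_train_batch_size * gradient_accumulation_steps * num_devices
)

# Examples processed over the whole run, counting any repeats.
examples_processed = effective_batch_size * max_steps

# Share of the run spent in learning-rate warmup.
warmup_fraction = warmup_steps / max_steps

# Steps at which evaluation is triggered with evaluation_strategy "steps".
eval_points = list(range(eval_steps, max_steps + 1, eval_steps))

print(f"effective batch size: {effective_batch_size}")
print(f"examples processed:   {examples_processed}")
print(f"warmup fraction:      {warmup_fraction:.0%}")
print(f"evaluation steps:     {eval_points}")
```

Under the single-device assumption, each optimizer update uses 64 examples. With more devices the effective batch size scales accordingly.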