Update README.md
README.md
CHANGED
@@ -17,11 +17,11 @@ tags:
 
 ## Ichigo Whisper
 
-Ichigo Whisper is a compact (22M parameters), open-source
+Ichigo Whisper is a compact (22M parameters), open-source speech tokenizer for the `Whisper-medium` model, designed to improve performance on multilingual speech with minimal impact on its original English capabilities. Unlike models that output continuous embeddings, Ichigo Whisper compresses speech into discrete tokens, making it more compatible with large language models (LLMs) for immediate speech understanding.
 
-This
+This speech tokenizer has been trained on ~400 hours of English data and ~1000 hours of Vietnamese data.
 
-Ichigo Whisper is a key component of the Ichigo v0.5 family.
+Ichigo Whisper is a key component of the [Ichigo v0.5 family]().
 
 For more details, please refer to our official [blog post]().
 
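The README text in this change contrasts continuous embeddings with discrete tokens. The sketch below illustrates that idea in general terms using nearest-neighbour lookup against a codebook (vector quantization). It is not Ichigo Whisper's actual implementation: the `quantize` helper, the 2-D codebook, and the frame values are all hypothetical and chosen only to keep the example small.

```python
def quantize(frames, codebook):
    """Map each continuous frame vector to the index of its nearest codebook entry."""
    tokens = []
    for frame in frames:
        # Squared Euclidean distance from this frame to every codebook vector.
        distances = [
            sum((f - c) ** 2 for f, c in zip(frame, code))
            for code in codebook
        ]
        # The token is the index of the closest codebook vector.
        tokens.append(distances.index(min(distances)))
    return tokens


# Hypothetical 2-D codebook with four entries; a real tokenizer would learn these.
codebook = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]

# Hypothetical continuous encoder outputs, one vector per audio frame.
frames = [(0.1, -0.2), (0.9, 0.8), (0.2, 1.1)]

tokens = quantize(frames, codebook)
print(tokens)  # [0, 3, 2]
```

Each frame becomes a single integer, which is the kind of sequence an LLM can consume in the same way as text token IDs.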