Update README.md
README.md
CHANGED
@@ -20,7 +20,7 @@ padding: 0;
 
 The NeMo Mel Codec is a neural audio codec which compresses mel-spectrograms into a quantized representation and reconstructs audio. The model can be used as a vocoder for speech synthesis.
 
-The model works with full-bandwidth 44.1kHz speech. It might have lower performance with low-bandwidth speech (e.g. 16kHz speech upsampled to 44.
+The model works with full-bandwidth 44.1kHz speech. It might have lower performance with low-bandwidth speech (e.g. 16kHz speech upsampled to 44.1kHz) or with non-speech audio.
 
 | Sample Rate | Frame Rate | Bit Rate | # Codebooks | Codebook Size | Embed Dim | FSQ Levels |
 |:-----------:|:----------:|:----------:|:-----------:|:-------------:|:-----------:|:------------:|