wyz/vctk_dns2020_whamr_bsrnn_large_noncausal

wyz committed on Aug 6, 2024

Commit ee8fce3 (verified) · 1 Parent(s): cdbc190

Update README.md

Files changed (1)

README.md +6 -19

@@ -4,8 +4,6 @@ tags:
 - audio
 - audio-to-audio
 language: en
-datasets:
-- universal_se
 license: cc-by-4.0
 ---
@@ -13,7 +11,7 @@ license: cc-by-4.0
 ### `wyz/vctk_dns2020_whamr_bsrnn_large_noncausal`
-This model was trained by Emrys365 using universal_se recipe in [espnet](https://github.com/espnet/espnet/).
 ### Demo: How to use in ESPnet2
@@ -28,19 +26,19 @@ from espnet2.bin.enh_inference import SeparateSpeech
 # For model downloading + loading
 model = SeparateSpeech.from_pretrained(
-    model_tag=wyz/vctk_dns2020_whamr_bsrnn_large_noncausal,
     normalize_output_wav=True,
-    device=cuda,
 )
 # For loading a downloaded model
 # model = SeparateSpeech(
-#     train_config=exp_vctk_dns20_whamr/enh_train_enh_bsrnn_large_noncausal_raw/config.yaml,
-#     model_file=exp_vctk_dns20_whamr/enh_train_enh_bsrnn_large_noncausal_raw/xxxx.pth,
 #     normalize_output_wav=True,
 #     device=cuda,
 # )
-audio, fs = sf.read(/path/to/noisy/utt1.flac)
 enhanced = model(audio[None, :], fs=fs)[0]
 ```
@@ -67,17 +65,6 @@ enhanced = model(audio[None, :], fs=fs)[0]
 |reverb_et_simu_8ch_multich|2.29|94.59|10.87|10.87|0.00|-8.41|3.12|3.49|3.82|3.83|
 |whamr_tt_mix_single_reverb_max_16k|2.34|94.47|11.98|11.98|0.00|10.41|3.27|3.52|4.10|3.80|
-module
-<!-- Generated by ./scripts/utils/show_enh_score.sh -->
-# RESULTS
-## Environments
-- date: `Thu Jan 11 22:52:46 EST 2024`
-- python version: `3.8.16 (default, Mar  2 2023, 03:21:46)  [GCC 11.2.0]`
-- espnet version: `espnet 202304`
-- pytorch version: `pytorch 2.0.1+cu118`
-- Git hash: `443028662106472c60fe8bd892cb277e5b488651`
-  - Commit date: `Thu May 11 03:32:59 2023 +0000`
 ## enhanced_test_48k

 - audio
 - audio-to-audio
 language: en
 license: cc-by-4.0
 ---
 ### `wyz/vctk_dns2020_whamr_bsrnn_large_noncausal`
+This model was trained by Emrys365 based on the universal_se_v1 recipe in [espnet](https://github.com/espnet/espnet/).
 ### Demo: How to use in ESPnet2
 # For model downloading + loading
 model = SeparateSpeech.from_pretrained(
+    model_tag="wyz/vctk_dns2020_whamr_bsrnn_large_noncausal",
     normalize_output_wav=True,
+    device="cuda",
 )
 # For loading a downloaded model
 # model = SeparateSpeech(
+#     train_config="exp_vctk_dns20_whamr/enh_train_enh_bsrnn_large_noncausal_raw/config.yaml",
+#     model_file="exp_vctk_dns20_whamr/enh_train_enh_bsrnn_large_noncausal_raw/xxxx.pth",
 #     normalize_output_wav=True,
 #     device=cuda,
 # )
+audio, fs = sf.read("/path/to/noisy/utt1.flac")
 enhanced = model(audio[None, :], fs=fs)[0]
 ```
 |reverb_et_simu_8ch_multich|2.29|94.59|10.87|10.87|0.00|-8.41|3.12|3.49|3.82|3.83|
 |whamr_tt_mix_single_reverb_max_16k|2.34|94.47|11.98|11.98|0.00|10.41|3.27|3.52|4.10|3.80|
 ## enhanced_test_48k