Speakerfilter: Deep learning-based target speaker extraction using anchor speech S He, H Li, X Zhang ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 27 | 2020 |
DBNet: A dual-branch network architecture processing on spectrum and waveform for single-channel speech enhancement K Zhang, S He, H Li, X Zhang arXiv preprint arXiv:2105.02436, 2021 | 18 | 2021 |
Tea-pse 3.0: Tencent-ethereal-audio-lab personalized speech enhancement system for icassp 2023 dns-challenge Y Ju, J Chen, S Zhang, S He, W Rao, W Zhu, Y Wang, T Yu, S Shang ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 10 | 2023 |
Mc-spex: Towards effective speaker extraction with multi-scale interfusion and conditional speaker modulation J Chen, W Rao, Z Wang, J Lin, Y Ju, S He, Y Wang, Z Wu arXiv preprint arXiv:2306.16250, 2023 | 7 | 2023 |
Speaker recognition-assisted robust audio deepfake detection. J Pan, S Nie, H Zhang, S He, K Zhang, S Liang, X Zhang, J Tao INTERSPEECH, 4202-4206, 2022 | 7 | 2022 |
Speakerfilter-pro: an improved target speaker extractor combines the time domain and frequency domain S He, H Li, X Zhang 2022 13th International Symposium on Chinese Spoken Language Processing …, 2022 | 6 | 2022 |
Local-global speaker representation for target speaker extraction S He, W Rao, K Zhang, Y Ju, Y Yang, X Zhang, Y Wang, S Shang arXiv preprint arXiv:2210.15849, 2022 | 4 | 2022 |
A robust deep audio splicing detection method via singularity detection feature K Zhang, S Liang, S Nie, S He, J Pan, X Zhang, H Ma, J Yi ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 4 | 2022 |
Gesper: A unified framework for general speech restoration J Chen, Y Shi, W Liu, W Rao, S He, A Li, Y Wang, Z Wu, S Shang, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 3 | 2023 |
Speech enhancement with intelligent neural homomorphic synthesis S He, W Rao, J Liu, J Chen, Y Ju, X Zhang, Y Wang, S Shang ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 3 | 2023 |
ExARN: self-attending RNN for target speaker extraction P Shen, S He, X Zhang arXiv preprint arXiv:2212.01106, 2022 | 2 | 2022 |
Hierarchical Speaker Representation for Target Speaker Extraction S He, H Zhang, W Rao, K Zhang, Y Ju, Y Yang, X Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
3S-TSE: Efficient Three-Stage Target Speaker Extraction for Real-Time and Low-Resource Applications S He, J Liu, H Li, Y Yang, F Chen, X Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
SICRN: Advancing Speech Enhancement through State Space Model and Inplace Convolution Techniques C Zhao, S He, X Zhang arXiv preprint arXiv:2402.14225, 2024 | | 2024 |
ScaleFormer: Transformer-based speech enhancement in the multi-scale time domain T Wu, S He, H Zhang, XL Zhang 2023 Asia Pacific Signal and Information Processing Association Annual …, 2023 | | 2023 |
PDPCRN: Parallel Dual-Path CRN with Bi-directional Inter-Branch Interactions for Multi-Channel Speech Enhancement J Pan, S He, T Wu, H Zhang, X Zhang arXiv preprint arXiv:2309.10379, 2023 | | 2023 |
Hierarchical Modeling of Spatial Cues via Spherical Harmonics for Multi-Channel Speech Enhancement J Pan, S He, H Zhang, X Zhang arXiv preprint arXiv:2309.10393, 2023 | | 2023 |
Gesper: A Restoration-Enhancement Framework for General Speech Reconstruction W Liu, Y Shi, J Chen, W Rao, S He, A Li, Y Wang, Z Wu arXiv preprint arXiv:2306.08454, 2023 | | 2023 |
A Dual-branch Convolutional Network Architecture Processing on both Frequency and Time Domain for Single-channel Speech Enhancement K Zhang, S He, H Li, X Zhang APSIPA Transactions on Signal and Information Processing 12 (3), 2023 | | 2023 |
RAT: RNN-Attention Transformer for Speech Enhancement T Zhang, S He, H Li, X Zhang 2022 13th International Symposium on Chinese Spoken Language Processing …, 2022 | | 2022 |