Recent Developments on ESPnet Toolkit Boosted by Conformer P Guo, F Boyer, X Chang, T Hayashi, Y Higuchi, H Inaguma, N Kamo, C Li, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 299 | 2021 |
Wenetspeech: A 10000+ hours multi-domain mandarin corpus for speech recognition B Zhang, H Lv, P Guo, Q Shao, C Yang, L Xie, X Xu, H Bu, X Chen, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 197 | 2022 |
M2MeT: The ICASSP 2022 multi-channel multi-party meeting transcription challenge F Yu, S Zhang, Y Fu, L Xie, S Zheng, Z Du, W Huang, P Guo, Z Yan, B Ma, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 92 | 2022 |
An exploration of self-supervised pretrained representations for end-to-end speech recognition X Chang, T Maekaku, P Guo, J Shi, YJ Lu, AS Subramanian, T Wang, ... 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 85 | 2021 |
Inaudible adversarial perturbations for targeted attack in speaker recognition Q Wang, P Guo, L Xie arXiv preprint arXiv:2005.10637, 2020 | 62 | 2020 |
The 2020 espnet update: new features, broadened applications, performance improvements, and future plans S Watanabe, F Boyer, X Chang, P Guo, T Hayashi, Y Higuchi, T Hori, ... 2021 IEEE Data Science and Learning Workshop (DSLW), 1-6, 2021 | 56 | 2021 |
Study of semi-supervised approaches to improving english-mandarin code-switching speech recognition P Guo, H Xu, L Xie, ES Chng arXiv preprint arXiv:1806.06200, 2018 | 48 | 2018 |
Adversarial Regularization for End-to-End Robust Speaker Verification. Q Wang, P Guo, S Sun, L Xie, JHL Hansen Interspeech, 4010-4014, 2019 | 45 | 2019 |
Adversarial regularization for attention based end-to-end robust speech recognition S Sun, P Guo, L Xie, MY Hwang IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (11 …, 2019 | 36 | 2019 |
Exploring speech recognition, translation, and understanding with discrete speech units: A comparative study X Chang, B Yan, K Choi, JW Jung, Y Lu, S Maiti, R Sharma, J Shi, J Tian, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 31 | 2024 |
Summary on the ICASSP 2022 multi-channel multi-party meeting transcription grand challenge F Yu, S Zhang, P Guo, Y Fu, Z Du, S Zheng, W Huang, L Xie, ZH Tan, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 30 | 2022 |
Sequence to multi-sequence learning via conditional chain mapping for mixture signals J Shi, X Chang, P Guo, S Watanabe, Y Fujita, J Xu, B Xu, L Xie Advances in Neural Information Processing Systems 33, 3735-3747, 2020 | 25 | 2020 |
ESPnet-ST IWSLT 2021 Offline Speech Translation System H Inaguma, B Yan, S Dalmia, P Guo, J Shi, K Duh, S Watanabe arXiv preprint arXiv:2107.00636, 2021 | 20 | 2021 |
End-to-End ASR with Adaptive Span Self-Attention. X Chang, AS Subramanian, P Guo, S Watanabe, Y Fujita, M Omachi INTERSPEECH, 3595-3599, 2020 | 20 | 2020 |
Boundary and context aware training for cif-based non-autoregressive end-to-end asr F Yu, H Luo, P Guo, Y Liang, Z Yao, L Xie, Y Gao, L Hou, S Zhang 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 18 | 2021 |
Multi-speaker ASR combining non-autoregressive conformer CTC and conditional speaker chain P Guo, X Chang, S Watanabe, L Xie arXiv preprint arXiv:2106.08595, 2021 | 18 | 2021 |
Contextualized end-to-end speech recognition with contextual phrase prediction network K Huang, A Zhang, Z Yang, P Guo, B Mu, T Xu, L Xie arXiv preprint arXiv:2305.12493, 2023 | 17 | 2023 |
NWPU-ASLP system for the voiceprivacy 2022 challenge J Yao, Q Wang, L Zhang, P Guo, Y Liang, L Xie arXiv preprint arXiv:2209.11969, 2022 | 16 | 2022 |
Adversarial training for multi-domain speaker recognition Q Wang, W Rao, P Guo, L Xie 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021 | 13 | 2021 |
MFCCA: Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario F Yu, S Zhang, P Guo, Y Liang, Z Du, Y Lin, L Xie 2022 IEEE Spoken Language Technology Workshop (SLT), 144-151, 2023 | 12 | 2023 |