Следене
Shi-Xiong (Austin) Zhang
Shi-Xiong (Austin) Zhang
Други именаShi-Xiong Zhang, Shixiong Zhang
Sr. Director | AI Foundations@Capital One | ex-Microsoft, ex-Tencent, Cambridge PhD
Потвърден имейл адрес: capitalone.com
Заглавие
Позовавания
Позовавания
Година
An overview of deep-learning-based audio-visual speech enhancement and separation
D Michelsanti, ZH Tan, SX Zhang, Y Xu, M Yu, D Yu, J Jensen
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1368-1396, 2021
2632021
End-to-end attention based text-dependent speaker verification
SX Zhang, Z Chen, Y Zhao, J Li, Y Gong
2016 IEEE Spoken Language Technology Workshop (SLT), 171-178, 2016
2062016
ADL-MVDR: All deep learning MVDR beamformer for target speech separation
Z Zhang, Y Xu, M Yu, SX Zhang, L Chen, D Yu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
1312021
Time Domain Audio Visual Speech Separation
J Wu, Y Xu, SX Zhang, LW Chen, M Yu, L Xie, D Yu
Automatic Speech Recognition and Understanding Workshop, ASRU 2019,, 2019
1302019
Computerized intelligent assistant for conferences
A Diamant, KM Ben-Dor, E Krupka, R Halaly, Y Smolin, I Gurvich, ...
US Patent 10,867,610, 2020
1142020
Multi-modal multi-channel target speech separation
R Gu, SX Zhang, Y Xu, L Chen, Y Zou, D Yu
IEEE Journal of Selected Topics in Signal Processing 14 (3), 530-541, 2020
1112020
Audio-visual Recognition of Overlapped speech for the LRS2 dataset
J Yu, SX Zhang, J Wu, S Ghorbani, B Wu, S Kang, S Liu, X Liu, H Meng, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1032020
Investigation of Multilingual Deep Neural Networks for Spoken Term Detection
K Knill, MJF Gales, S Rath, P Woodland, SX Zhang
ASRU, 2013
1022013
SIMPLIFYING LONG SHORT-TERM MEMORY ACOUSTIC MODELS FOR FAST TRAINING AND DECODING
Y Miao, J Li, Y Wang, S Zhang, Y Gong
ICASSP, 2016
1002016
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information
R Gu, L Chen, SX Zhang, J Zheng, Y Xu, M Yu, D Su, Y Zou, D Yu
992019
A comprehensive study of speech separation: spectrogram vs waveform separation
F Bahmaninezhad, J Wu, R Gu, SX Zhang, Y Xu, M Yu, D Yu
arXiv preprint arXiv:1905.07497, 2019
922019
End-to-end multi-channel speech separation
R Gu, J Wu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu
arXiv preprint arXiv:1905.06286, 2019
912019
New era for robust speech recognition: exploiting deep learning
S Watanabe, M Delcroix, F Metze, JR Hershey, et al.
Springer, 2017
64*2017
Enhancing End-to-End Multi-Channel Speech Separation Via Spatial Feature Learning
R Gu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
632020
Audio-visual speech separation and dereverberation with a two-stage multimodal network
K Tan, Y Xu, SX Zhang, M Yu, D Yu
IEEE Journal of Selected Topics in Signal Processing 14 (3), 542-553, 2020
602020
FAST-RIR: Fast neural diffuse room impulse response generator
A Ratnarajah, SX Zhang, M Yu, Z Tang, D Manocha, D Yu
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
572022
Structured SVMs for automatic speech recognition
SX Zhang, MJF Gales
IEEE Transactions on Audio, Speech, and Language Processing 21 (3), 544-555, 2012
502012
DEEP NEURAL SUPPORT VECTOR MACHINES FOR SPEECH RECOGNITION
SX Zhang, C Liu, K Yao, Y Gong
ICASSP 2015, 2015
462015
Neural Spatio-Temporal Beamformer for Target Speech Separation
Y Xu, M Yu, SX Zhang, L Chen, C Weng, J Liu, D Yu
arXiv preprint arXiv:2005.03889, 2020
432020
Far-Field Location Guided Target Speech Extraction Using End-to-End Speech Recognition Objectives
AS Subramanian, C Weng, M Yu, SX Zhang, Y Xu, S Watanabe, D Yu
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
422020
Системата не може да изпълни операцията сега. Опитайте отново по-късно.
Статии 1–20