Automatic speaker age and gender recognition using acoustic and prosodic level information fusion M Li, KJ Han, S Narayanan Computer Speech & Language 27 (1), 151-167, 2013 | 222 | 2013 |
A review of speaker diarization: Recent advances with deep learning TJ Park, N Kanda, D Dimitriadis, KJ Han, S Watanabe, S Narayanan Computer Speech & Language 72, 101317, 2022 | 206 | 2022 |
Auto-tuning spectral clustering for speaker diarization using normalized maximum eigengap TJ Park, KJ Han, M Kumar, S Narayanan IEEE Signal Processing Letters 27, 381-385, 2019 | 96 | 2019 |
The CAPIO 2017 conversational speech recognition system KJ Han, A Chandrashekaran, J Kim, I Lane arXiv preprint arXiv:1801.00059, 2017 | 86 | 2017 |
Strategies to improve the robustness of agglomerative hierarchical clustering under data source variation for speaker diarization KJ Han, S Kim, SS Narayanan IEEE Transactions on Audio, Speech, and Language Processing 16 (8), 1590-1601, 2008 | 75 | 2008 |
State-of-the-art speech recognition using multi-stream self-attention with dilated 1D convolutions KJ Han, R Prieto, T Ma 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 54-61, 2019 | 70 | 2019 |
Robust language identification using convolutional neural network features S Ganapathy, K Han, S Thomas, M Omar, MV Segbroeck, SS Narayanan Fifteenth annual conference of the international speech communication …, 2014 | 66 | 2014 |
A robust stopping criterion for agglomerative hierarchical clustering in a speaker diarization system. KJ Han, SS Narayanan Interspeech, 1853-1856, 2007 | 54 | 2007 |
Combining five acoustic level modeling methods for automatic speaker age and gender recognition M Li, CS Jung, KJ Han Eleventh annual conference of the international speech communication association, 2010 | 44 | 2010 |
Multistream CNN for robust acoustic modeling KJ Han, J Pan, VKN Tadala, T Ma, D Povey ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 37 | 2021 |
SLUE: New benchmark tasks for spoken language understanding evaluation on natural speech S Shon, A Pasad, F Wu, P Brusco, Y Artzi, K Livescu, KJ Han ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 35 | 2022 |
Deep Learning-Based Telephony Speech Recognition in the Wild KJ Han, S Hahm, BH Kim, J Kim, IR Lane INTERSPEECH, 1323-1327, 2017 | 33 | 2017 |
ASAPP-ASR: Multistream CNN and self-attentive SRU for SOTA speech recognition J Pan, J Shapiro, J Wohlwend, KJ Han, T Lei, T Ma arXiv preprint arXiv:2005.10469, 2020 | 30 | 2020 |
Speaker diarization with lexical information TJ Park, KJ Han, J Huang, X He, B Zhou, P Georgiou, S Narayanan arXiv preprint arXiv:2004.06756, 2020 | 30 | 2020 |
Agglomerative hierarchical speaker clustering using incremental Gaussian mixture cluster modeling KJ Han, SS Narayanan Ninth Annual Conference of the International Speech Communication Association, 2008 | 29 | 2008 |
E-Branchformer: Branchformer with enhanced merging for speech recognition K Kim, F Wu, Y Peng, J Pan, P Sridhar, KJ Han, S Watanabe 2022 IEEE Spoken Language Technology Workshop (SLT), 84-91, 2023 | 24 | 2023 |
Performance-efficiency trade-offs in unsupervised pre-training for speech recognition F Wu, K Kim, J Pan, KJ Han, KQ Weinberger, Y Artzi ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 23 | 2022 |
Identifying a driver of a vehicle SV Myers, S Elwart, WJ Talamonti, JT Mullen, ZD Nelson, T Smith, ... US Patent 9,707,911, 2017 | 22 | 2017 |
Novel inter-cluster distance measure combining GLR and ICR for improved agglomerative hierarchical speaker clustering KJ Han, SS Narayanan 2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008 | 22 | 2008 |
Robust speaker clustering strategies to data source variation for improved speaker diarization KJ Han, S Kim, SS Narayanan 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU …, 2007 | 20 | 2007 |