Single channel or multi-channel audio control interface LH Kim, E Visser, R Peri, PL Ton, JP Toman, T Schultz, J Zheng US Patent 10,051,364, 2018 | 60 | 2018 |
Adversarial attack and defense strategies for deep speaker recognition systems A Jati, CC Hsu, M Pal, R Peri, W AbdAlmageed, S Narayanan Computer Speech & Language 68, 101199, 2021 | 52 | 2021 |
Deep neural net based filter prediction for audio event classification and extraction E Visser, Y Guo, LH Kim, R Peri, S Zhang US Patent 9,666,183, 2017 | 46 | 2017 |
Automated evaluation of psychotherapy skills using speech and language technologies N Flemotomos, VR Martinez, Z Chen, K Singla, V Ardulov, R Peri, ... Behavior Research Methods 54 (2), 690-711, 2022 | 37* | 2022 |
Virtual, augmented, and mixed reality E Visser, LH Kim, R Peri US Patent App. 15/238,591, 2018 | 37 | 2018 |
Robust speaker recognition using unsupervised adversarial invariance R Peri, M Pal, A Jati, K Somandepalli, S Narayanan ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 29 | 2020 |
Adversarial defense for deep speaker recognition using hybrid adversarial training M Pal, A Jati, R Peri, CC Hsu, W AbdAlmageed, S Narayanan ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 21 | 2021 |
Speaker diarization using latent space clustering in generative adversarial network M Pal, M Kumar, R Peri, TJ Park, SH Kim, C Lord, S Bishop, S Narayanan ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 19 | 2020 |
Meta-learning with latent space clustering in generative adversarial network for speaker diarization M Pal, M Kumar, R Peri, TJ Park, SH Kim, C Lord, S Bishop, S Narayanan IEEE/ACM transactions on audio, speech, and language processing 29, 1204-1219, 2021 | 18 | 2021 |
Method, system and article of manufacture for processing spatial audio LH Kim, R Peri, E Visser US Patent 9,578,439, 2017 | 16 | 2017 |
User-level differential privacy against attribute inference attack of speech emotion recognition in federated learning T Feng, R Peri, S Narayanan arXiv preprint arXiv:2204.02500, 2022 | 13 | 2022 |
Drone flight control E Visser, LH Kim, RDJB Castillo, S Zhang, R Peri US Patent 10,379,534, 2019 | 13 | 2019 |
Multi-Task Discriminative Training of Hybrid DNN-TVM Model for Speaker Verification with Noisy and Far-Field Speech. A Jati, R Peri, M Pal, TJ Park, N Kumar, R Travadi, PG Georgiou, ... Interspeech, 2463-2467, 2019 | 12 | 2019 |
Collaborative audio processing LH Kim, E Visser, R Peri US Patent 9,706,300, 2017 | 12 | 2017 |
Disentanglement for audio-visual emotion recognition using multitask setup R Peri, S Parthasarathy, C Bradshaw, S Sundaram ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 11 | 2021 |
An empirical analysis of information encoded in disentangled neural speaker representations R Peri, H Li, K Somandepalli, A Jati, S Narayanan arXiv preprint arXiv:2002.03520, 2020 | 10 | 2020 |
The Second DIHARD Challenge: System Description for USC-SAIL Team. TJ Park, M Kumar, N Flemotomos, M Pal, R Peri, R Lahiri, PG Georgiou, ... INTERSPEECH, 998-1002, 2019 | 10 | 2019 |
Single-channel or multi-channel audio control interface LH Kim, E Visser, R Peri, PL Ton, JP Toman, T Schultz, J Zheng US Patent 10,073,607, 2018 | 9 | 2018 |
Cloud-based processing using local device provided sensor data and labels E Visser, M Jin, LH Kim, R Peri, S Zhang US Patent App. 15/273,496, 2017 | 8 | 2017 |
Perceptual-based deep-learning denoiser as a defense against adversarial attacks on ASR systems A Sreeram, N Mehlman, R Peri, D Knox, S Narayanan arXiv preprint arXiv:2107.05222, 2021 | 5 | 2021 |