Следене
Taejin Park
Заглавие
Позовавания
Позовавания
Година
A review of speaker diarization: Recent advances with deep learning
TJ Park, N Kanda, D Dimitriadis, KJ Han, S Watanabe, S Narayanan
Computer Speech & Language 72, 101317, 2022
2262022
Auto-tuning spectral clustering for speaker diarization using normalized maximum eigengap
TJ Park, KJ Han, M Kumar, S Narayanan
IEEE Signal Processing Letters 27, 381-385, 2019
1012019
TitaNet: Neural Model for speaker representation with 1D Depth-wise separable convolutions and global context
NR Koluguri, T Park, B Ginsburg
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
552022
Binaural rendering method and apparatus for decoding multi channel audio
YJ Lee, JI Seo, JH Yoo, SK Beack, JM Sung, TJ Lee, KO Kang, JW Kim, ...
US Patent 9,319,819, 2016
472016
Musical instrument sound classification with deep convolutional neural network using feature fusion approach
T Park, T Lee
arXiv preprint arXiv:1512.07370, 2015
452015
Multimodal speaker segmentation and diarization using lexical and acoustic cues via sequence to sequence neural networks
TJ Park, P Georgiou
arXiv preprint arXiv:1805.10731, 2018
332018
Speaker diarization with lexical information
TJ Park, KJ Han, J Huang, X He, B Zhou, P Georgiou, S Narayanan
arXiv preprint arXiv:2004.06756, 2020
302020
Speaker diarization using latent space clustering in generative adversarial network
M Pal, M Kumar, R Peri, TJ Park, SH Kim, C Lord, S Bishop, S Narayanan
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
212020
Meta-learning with latent space clustering in generative adversarial network for speaker diarization
M Pal, M Kumar, R Peri, TJ Park, SH Kim, C Lord, S Bishop, S Narayanan
IEEE/ACM transactions on audio, speech, and language processing 29, 1204-1219, 2021
182021
Automatic prediction of suicidal risk in military couples using multimodal interaction cues from couples conversations
SN Chakravarthula, M Nasir, SY Tseng, H Li, TJ Park, B Baucom, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
142020
Multi-Task Discriminative Training of Hybrid DNN-TVM Model for Speaker Verification with Noisy and Far-Field Speech.
A Jati, R Peri, M Pal, TJ Park, N Kumar, R Travadi, PG Georgiou, ...
Interspeech, 2463-2467, 2019
142019
Multi-scale speaker diarization with dynamic scale weighting
TJ Park, NR Koluguri, J Balam, B Ginsburg
arXiv preprint arXiv:2203.15974, 2022
112022
Tackling dynamics in federated incremental learning with variational embedding rehearsal
TJ Park, K Kumatani, D Dimitriadis
arXiv preprint arXiv:2110.09695, 2021
112021
Multi-scale speaker diarization with neural affinity score fusion
TJ Park, M Kumar, S Narayanan
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
112021
The Second DIHARD Challenge: System Description for USC-SAIL Team.
TJ Park, M Kumar, N Flemotomos, M Pal, R Peri, R Lahiri, PG Georgiou, ...
INTERSPEECH, 998-1002, 2019
112019
Encoding/decoding apparatus for processing channel signal and method therefor
JI Seo, SK Beack, DY Jang, KO Kang, TJ Park, YJ Lee, KW Choi, JW Kim
US Patent 10,068,579, 2018
82018
Apparatus for processing audio signal for sound bar and method therefor
JI Seo, DY Jang, TJ Park, KW Choi, KO Kang, JW Kim
US Patent App. 14/760,770, 2015
82015
Binaural rendering method and apparatus for decoding multi channel audio
YJ Lee, JI Seo, JH Yoo, SK Beack, JM Sung, TJ Lee, KO Kang, JW Kim, ...
US Patent 10,199,045, 2019
62019
A noise robust audio fingerprint extraction technique for mobile devices using gradient histograms
T Park, SK Beack, T Lee
2015 IEEE 5th International Conference on Consumer Electronics-Berlin (ICCE …, 2015
62015
Robust multi-channel speech recognition using frequency aligned network
T Park, K Kumatani, M Wu, S Sundaram
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
52020
Системата не може да изпълни операцията сега. Опитайте отново по-късно.
Статии 1–20