Следене
Taejin Park
Заглавие
Позовавания
Позовавания
Година
A review of speaker diarization: Recent advances with deep learning
TJ Park, N Kanda, D Dimitriadis, KJ Han, S Watanabe, S Narayanan
Computer Speech & Language 72, 101317, 2022
1582022
Auto-tuning spectral clustering for speaker diarization using normalized maximum eigengap
TJ Park, KJ Han, M Kumar, S Narayanan
IEEE Signal Processing Letters 27, 381-385, 2019
872019
Binaural rendering method and apparatus for decoding multi channel audio
YJ Lee, JI Seo, JH Yoo, SK Beack, JM Sung, TJ Lee, KO Kang, JW Kim, ...
US Patent 9,319,819, 2016
452016
Musical instrument sound classification with deep convolutional neural network using feature fusion approach
T Park, T Lee
arXiv preprint arXiv:1512.07370, 2015
392015
Multimodal speaker segmentation and diarization using lexical and acoustic cues via sequence to sequence neural networks
TJ Park, P Georgiou
arXiv preprint arXiv:1805.10731, 2018
302018
Speaker diarization with lexical information
TJ Park, KJ Han, J Huang, X He, B Zhou, P Georgiou, S Narayanan
arXiv preprint arXiv:2004.06756, 2020
252020
TitaNet: Neural Model for speaker representation with 1D Depth-wise separable convolutions and global context
NR Koluguri, T Park, B Ginsburg
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
242022
Meta-learning with latent space clustering in generative adversarial network for speaker diarization
M Pal, M Kumar, R Peri, TJ Park, SH Kim, C Lord, S Bishop, S Narayanan
IEEE/ACM transactions on audio, speech, and language processing 29, 1204-1219, 2021
182021
Speaker diarization using latent space clustering in generative adversarial network
M Pal, M Kumar, R Peri, TJ Park, SH Kim, C Lord, S Bishop, S Narayanan
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
182020
Automatic prediction of suicidal risk in military couples using multimodal interaction cues from couples conversations
SN Chakravarthula, M Nasir, SY Tseng, H Li, TJ Park, B Baucom, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
142020
Multi-Task Discriminative Training of Hybrid DNN-TVM Model for Speaker Verification with Noisy and Far-Field Speech.
A Jati, R Peri, M Pal, TJ Park, N Kumar, R Travadi, PG Georgiou, ...
Interspeech, 2463-2467, 2019
122019
Multi-scale speaker diarization with neural affinity score fusion
TJ Park, M Kumar, S Narayanan
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
102021
The Second DIHARD Challenge: System Description for USC-SAIL Team.
TJ Park, M Kumar, N Flemotomos, M Pal, R Peri, R Lahiri, PG Georgiou, ...
INTERSPEECH, 998-1002, 2019
92019
Encoding/decoding apparatus for processing channel signal and method therefor
JI Seo, SK Beack, DY Jang, KO Kang, TJ Park, YJ Lee, KW Choi, JW Kim
US Patent 10,068,579, 2018
82018
Apparatus for processing audio signal for sound bar and method therefor
JI Seo, DY Jang, TJ Park, KW Choi, KO Kang, JW Kim
US Patent App. 14/760,770, 2015
72015
A noise robust audio fingerprint extraction technique for mobile devices using gradient histograms
T Park, SK Beack, T Lee
2015 IEEE 5th International Conference on Consumer Electronics-Berlin (ICCE …, 2015
62015
Tackling dynamics in federated incremental learning with variational embedding rehearsal
TJ Park, K Kumatani, D Dimitriadis
arXiv preprint arXiv:2110.09695, 2021
52021
Binaural rendering method and apparatus for decoding multi channel audio
YJ Lee, JI Seo, JH Yoo, SK Beack, JM Sung, TJ Lee, KO Kang, JW Kim, ...
US Patent 10,199,045, 2019
52019
Apparatus and method for generating multimedia data, and apparatus and method for playing multimedia data
YJ Lee, JI Seo, C Keunwoo, TJ Park, KO Kang
US Patent 9,357,325, 2016
52016
Apparatus and method for transmitting watermark robust to acoustic channel distortion
SK Beack, TJ Park, JM Sung, YJ Lee, TJ Lee, KO Kang
US Patent App. 14/881,375, 2016
52016
Системата не може да изпълни операцията сега. Опитайте отново по-късно.
Статии 1–20