Следене
Yusuke Fujita
Yusuke Fujita
LY Corp.
Потвърден имейл адрес: linecorp.com - Начална страница
Заглавие
Позовавания
Позовавания
Година
CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings
S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ...
arXiv preprint arXiv:2004.09249, 2020
2582020
End-to-end neural speaker diarization with self-attention
Y Fujita, N Kanda, S Horiguchi, Y Xue, K Nagamatsu, S Watanabe
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
2032019
End-to-end neural speaker diarization with permutation-free objectives
Y Fujita, N Kanda, S Horiguchi, K Nagamatsu, S Watanabe
Interspeech, 4300-4304, 2019
2012019
End-to-end speaker diarization for an unknown number of speakers with encoder-decoder based attractors
S Horiguchi, Y Fujita, S Watanabe, Y Xue, K Nagamatsu
arXiv preprint arXiv:2005.09921, 2020
1512020
Guided source separation meets a strong ASR backend: Hitachi/Paderborn University joint investigation for dinner party ASR
N Kanda, C Boeddeker, J Heitkaemper, Y Fujita, S Horiguchi, ...
arXiv preprint arXiv:1905.12230, 2019
642019
Speaker diarization with region proposal network
Z Huang, S Watanabe, Y Fujita, P García, Y Shao, D Povey, S Khudanpur
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
622020
The Hitachi/JHU CHiME-5 system: Advances in speech recognition for everyday home environments using multiple microphone arrays
N Kanda, R Ikeshita, S Horiguchi, Y Fujita, K Nagamatsu, X Wang, ...
Proc. CHiME-5, 6-10, 2018
522018
Acoustic modeling for distant multi-talker speech recognition with single-and multi-channel branches
N Kanda, Y Fujita, S Horiguchi, R Ikeshita, K Nagamatsu, S Watanabe
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
432019
End-to-end neural diarization: Reformulating speaker diarization as simple multi-label classification
Y Fujita, S Watanabe, S Horiguchi, Y Xue, K Nagamatsu
arXiv preprint arXiv:2003.02966, 2020
402020
Neural speaker diarization with speaker-wise chain rule
Y Fujita, S Watanabe, S Horiguchi, Y Xue, J Shi, K Nagamatsu
arXiv preprint arXiv:2006.01796, 2020
372020
End-to-end speaker diarization as post-processing
S Horiguchi, P Garcia, Y Fujita, S Watanabe, K Nagamatsu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
362021
Online end-to-end neural diarization with speaker-tracing buffer
Y Xue, S Horiguchi, Y Fujita, S Watanabe, P García, K Nagamatsu
2021 IEEE Spoken Language Technology Workshop (SLT), 841-848, 2021
362021
Simultaneous speech recognition and speaker diarization for monaural dialogue recordings with target-speaker acoustic models
N Kanda, S Horiguchi, Y Fujita, Y Xue, K Nagamatsu, S Watanabe
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 31-38, 2019
332019
The Hitachi-JHU DIHARD III system: Competitive end-to-end neural diarization and x-vector clustering systems combined by DOVER-Lap
S Horiguchi, N Yalta, P Garcia, Y Takashima, Y Xue, D Raj, Z Huang, ...
arXiv preprint arXiv:2102.01363, 2021
322021
Lattice-free State-level Minimum Bayes Risk Training of Acoustic Models.
N Kanda, Y Fujita, K Nagamatsu
Interspeech, 2923-2927, 2018
322018
Investigation of lattice-free maximum mutual information-based acoustic models with sequence-level Kullback-Leibler divergence
N Kanda, Y Fujita, K Nagamatsu
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 69-76, 2017
292017
Encoder-decoder based attractors for end-to-end neural diarization
S Horiguchi, Y Fujita, S Watanabe, Y Xue, P Garcia
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1493-1507, 2022
282022
Auxiliary interference speaker loss for target-speaker speech recognition
N Kanda, S Horiguchi, R Takashima, Y Fujita, K Nagamatsu, S Watanabe
arXiv preprint arXiv:1906.10876, 2019
252019
Acoustic modeling for overlapping speech recognition: JHU CHiME-5 challenge system
V Manohar, SJ Chen, Z Wang, Y Fujita, S Watanabe, S Khudanpur
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
252019
Sequence to multi-sequence learning via conditional chain mapping for mixture signals
J Shi, X Chang, P Guo, S Watanabe, Y Fujita, J Xu, B Xu, L Xie
Advances in Neural Information Processing Systems 33, 3735-3747, 2020
242020
Системата не може да изпълни операцията сега. Опитайте отново по-късно.
Статии 1–20