CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ... 6th International Workshop on Speech Processing in Everyday Environments …, 2020 | 328 | 2020 |
End-to-End Neural Speaker Diarization with Self-attention Y Fujita, N Kanda, S Horiguchi, Y Xue, K Nagamatsu, S Watanabe IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 296-303, 2019 | 271 | 2019 |
End-to-End Neural Speaker Diarization with Permutation-Free Objectives Y Fujita, N Kanda, S Horiguchi, K Nagamatsu, S Watanabe Interspeech, 4300–4304, 2019 | 265 | 2019 |
End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors S Horiguchi, Y Fujita, S Watanabe, Y Xue, K Nagamatsu Interspeech, 269-273, 2020 | 190 | 2020 |
Personalized Classifier for Food Image Recognition S Horiguchi, S Amano, M Ogawa, K Aizawa IEEE Transactions on Multimedia 20 (10), 2836-2848, 2018 | 105 | 2018 |
Significance of Softmax-based Features in Comparison to Distance Metric Learning-based Features S Horiguchi, D Ikami, K Aizawa IEEE Transactions on Pattern Analysis and Machine Intelligence 42 (5), 1279-1285, 2020 | 92* | 2020 |
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR N Kanda, C Boeddeker, J Heitkaemper, Y Fujita, S Horiguchi, ... Interspeech, 1248-1252, 2019 | 72 | 2019 |
Encoder-Decoder Based Attractors for End-to-End Neural Diarization S Horiguchi, Y Fujita, S Watanabe, Y Xue, P Garcia IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1493-1507, 2022 | 64 | 2022 |
The Hitachi/JHU CHiME-5 system: Advances in speech recognition for everyday home environments using multiple microphone arrays N Kanda, R Ikeshita, S Horiguchi, Y Fujita, K Nagamatsu, X Wang, ... The 5th International Workshop on Speech Processing in Everyday Environments …, 2018 | 54 | 2018 |
Online End-to-End Neural Diarization with Speaker-Tracing Buffer Y Xue, S Horiguchi, Y Fujita, S Watanabe, P García, K Nagamatsu IEEE Spoken Language Technology Workshop (SLT), 841-848, 2021 | 52 | 2021 |
End-to-End Neural Diarization: Reformulating Speaker Diarization as Simple Multi-Label Classification Y Fujita, S Watanabe, S Horiguchi, Y Xue, K Nagamatsu arXiv preprint arXiv:2003.02966, 2020 | 51 | 2020 |
Omnidirectional Pedestrian Detection by Rotation Invariant Training M Tamura, S Horiguchi, T Murakami IEEE Winter Conference on Applications of Computer Vision (WACV), 1989-1998, 2019 | 49 | 2019 |
Face-Voice Matching using Cross-modal Embeddings S Horiguchi, N Kanda, K Nagamatsu ACM International Conference on Multimedia (ACMMM), 1011-1019, 2018 | 48 | 2018 |
Neural Speaker Diarization with Speaker-Wise Chain Rule Y Fujita, S Watanabe, S Horiguchi, Y Xue, J Shi, K Nagamatsu arXiv preprint arXiv:2006.01796, 2020 | 46 | 2020 |
Acoustic Modeling for Distant Multi-talker Speech Recognition with Single- and Multi-channel Branches N Kanda, Y Fujita, S Horiguchi, R Ikeshita, K Nagamatsu, S Watanabe IEEE International Conference on Acoustics, Speech, and Signal Processing …, 2019 | 45 | 2019 |
End-to-End Speaker Diarization as Post-Processing S Horiguchi, P García, Y Fujita, S Watanabe, K Nagamatsu IEEE International Conference on Acoustics, Speech and Signal Processing …, 2021 | 44 | 2021 |
Simultaneous Speech Recognition and Speaker Diarization for Monaural Dialogue Recordings with Target-Speaker Acoustic Models N Kanda, S Horiguchi, Y Fujita, Y Xue, K Nagamatsu, S Watanabe IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 31-38, 2019 | 43 | 2019 |
The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural Diarization and X-vector Clustering Systems Combined by DOVER-Lap S Horiguchi, N Yalta, P Garcia, Y Takashima, Y Xue, D Raj, Z Huang, ... arXiv preprint arXiv:2102.01363, 2021 | 41 | 2021 |
Towards Neural Diarization for Unlimited Numbers of Speakers Using Global and Local Attractors S Horiguchi, S Watanabe, P Garcia, Y Xue, Y Takashima, Y Kawaguchi IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 98-105, 2022 | 40 | 2022 |
Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition N Kanda, S Horiguchi, R Takashima, Y Fujita, K Nagamatsu, S Watanabe Interspeech, 236-240, 2019 | 36 | 2019 |