Improving speaker discrimination of target speech extraction with time-domain speakerbeam M Delcroix, T Ochiai, K Zmolikova, K Kinoshita, N Tawara, T Nakatani, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 117 | 2020 |
Integrating end-to-end neural and clustering-based diarization: Getting the best of both worlds K Kinoshita, M Delcroix, N Tawara ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 83 | 2021 |
Advances in integration of end-to-end neural and clustering-based diarization for real conversational speech K Kinoshita, M Delcroix, N Tawara arXiv preprint arXiv:2105.09040, 2021 | 54 | 2021 |
Multi-Channel Speech Enhancement Using Time-Domain Convolutional Denoising Autoencoder. N Tawara, T Kobayashi, T Ogawa INTERSPEECH, 86-90, 2019 | 43 | 2019 |
Speaker invariant feature extraction for zero-resource languages with adversarial learning T Tsuchiya, N Tawara, T Ogawa, T Kobayashi 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 37 | 2018 |
Frame-level phoneme-invariant speaker embedding for text-independent speaker recognition on extremely short utterances N Tawara, A Ogawa, T Iwata, M Delcroix, T Ogawa ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 32 | 2020 |
Age-vox-celeb: Multi-modal corpus for facial and speech estimation N Tawara, A Ogawa, Y Kitagishi, H Kamiyama ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 25 | 2021 |
Language model domain adaptation via recurrent neural networks with domain-shared and domain-specific representations T Moriokal, N Tawara, T Ogawa, A Ogawa, T Iwata, T Kobayashi 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 10 | 2018 |
Fully Bayesian inference of multi-mixture Gaussian model and its evaluation using speaker clustering N Tawara, T Ogawa, S Watanabe, T Kobayashi 2012 IEEE International Conference on Acoustics, Speech and Signal …, 2012 | 9 | 2012 |
Sequential fish catch forecasting using Bayesian state space models Y Kokaki, N Tawara, T Kobayashi, K Hashimoto, T Ogawa 2018 24th International Conference on Pattern Recognition (ICPR), 776-781, 2018 | 8 | 2018 |
Speaker age estimation using age-dependent insensitive loss Y Kitagishi, H Kamiyama, A Ando, N Tawara, T Mori, S Kobashikawa 2020 Asia-Pacific Signal and Information Processing Association Annual …, 2020 | 7 | 2020 |
A comparative study of spectral clustering for i-vector-based speaker clustering under noisy conditions N Tawara, T Ogawa, T Kobayashi 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 7 | 2015 |
NTT speaker diarization system for CHiME-7: multi-domain, multi-microphone End-to-end and vector clustering diarization N Tawara, M Delcroix, A Ando, A Ogawa ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 6 | 2024 |
Speaker Adversarial Training of DPGMM-Based Feature Extractor for Zero-Resource Languages. Y Higuchi, N Tawara, T Kobayashi, T Ogawa INTERSPEECH, 266-270, 2019 | 6 | 2019 |
Speaker Clustering Based on Utterance-Oriented Dirichlet Process Mixture Model. N Tawara, S Watanabe, T Ogawa, T Kobayashi INTERSPEECH, 2905-2908, 2011 | 6 | 2011 |
Multi-stream extension of variational Bayesian HMM clustering (MS-VBx) for combined end-to-end and vector clustering-based diarization M Delcroix, N Tawara, M Diez, F Landini, A Silnova, A Ogawa, T Nakatani, ... arXiv preprint arXiv:2305.13580, 2023 | 5 | 2023 |
Adversarial autoencoder for reducing nonlinear distortion N Tawara, T Kobayashi, M Fujieda, K Katagiri, T Yazu, T Ogawa 2018 Asia-Pacific Signal and Information Processing Association Annual …, 2018 | 5 | 2018 |
Fully Bayesian speaker clustering based on hierarchically structured utterance-oriented Dirichlet process mixture model. N Tawara, T Ogawa, S Watanabe, A Nakamura, T Kobayashi INTERSPEECH, 2166-2169, 2012 | 5 | 2012 |
Blstm-based confidence estimation for end-to-end speech recognition A Ogawa, N Tawara, T Kano, M Delcroix ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 4 | 2021 |
Language Model Data Augmentation Based on Text Domain Transfer. A Ogawa, N Tawara, M Delcroix INTERSPEECH, 4926-4930, 2020 | 4 | 2020 |