Naohiro Tawara
NTT Corporation
Verified email at ieee.org
Title
Cited by
Year
Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam
M Delcroix, T Ochiai, K Zmolikova, K Kinoshita, N Tawara, T Nakatani, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 117 | Year: 2020
Integrating end-to-end neural and clustering-based diarization: Getting the best of both worlds
K Kinoshita, M Delcroix, N Tawara
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 83 | Year: 2021
Advances in integration of end-to-end neural and clustering-based diarization for real conversational speech
K Kinoshita, M Delcroix, N Tawara
arXiv preprint arXiv:2105.09040, 2021
Cited by: 54 | Year: 2021
Multi-Channel Speech Enhancement Using Time-Domain Convolutional Denoising Autoencoder
N Tawara, T Kobayashi, T Ogawa
INTERSPEECH, 86-90, 2019
Cited by: 43 | Year: 2019
Speaker invariant feature extraction for zero-resource languages with adversarial learning
T Tsuchiya, N Tawara, T Ogawa, T Kobayashi
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 37 | Year: 2018
Frame-level phoneme-invariant speaker embedding for text-independent speaker recognition on extremely short utterances
N Tawara, A Ogawa, T Iwata, M Delcroix, T Ogawa
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 32 | Year: 2020
Age-VOX-Celeb: Multi-modal corpus for facial and speech estimation
N Tawara, A Ogawa, Y Kitagishi, H Kamiyama
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 25 | Year: 2021
Language model domain adaptation via recurrent neural networks with domain-shared and domain-specific representations
T Morioka, N Tawara, T Ogawa, A Ogawa, T Iwata, T Kobayashi
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 10 | Year: 2018
Fully Bayesian inference of multi-mixture Gaussian model and its evaluation using speaker clustering
N Tawara, T Ogawa, S Watanabe, T Kobayashi
2012 IEEE International Conference on Acoustics, Speech and Signal …, 2012
Cited by: 9 | Year: 2012
Sequential fish catch forecasting using Bayesian state space models
Y Kokaki, N Tawara, T Kobayashi, K Hashimoto, T Ogawa
2018 24th International Conference on Pattern Recognition (ICPR), 776-781, 2018
Cited by: 8 | Year: 2018
Speaker age estimation using age-dependent insensitive loss
Y Kitagishi, H Kamiyama, A Ando, N Tawara, T Mori, S Kobashikawa
2020 Asia-Pacific Signal and Information Processing Association Annual …, 2020
Cited by: 7 | Year: 2020
A comparative study of spectral clustering for i-vector-based speaker clustering under noisy conditions
N Tawara, T Ogawa, T Kobayashi
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
Cited by: 7 | Year: 2015
NTT speaker diarization system for CHiME-7: multi-domain, multi-microphone End-to-end and vector clustering diarization
N Tawara, M Delcroix, A Ando, A Ogawa
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 6 | Year: 2024
Speaker Adversarial Training of DPGMM-Based Feature Extractor for Zero-Resource Languages
Y Higuchi, N Tawara, T Kobayashi, T Ogawa
INTERSPEECH, 266-270, 2019
Cited by: 6 | Year: 2019
Speaker Clustering Based on Utterance-Oriented Dirichlet Process Mixture Model
N Tawara, S Watanabe, T Ogawa, T Kobayashi
INTERSPEECH, 2905-2908, 2011
Cited by: 6 | Year: 2011
Multi-stream extension of variational Bayesian HMM clustering (MS-VBx) for combined end-to-end and vector clustering-based diarization
M Delcroix, N Tawara, M Diez, F Landini, A Silnova, A Ogawa, T Nakatani, ...
arXiv preprint arXiv:2305.13580, 2023
Cited by: 5 | Year: 2023
Adversarial autoencoder for reducing nonlinear distortion
N Tawara, T Kobayashi, M Fujieda, K Katagiri, T Yazu, T Ogawa
2018 Asia-Pacific Signal and Information Processing Association Annual …, 2018
Cited by: 5 | Year: 2018
Fully Bayesian speaker clustering based on hierarchically structured utterance-oriented Dirichlet process mixture model
N Tawara, T Ogawa, S Watanabe, A Nakamura, T Kobayashi
INTERSPEECH, 2166-2169, 2012
Cited by: 5 | Year: 2012
BLSTM-based confidence estimation for end-to-end speech recognition
A Ogawa, N Tawara, T Kano, M Delcroix
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 4 | Year: 2021
Language Model Data Augmentation Based on Text Domain Transfer
A Ogawa, N Tawara, M Delcroix
INTERSPEECH, 4926-4930, 2020
Cited by: 4 | Year: 2020
Articles 1–20