Speech synthesis based on hidden Markov models K Tokuda, Y Nankaku, T Toda, H Zen, J Yamagishi, K Oura Proceedings of the IEEE 101 (5), 1234-1252, 2013 | 589 | 2013 |
An HMM-based singing voice synthesis system K Saino, H Zen, Y Nankaku, A Lee, K Tokuda Ninth International Conference on Spoken Language Processing, 2006 | 152 | 2006 |
Recent development of the HMM-based singing voice synthesis system—Sinsy K Oura, A Mase, T Yamada, S Muto, Y Nankaku, K Tokuda Seventh ISCA Workshop on Speech Synthesis, 2010 | 139 | 2010 |
State mapping based method for cross-lingual speaker adaptation in HMM-based speech synthesis. YJ Wu, Y Nankaku, K Tokuda Interspeech, 528-531, 2009 | 110 | 2009 |
Singing Voice Synthesis Based on Deep Neural Networks. M Nishimura, K Hashimoto, K Oura, Y Nankaku, K Tokuda Interspeech, 2478-2482, 2016 | 108 | 2016 |
An excitation model for HMM-based speech synthesis based on residual modeling R Maia, T Toda, H Zen, Y Nankaku, K Tokuda | 100 | 2007 |
On the use of kernel PCA for feature extraction in speech recognition A Lima, H Zen, Y Nankaku, C Miyajima, K Tokuda, T Kitamura IEICE TRANSACTIONS on Information and Systems 87 (12), 2802-2811, 2004 | 94 | 2004 |
Continuous stochastic feature mapping based on trajectory HMMs H Zen, Y Nankaku, K Tokuda IEEE Transactions on Audio, Speech, and Language Processing 19 (2), 417-430, 2010 | 85 | 2010 |
Singing voice synthesis based on generative adversarial networks Y Hono, K Hashimoto, K Oura, Y Nankaku, K Tokuda ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 68 | 2019 |
Product of experts for statistical parametric speech synthesis H Zen, MJF Gales, Y Nankaku, K Tokuda IEEE Transactions on Audio, Speech, and Language Processing 20 (3), 794-805, 2011 | 64 | 2011 |
The effect of neural networks in statistical parametric speech synthesis K Hashimoto, K Oura, Y Nankaku, K Tokuda 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 63 | 2015 |
HMM-based singing voice synthesis and its application to Japanese and English K Nakamura, K Oura, Y Nankaku, K Tokuda 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 42 | 2014 |
Sinsy: A deep neural network-based singing voice synthesis system Y Hono, K Hashimoto, K Oura, Y Nankaku, K Tokuda IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2803-2815, 2021 | 41 | 2021 |
Recent development of the DNN-based singing voice synthesis system—sinsy Y Hono, S Murata, K Nakamura, K Hashimoto, K Oura, Y Nankaku, ... 2018 Asia-Pacific Signal and Information Processing Association Annual …, 2018 | 40 | 2018 |
Trajectory training considering global variance for speech synthesis based on neural networks K Hashimoto, K Oura, Y Nankaku, K Tokuda 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 39 | 2016 |
Pitch adaptive training for HMM-based singing voice synthesis K Oura, A Mase, Y Nankaku, K Tokuda 2012 IEEE International Conference on Acoustics, Speech and Signal …, 2012 | 39 | 2012 |
Singing voice synthesis based on convolutional neural networks K Nakamura, K Hashimoto, K Oura, Y Nankaku, K Tokuda arXiv preprint arXiv:1904.06868, 2019 | 36 | 2019 |
Face recognition based on separable lattice hmms D Kurata, Y Nankaku, K Tokuda, T Kitamura, Z Ghahramani 2006 IEEE International Conference on Acoustics Speech and Signal Processing …, 2006 | 32 | 2006 |
A trainable excitation model for HMM-based speech synthesis R Maia, T Toda, H Zen, Y Nankaku, K Tokuda target 2, a3, 2007 | 29 | 2007 |
Deterministic annealing EM algorithm in acoustic modeling for speaker and speech recognition Y Itaya, H Zen, Y Nankaku, C Miyajima, K Tokuda, T Kitamura IEICE transactions on information and systems 88 (3), 425-431, 2005 | 29 | 2005 |