Speech synthesis based on hidden Markov models K Tokuda, Y Nankaku, T Toda, H Zen, J Yamagishi, K Oura Proceedings of the IEEE 101 (5), 1234-1252, 2013 | 604 | 2013 |
An HMM-based singing voice synthesis system K Saino, H Zen, Y Nankaku, A Lee, K Tokuda Ninth International Conference on Spoken Language Processing, 2006 | 153 | 2006 |
Recent development of the HMM-based singing voice synthesis system—Sinsy K Oura, A Mase, T Yamada, S Muto, Y Nankaku, K Tokuda Seventh ISCA Workshop on Speech Synthesis, 2010 | 141 | 2010 |
State mapping based method for cross-lingual speaker adaptation in HMM-based speech synthesis. YJ Wu, Y Nankaku, K Tokuda Interspeech, 528-531, 2009 | 110 | 2009 |
Singing Voice Synthesis Based on Deep Neural Networks. M Nishimura, K Hashimoto, K Oura, Y Nankaku, K Tokuda Interspeech, 2478-2482, 2016 | 109 | 2016 |
An excitation model for HMM-based speech synthesis based on residual modeling R Maia, T Toda, H Zen, Y Nankaku, K Tokuda | 100 | 2007 |
On the use of kernel PCA for feature extraction in speech recognition A Lima, H Zen, Y Nankaku, C Miyajima, K Tokuda, T Kitamura IEICE Transactions on Information and Systems 87 (12), 2802-2811, 2004 | 95 | 2004 |
Continuous stochastic feature mapping based on trajectory HMMs H Zen, Y Nankaku, K Tokuda IEEE Transactions on Audio, Speech, and Language Processing 19 (2), 417-430, 2010 | 84 | 2010 |
Singing voice synthesis based on generative adversarial networks Y Hono, K Hashimoto, K Oura, Y Nankaku, K Tokuda ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 71 | 2019 |
The effect of neural networks in statistical parametric speech synthesis K Hashimoto, K Oura, Y Nankaku, K Tokuda 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 64 | 2015 |
Product of experts for statistical parametric speech synthesis H Zen, MJF Gales, Y Nankaku, K Tokuda IEEE Transactions on Audio, Speech, and Language Processing 20 (3), 794-805, 2011 | 64 | 2011 |
HMM-based singing voice synthesis and its application to Japanese and English K Nakamura, K Oura, Y Nankaku, K Tokuda 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 42 | 2014 |
Sinsy: A deep neural network-based singing voice synthesis system Y Hono, K Hashimoto, K Oura, Y Nankaku, K Tokuda IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2803-2815, 2021 | 41 | 2021 |
Recent development of the DNN-based singing voice synthesis system—Sinsy Y Hono, S Murata, K Nakamura, K Hashimoto, K Oura, Y Nankaku, ... 2018 Asia-Pacific Signal and Information Processing Association Annual …, 2018 | 41 | 2018 |
Trajectory training considering global variance for speech synthesis based on neural networks K Hashimoto, K Oura, Y Nankaku, K Tokuda 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 39 | 2016 |
Pitch adaptive training for HMM-based singing voice synthesis K Oura, A Mase, Y Nankaku, K Tokuda 2012 IEEE International Conference on Acoustics, Speech and Signal …, 2012 | 39 | 2012 |
Singing voice synthesis based on convolutional neural networks K Nakamura, K Hashimoto, K Oura, Y Nankaku, K Tokuda arXiv preprint arXiv:1904.06868, 2019 | 37 | 2019 |
Face recognition based on separable lattice HMMs D Kurata, Y Nankaku, K Tokuda, T Kitamura, Z Ghahramani 2006 IEEE International Conference on Acoustics Speech and Signal Processing …, 2006 | 32 | 2006 |
Hierarchical multi-grained generative model for expressive speech synthesis Y Hono, K Tsuboi, K Sawada, K Hashimoto, K Oura, Y Nankaku, ... arXiv preprint arXiv:2009.08474, 2020 | 30 | 2020 |
Impacts of input linguistic feature representation on Japanese end-to-end speech synthesis T Fujimoto, K Hashimoto, K Oura, Y Nankaku, K Tokuda 10th ISCA Speech Synthesis Workshop. ISCA, Vienna, Austria, 2019 | 29 | 2019 |