Speech emotion recognition using capsule networks X Wu, S Liu, Y Cao, X Li, J Yu, D Dai, X Ma, S Hu, Z Wu, X Liu, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 118 | 2019 |
Emotion controllable speech synthesis using emotion-unlabeled dataset with the assistance of cross-domain speech emotion recognition X Cai, D Dai, Z Wu, X Li, J Li, H Meng ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 60 | 2021 |
Learning discriminative features from spectrograms using center loss for speech emotion recognition D Dai, Z Wu, R Li, X Wu, J Jia, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 58 | 2019 |
One-shot voice conversion with global speaker embeddings. H Lu, Z Wu, D Dai, R Li, S Kang, J Jia, H Meng Interspeech, 669-673, 2019 | 44 | 2019 |
Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-Trained BERT. D Dai, Z Wu, S Kang, X Wu, J Jia, D Su, D Yu, H Meng Interspeech, 2090-2094, 2019 | 24 | 2019 |
Unsupervised cross-lingual speech emotion recognition using domain adversarial neural network X Cai, Z Wu, K Zhong, B Su, D Dai, H Meng 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021 | 11 | 2021 |
Noise robust TTS for low resource speakers using pre-trained model and speech enhancement D Dai, L Chen, Y Wang, M Wang, R Xia, X Song, Z Wu, Y Wang arXiv preprint arXiv:2005.12531, 2020 | 10 | 2020 |
Cloning one’s voice using very limited data in the wild D Dai, Y Chen, L Chen, M Tu, L Liu, R Xia, Q Tian, Y Wang, Y Wang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 9 | 2022 |
Speaker independent and multilingual/mixlingual speech-driven talking head generation using phonetic posteriorgrams H Huang, Z Wu, S Kang, D Dai, J Jia, T Fu, D Tuo, G Lei, P Liu, D Su, ... 2021 Asia-Pacific Signal and Information Processing Association Annual …, 2021 | 4 | 2021 |