| Title | Authors | Venue | Cited by | Year |
| --- | --- | --- | --- | --- |
| SVD-based adaptive QIM watermarking on stereo audio signals | MJ Hwang, JS Lee, MS Lee, HG Kang | IEEE Transactions on Multimedia 20 (1), 45-54, 2017 | 92 | 2017 |
| TTS-by-TTS: TTS-driven data augmentation for fast and high-quality speech synthesis | MJ Hwang, R Yamamoto, E Song, JM Kim | ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 37 | 2021 |
| SeamlessM4T-Massively Multilingual & Multimodal Machine Translation | L Barrault, YA Chung, MC Meglioli, D Dale, N Dong, PA Duquenne, ... | arXiv preprint arXiv:2308.11596, 2023 | 36 | 2023 |
| LP-WaveNet: Linear prediction-based WaveNet speech synthesis | MJ Hwang, F Soong, E Song, X Wang, H Kang, HG Kang | 2020 Asia-Pacific Signal and Information Processing Association Annual …, 2020 | 31 | 2020 |
| HierSpeech: Bridging the gap between text and speech by hierarchical variational inference using self-supervised representations for speech synthesis | SH Lee, SB Kim, JH Lee, E Song, MJ Hwang, SW Lee | Advances in Neural Information Processing Systems 35, 16624-16636, 2022 | 27 | 2022 |
| Improved Parallel WaveGAN vocoder with perceptually weighted spectrogram loss | E Song, R Yamamoto, MJ Hwang, JS Kim, O Kwon, JM Kim | 2021 IEEE Spoken Language Technology Workshop (SLT), 470-476, 2021 | 21 | 2021 |
| Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators | R Yamamoto, E Song, MJ Hwang, JM Kim | ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 20 | 2021 |
| High-Fidelity Parallel WaveGAN with Multi-Band Harmonic-Plus-Noise Model | MJ Hwang, R Yamamoto, E Song, JM Kim | Proc. INTERSPEECH, online, 2227-2231, 2021 | 14 | 2021 |
| Improving LPCNet-based Text-to-Speech with Linear Prediction-structured Mixture Density Network | MJ Hwang, E Song, R Yamamoto, F Soong, HG Kang | ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 14 | 2020 |
| Seamless: Multilingual Expressive and Streaming Speech Translation | L Barrault, YA Chung, MC Meglioli, D Dale, N Dong, M Duppenthaler, ... | arXiv preprint arXiv:2312.05187, 2023 | 13 | 2023 |
| Language model-based emotion prediction methods for emotional speech synthesis systems | HW Yoon, O Kwon, H Lee, R Yamamoto, E Song, JM Kim, MJ Hwang | arXiv preprint arXiv:2206.15067, 2022 | 12 | 2022 |
| Neural text-to-speech with a modeling-by-generation excitation vocoder | E Song, MJ Hwang, R Yamamoto, JS Kim, O Kwon, JM Kim | arXiv preprint arXiv:2008.00132, 2020 | 11 | 2020 |
| LiteTTS: A Lightweight Mel-Spectrogram-Free Text-to-Wave Synthesizer Based on Generative Adversarial Networks | HK Nguyen, K Jeong, S Um, MJ Hwang, E Song, HG Kang | Proc. Interspeech 2021, 3595-3599, 2021 | 7 | 2021 |
| TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder | E Song, R Yamamoto, O Kwon, CH Song, MJ Hwang, S Oh, HW Yoon, ... | arXiv preprint arXiv:2206.14984, 2022 | 6 | 2022 |
| ExcitGlow: Improving a WaveGlow-based neural vocoder with linear prediction analysis | S Oh, H Lim, K Byun, MJ Hwang, E Song, HG Kang | 2020 Asia-Pacific Signal and Information Processing Association Annual …, 2020 | 6 | 2020 |
| Modeling-by-generation-structured noise compensation algorithm for glottal vocoding speech synthesis system | MJ Hwang, E Song, K Byun, HG Kang | 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 4 | 2018 |
| A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems | MJ Hwang, E Song, JS Kim, HG Kang | INTERSPEECH, 912-916, 2018 | 3 | 2018 |
| Linear prediction-based Parallel WaveGAN speech synthesis | MJ Hwang, HW Yoon, CH Song, JS Kim, JM Kim, E Song | 2022 International Conference on Electronics, Information, and Communication …, 2022 | 2 | 2022 |
| Effective data augmentation methods for neural text-to-speech systems | S Oh, O Kwon, MJ Hwang, JM Kim, E Song | 2022 International Conference on Electronics, Information, and Communication …, 2022 | 2 | 2022 |
| Parameter enhancement for MELP speech codec in noisy communication environment | MJ Hwang, HG Kang | arXiv preprint arXiv:1906.08407, 2019 | 2 | 2019 |