Eunwoo Song
Voice, Naver Cloud
Verified email at navercorp.com
Title
Cited by
Year
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram
R Yamamoto, E Song, JM Kim
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 800, Year: 2020
Effective spectral and excitation modeling techniques for LSTM-RNN-based speech synthesis systems
E Song, FK Soong, HG Kang
IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (11 …, 2017
Cited by: 68, Year: 2017
Probability density distillation with generative adversarial networks for high-quality parallel waveform generation
R Yamamoto, E Song, JM Kim
arXiv preprint arXiv:1904.04472, 2019
Cited by: 54, Year: 2019
ExcitNet vocoder: A neural excitation model for parametric speech synthesis systems
E Song, K Byun, HG Kang
2019 27th European Signal Processing Conference (EUSIPCO), 1-5, 2019
Cited by: 41, Year: 2019
TTS-by-TTS: TTS-driven data augmentation for fast and high-quality speech synthesis
MJ Hwang, R Yamamoto, E Song, JM Kim
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 36, Year: 2021
LP-WaveNet: Linear prediction-based WaveNet speech synthesis
MJ Hwang, F Soong, E Song, X Wang, H Kang, HG Kang
2020 Asia-Pacific Signal and Information Processing Association Annual …, 2020
Cited by: 31, Year: 2020
HierSpeech: Bridging the gap between text and speech by hierarchical variational inference using self-supervised representations for speech synthesis
SH Lee, SB Kim, JH Lee, E Song, MJ Hwang, SW Lee
Advances in Neural Information Processing Systems 35, 16624-16636, 2022
Cited by: 27, Year: 2022
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators
R Yamamoto, E Song, MJ Hwang, JM Kim
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 20, Year: 2021
Improved Parallel WaveGAN vocoder with perceptually weighted spectrogram loss
E Song, R Yamamoto, MJ Hwang, JS Kim, O Kwon, JM Kim
2021 IEEE Spoken Language Technology Workshop (SLT), 470-476, 2021
Cited by: 20, Year: 2021
Cross-speaker emotion transfer for low-resource text-to-speech using non-parallel voice conversion with pitch-shift data augmentation
R Terashima, R Yamamoto, E Song, Y Shirahata, HW Yoon, JM Kim, ...
arXiv preprint arXiv:2204.10020, 2022
Cited by: 15, Year: 2022
High-Fidelity Parallel WaveGAN with Multi-Band Harmonic-Plus-Noise Model
MJ Hwang, R Yamamoto, E Song, JM Kim
Interspeech, 2227-2231, 2021
Cited by: 14, Year: 2021
Improving LPCNet-based text-to-speech with linear prediction-structured mixture density network
MJ Hwang, E Song, R Yamamoto, F Soong, HG Kang
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 14, Year: 2020
Language model-based emotion prediction methods for emotional speech synthesis systems
HW Yoon, O Kwon, H Lee, R Yamamoto, E Song, JM Kim, MJ Hwang
arXiv preprint arXiv:2206.15067, 2022
Cited by: 12, Year: 2022
Neural text-to-speech with a modeling-by-generation excitation vocoder
E Song, MJ Hwang, R Yamamoto, JS Kim, O Kwon, JM Kim
arXiv preprint arXiv:2008.00132, 2020
Cited by: 11, Year: 2020
Improved time-frequency trajectory excitation modeling for a statistical parametric speech synthesis system
E Song, YS Joo, HG Kang
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
Cited by: 10, Year: 2015
Period VITS: variational inference with explicit pitch modeling for end-to-end emotional speech synthesis
Y Shirahata, R Yamamoto, E Song, R Terashima, JM Kim, K Tachibana
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 7, Year: 2023
LiteTTS: A Lightweight Mel-Spectrogram-Free Text-to-Wave Synthesizer Based on Generative Adversarial Networks
HK Nguyen, K Jeong, SY Um, MJ Hwang, E Song, HG Kang
Interspeech, 3595-3599, 2021
Cited by: 7, Year: 2021
Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems
O Kwon, E Song, JM Kim, HG Kang
arXiv preprint arXiv:1905.08486, 2019
Cited by: 7, Year: 2019
Deep neural network-based statistical parametric speech synthesis system using improved time-frequency trajectory excitation model
E Song, HG Kang
INTERSPEECH, 874-878, 2015
Cited by: 7, Year: 2015
TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder
E Song, R Yamamoto, O Kwon, CH Song, MJ Hwang, S Oh, HW Yoon, ...
arXiv preprint arXiv:2206.14984, 2022
Cited by: 6, Year: 2022