A comparative study on transformer vs rnn in speech applications S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ... 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 435 | 2019 |
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram R Yamamoto, E Song, JM Kim ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 433 | 2020 |
librosa/librosa: 0.6. 3 B McFee, M McVicar, S Balke, C Thomé, C Raffel, D Lee, O Nieto, ... URL: https://doi. org/10.5281/zenodo 2564164, 2019 | 241* | 2019 |
ESPnet-TTS: Unified, reproducible, and integratable open source end-to-end text-to-speech toolkit T Hayashi, R Yamamoto, K Inoue, T Yoshimura, S Watanabe, T Toda, ... ICASSP 2020-2020 IEEE international conference on acoustics, speech and …, 2020 | 133 | 2020 |
Probability density distillation with generative adversarial networks for high-quality parallel waveform generation R Yamamoto, E Song, JM Kim arXiv preprint arXiv:1904.04472, 2019 | 41 | 2019 |
TTS-by-TTS: TTS-driven data augmentation for fast and high-quality speech synthesis MJ Hwang, R Yamamoto, E Song, JM Kim ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 16 | 2021 |
Improved Parallel WaveGAN vocoder with perceptually weighted spectrogram loss E Song, R Yamamoto, MJ Hwang, JS Kim, O Kwon, JM Kim 2021 IEEE Spoken Language Technology Workshop (SLT), 470-476, 2021 | 16 | 2021 |
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators R Yamamoto, E Song, MJ Hwang, JM Kim ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 14 | 2021 |
Ryry: A real-time score-following automatic accompaniment playback system capable of real performances with errors, repeats and jumps S Sako, R Yamamoto, T Kitamura International Conference on Active Media Technology, 134-145, 2014 | 13 | 2014 |
Score following handling performances with arbitrary repeats and skips and automatic accompaniment E Nakamura, H Takeda, R Yamamoto, Y Saito, S Sako, S Sagayama IPSJ Journal 54 (4), 1338-1349, 2013 | 13 | 2013 |
Semi-supervised speaker adaptation for end-to-end speech synthesis with pretrained models K Inoue, S Hara, M Abe, T Hayashi, R Yamamoto, S Watanabe ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 12 | 2020 |
Improving lpcnet-based text-to-speech with linear prediction-structured mixture density network MJ Hwang, E Song, R Yamamoto, F Soong, HG Kang ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 12 | 2020 |
Neural text-to-speech with a modeling-by-generation excitation vocoder E Song, MJ Hwang, R Yamamoto, JS Kim, O Kwon, JM Kim arXiv preprint arXiv:2008.00132, 2020 | 10 | 2020 |
Wavenet vocoder R Yamamoto | 10 | 2018 |
Robust on-line algorithm for real-time audio-to-score alignment based on a delayed decision and anticipation framework R Yamamoto, S Sako, T Kitamura 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 9 | 2013 |
Espnet2-tts: Extending the edge of tts research T Hayashi, R Yamamoto, T Yoshimura, P Wu, J Shi, T Saeki, Y Ju, ... arXiv preprint arXiv:2110.07840, 2021 | 8 | 2021 |
High-Fidelity Parallel WaveGAN with Multi-Band Harmonic-Plus-Noise Model. MJ Hwang, R Yamamoto, E Song, JM Kim Interspeech, 2227-2231, 2021 | 7 | 2021 |
Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis K Futamata, B Park, R Yamamoto, K Tachibana arXiv preprint arXiv:2104.12395, 2021 | 2 | 2021 |
Real-time audio to score alignment using segmental conditional random fields and linear dynamical system R Yamamoto, S Sako, T Kitarmura Proceedings of the International Society for Music Information Retrieval …, 2012 | 2 | 2012 |
Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems HW Yoon, O Kwon, H Lee, R Yamamoto, E Song, JM Kim, MJ Hwang arXiv preprint arXiv:2206.15067, 2022 | | 2022 |