Natural tts synthesis by conditioning wavenet on mel spectrogram predictions J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, Y Zhang, ... 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 1793 | 2018 |
Transfer learning from speaker verification to multispeaker text-to-speech synthesis Y Jia, Y Zhang, R Weiss, Q Wang, J Shen, F Ren, P Nguyen, R Pang, ... Advances in neural information processing systems 31, 2018 | 501 | 2018 |
Hierarchical generative modeling for controllable speech synthesis WN Hsu, Y Zhang, RJ Weiss, H Zen, Y Wu, Y Wang, Y Cao, Y Jia, Z Chen, ... arXiv preprint arXiv:1810.07217, 2018 | 174 | 2018 |
Lingvo: a modular and scalable framework for sequence-to-sequence modeling J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ... arXiv preprint arXiv:1902.08295, 2019 | 142 | 2019 |
SATzilla2012: Improved algorithm selection based on cost-sensitive classification models L Xu, F Hutter, J Shen, HH Hoos, K Leyton-Brown Proceedings of SAT Challenge, 57-58, 2012 | 98 | 2012 |
Parallel tacotron: Non-autoregressive and controllable tts I Elias, H Zen, J Shen, Y Zhang, Y Jia, RJ Weiss, Y Wu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 54 | 2021 |
Neural program synthesis with priority queue training DA Abolafia, M Norouzi, J Shen, R Zhao, QV Le arXiv preprint arXiv:1801.03526, 2018 | 48 | 2018 |
Non-attentive tacotron: Robust and controllable neural TTS synthesis including unsupervised duration modeling J Shen, Y Jia, M Chrzanowski, Y Zhang, I Elias, H Zen, Y Wu arXiv preprint arXiv:2010.04301, 2020 | 40 | 2020 |
In teacher we trust: Learning compressed models for pedestrian detection J Shen, N Vesdapunt, VN Boddeti, KM Kitani arXiv preprint arXiv:1612.00478, 2016 | 31 | 2016 |
PnG BERT: Augmented BERT on phonemes and graphemes for neural TTS Y Jia, H Zen, J Shen, Y Zhang, Y Wu arXiv preprint arXiv:2103.15060, 2021 | 28 | 2021 |
Parallel tacotron 2: A non-autoregressive neural TTS model with differentiable duration modeling I Elias, H Zen, J Shen, Y Zhang, Y Jia, RJ Skerry-Ryan, Y Wu arXiv preprint arXiv:2103.14574, 2021 | 19 | 2021 |
Synthesizing speech from text using neural networks Y Wu, J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, ... US Patent 10,971,170, 2021 | 13 | 2021 |
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions.(2018) J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, Y Zhang, ... arXiv preprint cs.CL/1712.05884, 2018 | 2 | 2018 |
Modelling intonation in spectrograms for neural vocoder based text-to-speech V Wan, J Shen, H Silen, R Clark | 1 | 2020 |
Building a text-to-speech system from a small amount of speech data Y Jia, B Chun, ODA Yusuke, N Casagrande, T Iyer, F Luo, ... US Patent 11,335,321, 2022 | | 2022 |
Parallel Tacotron Non-Autoregressive and Controllable TTS I Elias, J Shen, Y Zhang, Y Jia, RJ Weiss, Y Wu, B Chun US Patent App. 17/327,076, 2022 | | 2022 |
Text-to-speech using duration prediction Y Zhang, I Elias, B Chun, Y Jia, Y Wu, M Chrzanowski, J Shen US Patent App. 17/492,543, 2022 | | 2022 |
Examining Scaling and Transfer of Language Model Architectures for Machine Translation B Zhang, B Ghorbani, A Bapna, Y Cheng, X Garcia, J Shen, O Firat arXiv preprint arXiv:2202.00528, 2022 | | 2022 |
Synthesis of Speech from Text in a Voice of a Target Speaker Using Neural Networks Y Jia, Z Chen, Y Wu, J Shen, R Pang, RJ Weiss, IL Moreno, F Ren, ... US Patent App. 17/055,951, 2021 | | 2021 |
Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Alignments I Elias, H Zen, J Shen, Y Zhang, Y Jia, RJ Skerry-Ryan, Y Wu | | 2021 |