Unsupervised learning for sequence-to-sequence text-to-speech for low-resource languages H Zhang, Y Lin Proc. Interspeech 2020, 3161-3165, 2020 | 34 | 2020 |
DGC-vector: A new speaker embedding for zero-shot voice conversion R Xiao, H Zhang, Y Lin ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 12 | 2022 |
Improve few-shot voice cloning using multi-modal learning H Zhang, Y Lin ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 11 | 2022 |
Improve Cross-Lingual Text-To-Speech Synthesis on Monolingual Corpora with Pitch Contour Information H Zhan, H Zhang, W Ou, Y Lin Interspeech, 1599-1603, 2021 | 9 | 2021 |
Sequence-to-sequence models for small-footprint keyword spotting H Zhang, J Zhang, Y Wang arXiv preprint arXiv:1811.00348, 2018 | 9 | 2018 |
The NeteaseGames system for voice conversion challenge 2020 with vector-quantization variational autoencoder and WaveNet H Zhang In Proc. Joint Workshop for the Blizzard Challenge and Voice Conversion …, 2020 | 6 | 2020 |
Exploring Timbre Disentanglement in Non-Autoregressive Cross-Lingual Text-to-Speech H Zhan, X Yu, H Zhang, Y Zhang, Y Lin arXiv preprint arXiv:2110.07192, 2021 | 4 | 2021 |
Revisiting IPA-based Cross-lingual Text-to-speech H Zhang, H Zhan, Y Zhang, X Yu, Y Lin arXiv preprint arXiv:2110.07187, 2021 | 4 | 2021 |
Data augmentation for long-tailed and imbalanced polyphone disambiguation in Mandarin Y Zhang, H Zhang, Y Lin ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 3 | 2022 |
RawNet: Fast end-to-end neural vocoder Y He, Y Wang arXiv preprint arXiv:1904.05351, 2019 | 3 | 2019 |
End-to-end models with auditory attention in multi-channel keyword spotting H Zhang, J Zhang, Y Wang arXiv preprint arXiv:1811.00350, 2018 | 3 | 2018 |
NSV-TTS: Non-Speech Vocalization Modeling And Transfer In Emotional Text-To-Speech H Zhang, X Yu, Y Lin ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 1 | 2023 |
TEXT-VC: Text-Guided Any-to-Many Voice Conversion H Zhang, H Zhan, L Yue | 1 | 2022 |
Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data H Zhang, Y Lin arXiv preprint arXiv:2110.07210, 2021 | | 2021 |
Improve Text Analysis for Mandarin Chinese TTS Using Data Augmentation H Zhang, Y Zhang, Y Lin | | |
Cross-Speaker Style Transfer Using Curriculum Learning and Data Augmentation H Zhang, H Zhan, X Yu, Y Lin | | |