Improving Prosody Modelling with Cross-Utterance Bert Embeddings for End-to-End Speech Synthesis G Xu, W Song, Z Zhang, C Zhang, X He, B Zhou ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 55 | 2021 |
Weighted graph model based sentence clustering and ranking for document summarization SS Ge, Z Zhang, H He The 4th International Conference on Interaction Sciences, 90-95, 2011 | 47 | 2011 |
Speaker state classification based on fusion of asymmetric simple partial least squares (SIMPLS) and support vector machines DY Huang, Z Zhang, SS Ge Computer Speech & Language 28 (2), 392-419, 2014 | 37 | 2014 |
Telerobotic Pointing Gestures Shape Human Spatial Cognition JJ Cabibihan, WC So, S Saj, Z Zhang International Journal of Social Robotics, 1-10, 2012 | 33 | 2012 |
Incremental learning for end-to-end automatic speech recognition L Fu, X Li, L Zi, Z Zhang, Y Wu, X He, B Zhou 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 28 | 2021 |
A light-weight method of building an LSTM-RNN-based bilingual TTS system H Ming, Y Lu, Z Zhang, M Dong 2017 International Conference on Asian Language Processing (IALP), 201-205, 2017 | 28 | 2017 |
Speaker State Classification Based on Fusion of Asymmetric SIMPLS and Support Vector Machines DY Huang, SS Ge, Z Zhang Twelfth Annual Conference of the International Speech Communication Association, 2011 | 25 | 2011 |
Design and development of Nancy, a social robot SS Ge, JJ Cabibihan, Z Zhang, Y Li, C Meng, H He, MR Safizadeh, YB Li, ... Ubiquitous Robots and Ambient Intelligence (URAI), 2011 8th International …, 2011 | 24 | 2011 |
Mutual-reinforcement document summarization using embedded graph based sentence clustering for storytelling Z Zhang, SS Ge, H He Information Processing & Management 48 (4), 767-778, 2012 | 20 | 2012 |
A saliency-driven robotic head with bio-inspired saccadic behaviors for social robotics H He, SS Ge, Z Zhang Autonomous Robots 36 (3), 225-240, 2014 | 16 | 2014 |
Mandarin Prosodic Phrase Prediction based on Syntactic Trees Z Zhang, F Wu, C Yang, M Dong, F Zhou 9th ISCA Speech Synthesis Workshop}, 160-165, 2016 | 15 | 2016 |
Dian: duration informed auto-regressive network for voice cloning W Song, X Yuan, Z Zhang, C Zhang, Y Wu, X He, B Zhou ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 11 | 2021 |
Bottom-up saliency detection for attention determination SS Ge, H He, Z Zhang Machine Vision and Applications, 1-14, 2011 | 11 | 2011 |
Node-level parallelization for deep neural networks with conditional independent graph F Zhou, F Wu, Z Zhang, M Dong Neurocomputing 267, 261-270, 2017 | 8 | 2017 |
Visual Attention Prediction Using Saliency Determination of Scene Understanding for Social Robots H He, SS Ge, Z Zhang International Journal of Social Robotics, 1-12, 2011 | 8 | 2011 |
Efficient WaveGlow: An Improved WaveGlow Vocoder with Enhanced Speed. W Song, G Xu, Z Zhang, C Zhang, X He, B Zhou INTERSPEECH, 225-229, 2020 | 7 | 2020 |
MaskedSpeech: Context-aware Speech Synthesis with Masking Strategy YJ Zhang, W Song, Y Yue, Z Zhang, Y Wu, X He arXiv preprint arXiv:2211.06170, 2022 | 6 | 2022 |
Onset detection based on fusion of simpls and superflux Z Zhang, D Huang, R Zhao, M Dong Music Information Retrieval Evaluation eXchange, 2013 | 6 | 2013 |
Prosody Modelling with Pre-trained Cross-utterance Representations for Improved Speech Synthesis YJ Zhang, C Zhang, W Song, Z Zhang, Y Wu, X He IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023 | 5 | 2023 |
Multi-speaker Multi-style Speech Synthesis with Timbre and Style Disentanglement W Song, Y Yue, Y Zhang, Z Zhang, Y Wu, X He National Conference on Man-Machine Speech Communication, 132-140, 2022 | 5 | 2022 |