A noise-robust self-supervised pre-training model based speech representation learning for automatic speech recognition QS Zhu, J Zhang, ZQ Zhang, MH Wu, X Fang, LR Dai ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 42 | 2022 |
A Joint Speech Enhancement and Self-Supervised Representation Learning Framework for Noise-Robust Speech Recognition QS Zhu, J Zhang, ZQ Zhang, LR Dai IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023 | 33* | 2023 |
Robust data2vec: Noise-robust speech representation learning for asr by combining regression and improved contrastive learning QS Zhu, L Zhou, J Zhang, SJ Liu, YC Hu, LR Dai ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 25 | 2023 |
VatLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning Q Zhu, L Zhou, Z Zhang, S Liu, B Jiao, J Zhang, L Dai, D Jiang, J Li, F Wei IEEE Transactions on Multimedia, 2023 | 24 | 2023 |
Gradient remedy for multi-task learning in end-to-end noise-robust speech recognition Y Hu, C Chen, R Li, Q Zhu, ES Chng ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 15 | 2023 |
Supervised and self-supervised pretraining based COVID-19 detection using acoustic breathing/cough/speech signals XY Chen*, QS Zhu*, J Zhang, LR Dai *:Equal Contribution; ICASSP 2022-2022 IEEE International Conference on …, 2022 | 13 | 2022 |
Wav2code: Restore clean speech representations via codebook lookup for noise-robust asr Y Hu, C Chen, Q Zhu, ES Chng IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023 | 8 | 2023 |
BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with Convolutional Cross Attention in Multi-talker Conditions J Zhang, QT Xu, QS Zhu, ZH Ling Interspeech 2023, 2023 | 8 | 2023 |
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition Y Hu, R Li, C Chen, H Zou, Q Zhu, ES Chng IJCAI 2023, 2023 | 4 | 2023 |
An Improved Wav2Vec 2.0 Pre-Training Approach Using Enhanced Local Dependency Modeling for Speech Recognition. Q Zhu, J Zhang, M Wu, X Fang, LR Dai Interspeech, 4334-4338, 2021 | 4 | 2021 |
Speech Enhancement Using Self-Supervised Pre-Trained Model and Vector Quantization XY Zhao, QS Zhu, J Zhang 2022 Asia-Pacific Signal and Information Processing Association Annual …, 2022 | 3 | 2022 |
Noise-aware Speech Enhancement using Diffusion Probabilistic Model Y Hu, C Chen, R Li, Q Zhu, ES Chng arXiv preprint arXiv:2307.08029, 2023 | 2 | 2023 |
Rep2wav: Noise Robust text-to-speech Using self-supervised representations Q Zhu, Y Gu, C Weng, Y Hu, L Dai, J Zhang arXiv preprint arXiv:2308.14553, 2023 | 1 | 2023 |
Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition Y Hu, R Li, C Chen, C Qin, Q Zhu, ES Chng ACL 2023, 2023 | 1 | 2023 |
Eeg2vec: Self-Supervised Electroencephalographic Representation Learning Q Zhu, X Zhao, J Zhang, Y Gu, C Weng, Y Hu arXiv preprint arXiv:2305.13957, 2023 | 1 | 2023 |
A Complementary Joint Training Approach Using Unpaired Speech and Text A Complementary Joint Training Approach Using Unpaired Speech and Text Y Du, J Zhang, Q Zhu, L Dai, MH Wu, X Fang, ZW Yang Proc. Interspeech 2022, 2613-2617, 2022 | 1 | 2022 |
DurIAN-E 2: Duration Informed Attention Network with Adaptive Variational Autoencoder and Adversarial Learning for Expressive Text-to-Speech Synthesis Y Gu, Q Zhu, G Lei, C Weng, D Su ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
An Experimental Comparison of Noise-Robust Text-To-Speech Synthesis Systems Based On Self-Supervised Representation X Zhao, Q Zhu, Y Hu ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation L Zhu, Qiushi and Zhang, Jie and Gu, Yu and Hu, Yuchen and Dai Proceedings of the AAAI Conference on Artificial Intelligence 38, 19768-19776, 2024 | | 2024 |
Speech Enhancement with Multi-granularity Vector Quantization X Zhao, Q Zhu, J Zhang, Y Zhou, P Liu 2023 Asia Pacific Signal and Information Processing Association Annual …, 2023 | | 2023 |