Zhehuai Chen

Cited by

	All	Since 2019
Citations	1271	1149
h-index	19	17
i10-index	35	31

400

200

100

300

20162017201820192020202120222023202413 12 84 109 84 99 161 398 297

Public access

View all

8 articles

3 articles

available

not available

Based on funding mandates

Co-authors

Bhuvana RamabhadranManager, GoogleVerified email at google.com
Andrew RosenbergGoogleVerified email at google.com
Kai Yu（俞凯）Shanghai Jiao Tong UniversityVerified email at sjtu.edu.cn
Yu ZhangOpenAIVerified email at csail.mit.edu
Gary WangGoogleVerified email at google.com
Yonghui WuGoogle BrainVerified email at google.com
Ankur BapnaSoftware Engineer, Google DeepmindVerified email at google.com
Yanmin QianProfessor, Shanghai Jiao Tong UniversityVerified email at sjtu.edu.cn
Yongqiang WangResearch Scientist, GoogleVerified email at google.com
Heiga ZenPrincipal Scientist (Director), Google DeepMindVerified email at google.com
Mike SeltzerFacebookVerified email at fb.com
Christian FuegenFacebook Inc.Verified email at fb.com
Mahaveer JainGoogleVerified email at google.com
Jasha DroppoAmazonVerified email at amazon.com
Wei HanOpenAIVerified email at illinois.edu
Rohit PrabhavalkarStaff Research Scientist, GoogleVerified email at google.com
Hainan XuNVIDIAVerified email at nvidia.com
Yimeng ZhuangSamsung Research China - Beijing (SRC-B)Verified email at samsung.com
Parisa HaghaniGoogleVerified email at google.com
Daniel PoveyChief Speech Scientist, Xiaomi Corp.Verified email at xiaomi.com

Zhehuai Chen

NVIDIA

Verified email at nvidia.com - Homepage

Speech Recognition Speech Synthesis


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Google usm: Scaling automatic speech recognition beyond 100 languages Y Zhang, W Han, J Qin, Y Wang, A Bapna, Z Chen, N Chen, B Li, ... arXiv preprint arXiv:2303.01037, 2023	178	2023
Maestro: Matched speech text representations through modality matching Z Chen, Y Zhang, A Rosenberg, B Ramabhadran, P Moreno, A Bapna, ... arXiv preprint arXiv:2204.03409, 2022	89	2022
Progressive joint modeling in unsupervised single-channel overlapped speech recognition Z Chen, J Droppo, J Li, W Xiong IEEE/ACM Transactions on Audio, Speech, and Language Processing 26 (1), 184-196, 2017	82	2017
Knowledge Distillation for Sequence Model. M Huang, Y You, Z Chen, Y Qian, K Yu Interspeech, 3703-3707, 2018	71	2018
Improving speech recognition using consistent predictions on synthesized speech G Wang, A Rosenberg, Z Chen, Y Zhang, B Ramabhadran, Y Wu, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	58	2020
Joint Grapheme and Phoneme Embeddings for Contextual End-to-End ASR. Z Chen, M Jain, Y Wang, ML Seltzer, C Fuegen Interspeech, 3490-3494, 2019	55	2019
End-to-end contextual speech recognition using class language models and a token passing decoder Z Chen, M Jain, Y Wang, ML Seltzer, C Fuegen ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	53	2019
Phone synchronous speech recognition with ctc lattices Z Chen, Y Zhuang, Y Qian, K Yu IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (1), 90-101, 2016	43	2016
Improving Speech Recognition Using GAN-Based Speech Synthesis and Contrastive Unspoken Text Selection. Z Chen, A Rosenberg, Y Zhang, G Wang, B Ramabhadran, PJ Moreno Interspeech, 556-560, 2020	38	2020
Injecting text in self-supervised speech pretraining Z Chen, Y Zhang, A Rosenberg, B Ramabhadran, G Wang, P Moreno 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021	35	2021
On modular training of neural acoustics-to-word model for lvcsr Z Chen, Q Liu, H Li, K Yu 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	34	2018
Joist: A joint speech and text streaming model for asr TN Sainath, R Prabhavalkar, A Bapna, Y Zhang, Z Huo, Z Chen, B Li, ... 2022 IEEE Spoken Language Technology Workshop (SLT), 52-59, 2023	30	2023
Tacotron: Towards end-to-end speech synthesis. arXiv 2017 Y Wang, R Skerry-Ryan, D Stanton, Y Wu, RJ Weiss, N Jaitly, Z Yang, ... arXiv preprint arXiv:1703.10135, 2017	29	2017
Phone Synchronous Decoding with CTC Lattice. Z Chen, W Deng, T Xu, K Yu Interspeech, 1923-1927, 2016	24	2016
Tts4pretrain 2.0: Advancing the use of text and speech in asr pretraining with consistency and contrastive losses Z Chen, Y Zhang, A Rosenberg, B Ramabhadran, P Moreno, G Wang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	22	2022
Sequence discriminative training for deep learning based acoustic keyword spotting Z Chen, Y Qian, K Yu Speech Communication 102, 100-111, 2018	22	2018
Sequence modeling in unsupervised single-channel overlapped speech recognition Z Chen, J Droppo 2018 IEEE international conference on acoustics, speech and signal …, 2018	21	2018
Palm 2 technical report. arXiv 2023 R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 0	20
A gpu-based wfst decoder with exact lattice generation Z Chen, J Luitjens, H Xu, Y Wang, D Povey, S Khudanpur arXiv preprint arXiv:1804.03243, 2018	19	2018
Accented speech recognition: Benchmarking, pre-training, and diverse data A Aksënova, Z Chen, CC Chiu, D van Esch, P Golik, W Han, L King, ... arXiv preprint arXiv:2205.08014, 2022	18	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors