Follow
Yashesh Gaur
Yashesh Gaur
Meta AI
Verified email at cs.cmu.edu
Title
Cited by
Cited by
Year
Exploring neural transducers for end-to-end speech recognition
E Battenberg, J Chen, R Child, A Coates, YGY Li, H Liu, S Satheesh, ...
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017
266*2017
On the comparison of popular end-to-end models for large scale speech recognition
J Li, Y Wu, Y Gaur, C Wang, R Zhao, S Liu
arXiv preprint arXiv:2005.14327, 2020
1392020
Internal language model estimation for domain-adaptive end-to-end speech recognition
Z Meng, S Parthasarathy, E Sun, Y Gaur, N Kanda, L Lu, X Chen, R Zhao, ...
2021 IEEE Spoken Language Technology Workshop (SLT), 243-250, 2021
912021
Serialized output training for end-to-end overlapped speech recognition
N Kanda, Y Gaur, X Wang, Z Meng, T Yoshioka
arXiv preprint arXiv:2003.12687, 2020
902020
Joint speaker counting, speech recognition, and speaker identification for overlapped speech of any number of speakers
N Kanda, Y Gaur, X Wang, Z Meng, Z Chen, T Zhou, T Yoshioka
arXiv preprint arXiv:2006.10930, 2020
692020
Robust speech recognition using generative adversarial networks
A Sriram, H Jun, Y Gaur, S Satheesh
2018 IEEE international conference on acoustics, speech and signal …, 2018
682018
The effects of automatic speech recognition quality on human transcription latency
Y Gaur, WS Lasecki, F Metze, JP Bigham
Proceedings of the 13th International Web for All Conference, 1-8, 2016
552016
Minimum latency training strategies for streaming sequence-to-sequence ASR
H Inaguma, Y Gaur, L Lu, J Li, Y Gong
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
532020
Domain adaptation via teacher-student learning for end-to-end speech recognition
Z Meng, J Li, Y Gaur, Y Gong
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
502019
Internal language model training for domain-adaptive end-to-end speech recognition
Z Meng, N Kanda, Y Gaur, S Parthasarathy, E Sun, L Lu, X Chen, J Li, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
472021
Speaker adaptation for attention-based end-to-end speech recognition
Z Meng, Y Gaur, J Li, Y Gong
arXiv preprint arXiv:1911.03762, 2019
402019
Investigation of end-to-end speaker-attributed ASR for continuous multi-talker recordings
N Kanda, X Chang, Y Gaur, X Wang, Z Meng, Z Chen, T Yoshioka
2021 IEEE Spoken Language Technology Workshop (SLT), 809-816, 2021
382021
A Federated Approach in Training Acoustic Models.
D Dimitriadis, RG Ken'ichi Kumatani, R Gmyr, Y Gaur, SE Eskimez
Interspeech, 981-985, 2020
382020
Streaming multi-talker ASR with token-level serialized output training
N Kanda, J Wu, Y Wu, X Xiao, Z Meng, X Wang, Y Gaur, Z Chen, J Li, ...
arXiv preprint arXiv:2202.00842, 2022
372022
Large-scale pre-training of end-to-end multi-talker ASR for meeting transcription with single distant microphone
N Kanda, G Ye, Y Wu, Y Gaur, X Wang, Z Meng, Z Chen, T Yoshioka
arXiv preprint arXiv:2103.16776, 2021
332021
Viola: Unified codec language models for speech recognition, synthesis, and translation
T Wang, L Zhou, Z Zhang, Y Wu, S Liu, Y Gaur, Z Chen, J Li, F Wei
arXiv preprint arXiv:2305.16107, 2023
302023
End-to-end speaker-attributed ASR with transformer
N Kanda, G Ye, Y Gaur, X Wang, Z Meng, Z Chen, T Yoshioka
arXiv preprint arXiv:2104.02128, 2021
302021
Internal language model adaptation with text-only data for end-to-end speech recognition
Z Meng, Y Gaur, N Kanda, J Li, X Chen, Y Wu, Y Gong
arXiv preprint arXiv:2110.05354, 2021
232021
Transcribe-to-diarize: Neural speaker diarization for unlimited number of speakers using end-to-end speaker-attributed ASR
N Kanda, X Xiao, Y Gaur, X Wang, Z Meng, Z Chen, T Yoshioka
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
212022
Systems and methods for robust speech recognition using generative adversarial networks
A Sriram, HW Jun, G Yashesh, S Satheesh
US Patent 10,971,142, 2021
212021
The system can't perform the operation now. Try again later.
Articles 1–20