Florence: A new foundation model for computer vision L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ... arXiv preprint arXiv:2111.11432, 2021 | 889 | 2021 |
Speech lab in a box: a Mandarin speech toolbox to jumpstart speech related research. E Chang, Y Shi, JL Zhou, C Huang INTERSPEECH, 2799-2802, 2001 | 104 | 2001 |
A system for spoken query information retrieval on mobile devices E Chang, F Seide, HM Meng, Z Chen, Y Shi, YC Li IEEE Transactions on Speech and Audio processing 10 (8), 531-541, 2002 | 95 | 2002 |
Handwriting symbol recognition accuracy using speech input L Ma, Y Shi, FKP Soong US Patent 8,077,975, 2011 | 68 | 2011 |
Improving readability for automatic speech recognition transcription J Liao, S Eskimez, L Lu, Y Shi, M Gong, L Shou, H Qu, M Zeng ACM Transactions on Asian and Low-Resource Language Information Processing …, 2023 | 65 | 2023 |
Segmental tonal modeling for phone set design in Mandarin LVCSR C Huang, Y Shi, J Zhou, M Chu, T Wang, E Chang 2004 IEEE International Conference on Acoustics, Speech, and Signal …, 2004 | 52 | 2004 |
Speech-language pre-training for end-to-end spoken language understanding Y Qian, X Bianv, Y Shi, N Kanda, L Shen, Z Xiao, M Zeng ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 45 | 2021 |
i-code: An integrative and composable multimodal learning framework Z Yang, Y Fang, C Zhu, R Pryzant, D Chen, Y Shi, Y Xu, Y Qian, M Gao, ... Proceedings of the AAAI Conference on Artificial Intelligence 37 (9), 10880 …, 2023 | 43 | 2023 |
Tone articulation modeling for Mandarin spontaneous speech recognition J Zhou, Y Tian, Y Shi, C Huang, E Chang 2004 IEEE International Conference on Acoustics, Speech, and Signal …, 2004 | 42 | 2004 |
Mixed-lingual pre-training for cross-lingual summarization R Xu, C Zhu, Y Shi, M Zeng, X Huang arXiv preprint arXiv:2010.08892, 2020 | 32 | 2020 |
Spectrogram-based formant tracking via particle filters Y Shi, E Chang 2003 IEEE International Conference on Acoustics, Speech, and Signal …, 2003 | 30 | 2003 |
Symbol graph generation in handwritten mathematical expression recognition Y Shi, FKP Soong, JI Zhou US Patent 7,885,456, 2011 | 29 | 2011 |
Towards spoken-document retrieval for the enterprise: Approximate word-lattice indexing with text indexers F Seide, P Yu, Y Shi 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU …, 2007 | 22 | 2007 |
Listen, look and deliberate: Visual context-aware speech recognition using pre-trained text-video representations S Ghorbani, Y Gaur, Y Shi, J Li 2021 IEEE Spoken Language Technology Workshop (SLT), 621-628, 2021 | 17 | 2021 |
Symbol graph based discriminative training and rescoring for improved math symbol recognition ZX Luo, Y Shi, FK Soong 2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008 | 16 | 2008 |
Optimizing alignment of speech and language latent spaces for end-to-end speech recognition and understanding W Wang, S Ren, Y Qian, S Liu, Y Shi, Y Qian, M Zeng ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 15 | 2022 |
Approximateword-lattice indexing with text indexers: Time-Anchored Lattice Expansion P Yu, Y Shi, F Seide 2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008 | 15 | 2008 |
Florence: A new foundation model for computer vision. arXiv 2021 L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ... arXiv preprint arXiv:2111.11432, 2021 | 14 | 2021 |
Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition K Kumatani, R Gmyr, FC Salinas, L Liu, W Zuo, D Patel, E Sun, Y Shi arXiv preprint arXiv:2112.05820, 2021 | 13 | 2021 |
Position-dependent phonetic models for reliable pronunciation identification P Liu, Y Shi, FKP Soong US Patent 8,135,590, 2012 | 13 | 2012 |