Xiaodong Cui
Principal Research Scientist, IBM T. J. Watson Research Center
Verified email at us.ibm.com
Title
Cited by
Year
Data augmentation for deep neural network acoustic modeling
X Cui, V Goel, B Kingsbury
IEEE/ACM Transactions on Audio, Speech, and Language Processing 23 (9), 1469 …, 2015
Cited by: 443, Year: 2015
English conversational telephone speech recognition by humans and machines
G Saon, G Kurata, T Sercu, K Audhkhasi, S Thomas, D Dimitriadis, X Cui, ...
arXiv preprint arXiv:1703.02136, 2017
Cited by: 414, Year: 2017
Dilated recurrent neural networks
S Chang, Y Zhang, W Han, M Yu, X Guo, W Tan, X Cui, M Witbrock, ...
Advances in Neural Information Processing Systems 30, 2017
Cited by: 250, Year: 2017
Hybrid 8-bit floating point (HFP8) training and inference for deep neural networks
X Sun, J Choi, CY Chen, N Wang, S Venkataramani, VV Srinivasan, X Cui, ...
Advances in Neural Information Processing Systems 32, 2019
Cited by: 119, Year: 2019
A database of vocal tract resonance trajectories for research in speech processing
L Deng, X Cui, R Pruvenok, J Huang, S Momen, Y Chen, A Alwan
2006 IEEE International Conference on Acoustics Speech and Signal Processing …, 2006
Cited by: 119, Year: 2006
Multilingual representations for low resource speech recognition and keyword search
J Cui, B Kingsbury, B Ramabhadran, A Sethy, K Audhkhasi, X Cui, ...
2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015
Cited by: 96, Year: 2015
Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR
X Cui, A Alwan
IEEE Transactions on Speech and Audio Processing 13 (6), 1161-1172, 2005
Cited by: 90, Year: 2005
System combination and score normalization for spoken term detection
J Mamou, J Cui, X Cui, MJF Gales, B Kingsbury, K Knill, L Mangu, ...
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
Cited by: 89, Year: 2013
Ultra-low precision 4-bit training of deep neural networks
X Sun, N Wang, CY Chen, J Ni, A Agrawal, X Cui, S Venkataramani, ...
Advances in Neural Information Processing Systems 33, 1796-1807, 2020
Cited by: 73, Year: 2020
Stereo-based stochastic mapping for robust speech recognition
M Afify, X Cui, Y Gao
IEEE Transactions on Audio, Speech, and Language Processing 17 (7), 1325-1334, 2009
Cited by: 72, Year: 2009
Evolutionary stochastic gradient descent for optimization of deep neural networks
X Cui, W Zhang, Z Tüske, M Picheny
Advances in Neural Information Processing Systems 31, 2018
Cited by: 69, Year: 2018
A high-performance Cantonese keyword search system
B Kingsbury, J Cui, X Cui, MJF Gales, K Knill, J Mamou, L Mangu, ...
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
Cited by: 61, Year: 2013
Data augmentation for deep convolutional neural network acoustic modeling
X Cui, V Goel, B Kingsbury
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
Cited by: 56, Year: 2015
Tball data collection: the making of a young children's speech corpus
A Kazemzadeh, H You, M Iseli, B Jones, X Cui, M Heritage, P Price, ...
Ninth European Conference on Speech Communication and Technology, 2005
Cited by: 53, Year: 2005
Towards better understanding of adaptive gradient algorithms in generative adversarial nets
M Liu, Y Mroueh, J Ross, W Zhang, X Cui, P Das, T Yang
arXiv preprint arXiv:1912.11940, 2019
Cited by: 51, Year: 2019
A study of variable-parameter Gaussian mixture hidden Markov modeling for noisy speech recognition
X Cui, Y Gong
IEEE Transactions on Audio, Speech, and Language Processing 15 (4), 1366-1376, 2007
Cited by: 50, Year: 2007
Developing speech recognition systems for corpus indexing under the IARPA Babel program
J Cui, X Cui, B Ramabhadran, J Kim, B Kingsbury, J Mamou, L Mangu, ...
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
Cited by: 49, Year: 2013
An empirical study of confusion modeling in keyword search for low resource languages
M Saraclar, A Sethy, B Ramabhadran, L Mangu, J Cui, X Cui, B Kingsbury, ...
2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 464-469, 2013
Cited by: 44, Year: 2013
A decentralized parallel algorithm for training generative adversarial nets
M Liu, W Zhang, Y Mroueh, X Cui, J Ross, T Yang, P Das
Advances in Neural Information Processing Systems 33, 11056-11070, 2020
Cited by: 42, Year: 2020
Adaptation of children’s speech with limited data based on formant-like peak alignment
X Cui, A Alwan
Computer Speech & Language 20 (4), 400-419, 2006
Cited by: 40, Year: 2006