Xia Song
Title
Cited by
Year
MS MARCO: A human-generated machine reading comprehension dataset
T Nguyen, M Rosenberg, X Song, J Gao, S Tiwary, R Majumder, L Deng
Cited by: 1456; Year: 2016
MS MARCO: A human generated machine reading comprehension dataset
P Bajaj, D Campos, N Craswell, L Deng, J Gao, X Liu, R Majumder, ...
arXiv preprint arXiv:1611.09268, 2016
Cited by: 609; Year: 2016
Using DeepSpeed and Megatron to train Megatron-Turing NLG 530B, a large-scale generative language model
S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ...
arXiv preprint arXiv:2201.11990, 2022
Cited by: 475; Year: 2022
InfoXLM: An information-theoretic framework for cross-lingual language model pre-training
Z Chi, L Dong, F Wei, N Yang, S Singhal, W Wang, X Song, XL Mao, ...
arXiv preprint arXiv:2007.07834, 2020
Cited by: 292; Year: 2020
Language is not all you need: Aligning perception with language models
S Huang, L Dong, W Wang, Y Hao, S Singhal, S Ma, T Lv, L Cui, ...
Advances in Neural Information Processing Systems 36, 2024
Cited by: 246; Year: 2024
COCO-LM: Correcting and contrasting text sequences for language model pretraining
Y Meng, C Xiong, P Bajaj, P Bennett, J Han, X Song
Advances in Neural Information Processing Systems 34, 23102-23114, 2021
Cited by: 176; Year: 2021
Transformer-XH: Multi-evidence reasoning with extra hop attention
C Zhao, C Xiong, C Rosset, X Song, P Bennett, S Tiwary
International Conference on Learning Representations, 2019
Cited by: 114; Year: 2019
XLM-E: Cross-lingual language model pre-training via ELECTRA
Z Chi, S Huang, L Dong, S Ma, B Zheng, S Singhal, P Bajaj, X Song, ...
arXiv preprint arXiv:2106.16138, 2021
Cited by: 98; Year: 2021
Neural ranking models with multiple document fields
H Zamani, B Mitra, X Song, N Craswell, S Tiwary
Proceedings of the eleventh ACM international conference on web search and …, 2018
Cited by: 96; Year: 2018
Pushing the limits of narrow precision inferencing at cloud scale with Microsoft Floating Point
B Darvish Rouhani, D Lo, R Zhao, M Liu, J Fowers, K Ovtcharov, ...
Advances in neural information processing systems 33, 10271-10281, 2020
Cited by: 94; Year: 2020
MS MARCO: A human generated machine reading comprehension dataset
DF Campos, T Nguyen, M Rosenberg, X Song, J Gao, S Tiwary, ...
ArXiv, abs/1611.09268 29, 2016
Cited by: 84; Year: 2016
Leading conversational search by suggesting useful questions
C Rosset, C Xiong, X Song, D Campos, N Craswell, S Tiwary, P Bennett
Proceedings of the web conference 2020, 1160-1170, 2020
Cited by: 82; Year: 2020
A length-extrapolatable transformer
Y Sun, L Dong, B Patra, S Ma, S Huang, A Benhaim, V Chaudhary, ...
arXiv preprint arXiv:2212.10554, 2022
Cited by: 64; Year: 2022
DeltaLM: Encoder-decoder pre-training for language generation and translation by augmenting pretrained multilingual encoders
S Ma, L Dong, S Huang, D Zhang, A Muzio, S Singhal, HH Awadalla, ...
arXiv preprint arXiv:2106.13736, 2021
Cited by: 60; Year: 2021
Knowledge-aware language model pretraining
C Rosset, C Xiong, M Phan, X Song, P Bennett, S Tiwary
arXiv preprint arXiv:2007.00655, 2020
Cited by: 60; Year: 2020
Generic intent representation in web search
H Zhang, X Song, C Xiong, C Rosset, PN Bennett, N Craswell, S Tiwary
Proceedings of the 42nd International ACM SIGIR Conference on Research and …, 2019
Cited by: 50; Year: 2019
Consistency regularization for cross-lingual fine-tuning
B Zheng, L Dong, S Huang, W Wang, Z Chi, S Singhal, W Che, T Liu, ...
arXiv preprint arXiv:2106.08226, 2021
Cited by: 40; Year: 2021
On the representation collapse of sparse mixture of experts
Z Chi, L Dong, S Huang, D Dai, S Ma, B Patra, S Singhal, P Bajaj, X Song, ...
Advances in Neural Information Processing Systems 35, 34600-34613, 2022
Cited by: 39; Year: 2022
MS MARCO: A human-generated machine reading comprehension dataset (2016)
T Nguyen, M Rosenberg, X Song, J Gao, S Tiwary, R Majumder, L Deng
arXiv preprint arXiv:1611.09268, 2016
Cited by: 39; Year: 2016
METRO: Efficient denoising pretraining of large scale autoencoding language models with model generated signals
P Bajaj, C Xiong, G Ke, X Liu, D He, S Tiwary, TY Liu, P Bennett, X Song, ...
arXiv preprint arXiv:2204.06644, 2022
Cited by: 32; Year: 2022