Heyang Qin
DeepSpeed-Chat: Easy, fast and affordable RLHF training of ChatGPT-like models at all scales
Z Yao, RY Aminabadi, O Ruwase, S Rajbhandari, X Wu, AA Awan, ...
arXiv preprint arXiv:2308.01320, 2023
Phi-3 technical report: A highly capable language model locally on your phone
M Abdin, SA Jacobs, AA Awan, J Aneja, A Awadallah, H Awadalla, ...
arXiv preprint arXiv:2404.14219, 2024
Swift machine learning model serving scheduling: a region based reinforcement learning approach
H Qin, S Zawad, Y Zhou, L Yang, D Zhao, F Yan
Proceedings of the International Conference for High Performance Computing …, 2019
The age of correlated features in supervised learning based forecasting
MKC Shisher, H Qin, L Yang, F Yan, Y Sun
IEEE INFOCOM 2021-IEEE Conference on Computer Communications Workshops …, 2021
Reinforcement-learning-empowered MLaaS scheduling for serving intelligent internet of things
H Qin, S Zawad, Y Zhou, S Padhi, L Yang, F Yan
IEEE Internet of Things Journal 7 (7), 6325-6337, 2020
ZeRO++: Extremely efficient collective communication for giant model training
G Wang, H Qin, SA Jacobs, C Holmes, S Rajbhandari, O Ruwase, F Yan, ...
arXiv preprint arXiv:2306.10209, 2023
Nemo: An open-source transformer-supercharged benchmark for fine-grained wildfire smoke detection
A Yazdi, H Qin, CB Jordan, L Yang, F Yan
Remote Sensing 14 (16), 3979, 2022
DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference
C Holmes, M Tanaka, M Wyatt, AA Awan, J Rasley, S Rajbhandari, ...
arXiv preprint arXiv:2401.08671, 2024
SimiGrad: Fine-grained adaptive batching for large scale training using gradient similarity measurement
H Qin, S Rajbhandari, O Ruwase, F Yan, L Yang, Y He
Advances in Neural Information Processing Systems 34, 20531-20544, 2021
ZeRO++: Extremely Efficient Collective Communication for Large Model Training
G Wang, H Qin, SA Jacobs, X Wu, C Holmes, Z Yao, S Rajbhandari, ...
The Twelfth International Conference on Learning Representations, 2023
Scalable and Efficient Machine Learning as a Service
H Qin
University of Nevada, Reno, 2022
The Age of Correlated Features in Supervised Learning based Forecasting
M Kamran Chowdhury Shisher, H Qin, L Yang, F Yan, Y Sun
arXiv e-prints, arXiv:2103.00092, 2021