Honghao Wei
Honghao Wei
Assistant Professor of EECS, Washington State University
Verified email at - Homepage
Cited by
Cited by
Triple-q: A model-free algorithm for constrained reinforcement learning with sublinear regret and zero constraint violation
H Wei, X Liu, L Ying
International Conference on Artificial Intelligence and Statistics, 3274-3307, 2022
Spectrum anomalies autonomous detection in cognitive radio using hidden markov models
W Honghao, J Yunfeng, W Lei
2015 IEEE advanced information technology, electronic and automation control …, 2015
A provably-efficient model-free algorithm for infinite-horizon average-reward constrained Markov decision processes
H Wei, X Liu, L Ying
Proceedings of the AAAI Conference on Artificial Intelligence 36 (4), 3868-3876, 2022
Online convex optimization with hard constraints: Towards the best of two worlds and beyond
H Guo, X Liu, H Wei, L Ying
Advances in Neural Information Processing Systems 35, 36426-36439, 2022
General range model for multi‐channel SAR/GMTI with curvilinear flight trajectory
Z Chen, Y Zhou, L Zhang, H Wei, C Lin, N Liu, J Wan
Electronics Letters 55 (2), 111-112, 2019
QuickStop: A Markov optimal stopping approach for quickest misinformation detection
H Wei, X Kang, W Wang, L Ying
Proceedings of the ACM on Measurement and Analysis of Computing Systems 3 (2 …, 2019
Provably efficient model-free algorithms for non-stationary CMDPs
H Wei, A Ghosh, N Shroff, L Ying, X Zhou
International Conference on Artificial Intelligence and Statistics, 6527-6570, 2023
Scalable and sample efficient distributed policy gradient algorithms in multi-agent networked systems
X Liu, H Wei, L Ying
arXiv preprint arXiv:2212.06357, 2022
Fork: A forward-looking actor for model-free reinforcement learning
H Wei, L Ying
2021 60th IEEE Conference on Decision and Control (CDC), 1554-1559, 2021
Sample Efficient Reinforcement Learning in Mixed Systems through Augmented Samples and Its Applications to Queueing Networks
H Wei, X Liu, W Wang, L Ying
NeurIPS-23 Spotlight, 2023
On low-complexity quickest intervention of mutated diffusion processes through local approximation
Q Zhang, H Wei, W Wang, L Ying
Proceedings of the Twenty-Third International Symposium on Theory …, 2022
Adversarially Trained Actor Critic for offline CMDPs
H Wei, X Peng, X Liu, A Ghosh
arXiv preprint arXiv:2401.00629, 2024
Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration
H Wei, X Liu, L Ying
AAAI, 2024
Model-Free, Regret-Optimal Best Policy Identification in Online CMDPs
Z Zhou, H Wei, L Ying
arXiv preprint arXiv:2309.15395, 2023
A Reinforcement Learning and Prediction-Based Lookahead Policy for Vehicle Repositioning in Online Ride-Hailing Systems
H Wei, Z Yang, X Liu, Z Qin, X Tang, L Ying
IEEE Transactions on Intelligent Transportation Systems, 2023
Provably Efficient Algorithms for Safe Reinforcement Learning
H Wei
The system can't perform the operation now. Try again later.
Articles 1–16