Triple-Q: A model-free algorithm for constrained reinforcement learning with sublinear regret and zero constraint violation H Wei, X Liu, L Ying International Conference on Artificial Intelligence and Statistics, 3274-3307, 2022 | 59 | 2022 |
A provably-efficient model-free algorithm for infinite-horizon average-reward constrained Markov decision processes H Wei, X Liu, L Ying Proceedings of the AAAI Conference on Artificial Intelligence 36 (4), 3868-3876, 2022 | 31 | 2022 |
Spectrum anomalies autonomous detection in cognitive radio using hidden Markov models W Honghao, J Yunfeng, W Lei 2015 IEEE Advanced Information Technology, Electronic and Automation Control …, 2015 | 28 | 2015 |
Online convex optimization with hard constraints: Towards the best of two worlds and beyond H Guo, X Liu, H Wei, L Ying Advances in Neural Information Processing Systems 35, 36426-36439, 2022 | 27 | 2022 |
Provably efficient model-free algorithms for non-stationary CMDPs H Wei, A Ghosh, N Shroff, L Ying, X Zhou International Conference on Artificial Intelligence and Statistics, 6527-6570, 2023 | 15 | 2023 |
QuickStop: A Markov optimal stopping approach for quickest misinformation detection H Wei, X Kang, W Wang, L Ying Proceedings of the ACM on Measurement and Analysis of Computing Systems 3 (2 …, 2019 | 13 | 2019 |
General range model for multi‐channel SAR/GMTI with curvilinear flight trajectory Z Chen, Y Zhou, L Zhang, H Wei, C Lin, N Liu, J Wan Electronics Letters 55 (2), 111-112, 2019 | 12 | 2019 |
Sample Efficient Reinforcement Learning in Mixed Systems through Augmented Samples and Its Applications to Queueing Networks H Wei, X Liu, W Wang, L Ying NeurIPS-23 Spotlight, 2023 | 10 | 2023 |
Scalable and sample efficient distributed policy gradient algorithms in multi-agent networked systems X Liu, H Wei, L Ying arXiv preprint arXiv:2212.06357, 2022 | 9 | 2022 |
A Reinforcement Learning and Prediction-Based Lookahead Policy for Vehicle Repositioning in Online Ride-Hailing Systems H Wei, Z Yang, X Liu, Z Qin, X Tang, L Ying IEEE Transactions on Intelligent Transportation Systems, 2023 | 7 | 2023 |
Fork: A forward-looking actor for model-free reinforcement learning H Wei, L Ying 2021 60th IEEE Conference on Decision and Control (CDC), 1554-1559, 2021 | 4 | 2021 |
Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration H Wei, X Liu, L Ying AAAI, 2024 | 2 | 2024 |
On low-complexity quickest intervention of mutated diffusion processes through local approximation Q Zhang, H Wei, W Wang, L Ying Proceedings of the Twenty-Third International Symposium on Theory …, 2022 | 2 | 2022 |
Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis Q Zhang, H Wei, L Ying RLC 2024, 2024 | 1 | 2024 |
Model-Free, Regret-Optimal Best Policy Identification in Online CMDPs Z Zhou, H Wei, L Ying arXiv preprint arXiv:2309.15395, 2023 | 1 | 2023 |
Provably Efficient Algorithms for Safe Reinforcement Learning H Wei | 1 | 2023 |
Enhancing Safety in Reinforcement Learning with Human Feedback via Rectified Policy Optimization X Peng, H Guo, J Zhang, D Zou, Z Shao, H Wei, X Liu arXiv preprint arXiv:2410.19933, 2024 | | 2024 |
Optimistic Joint Flow Control and Link Scheduling with Unknown Utility Functions X Liu, H Wei, L Ying Mobihoc 2024, 271-280, 2024 | | 2024 |
Vehicle repositioning determination for vehicle pool L Ying, WEI Honghao, Y Zixian US Patent App. 18/507,042, 2024 | | 2024 |
Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning H Wei, X Peng, A Ghosh, X Liu NeurIPS 2024, 2024 | | 2024 |