Baoxiang Wang

Cited by

	All	Since 2019
Citations	447	411
h-index	9	9
i10-index	9	9

140

105

2016201720182019202020212022202320245 8 21 36 45 58 78 123 69

Public access

View all

7 articles

1 article

available

not available

Based on funding mandates

Co-authors

Kun KuangZhejiang UniversityVerified email at zju.edu.cn
Furui LiuZhejiang Lab and UCAS and Zhejiang UniversityVerified email at zhejianglab.com
Hongyuan ZhaThe Chinese University of Hong Kong, ShenzhenVerified email at cuhk.edu.cn
Jing DongThe Chinese University of Hong Kong, ShenzhenVerified email at umich.edu
Fei WuProfessor of Computer Science, Zhejiang UniversityVerified email at cs.zju.edu.cn
Jun XIAOInstitute of Artificial Intelligence, Zhejiang UniversityVerified email at zju.edu.cn
Nidhi HegdeUniversity of AlbertaVerified email at ualberta.ca
Matthew E. TaylorAssociate Professor, University of AlbertaVerified email at ualberta.ca
Shuai Li (李帅)Shanghai Jiao Tong UniversityVerified email at sjtu.edu.cn
Andrej BogdanovChinese University of Hong KongVerified email at cse.cuhk.edu.hk

Baoxiang Wang

Assistant Professor, The Chinese University of Hong Kong Shenzhen

Verified email at cse.cuhk.edu.hk - Homepage

reinforcement learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Contextual combinatorial cascading bandits S Li, B Wang, S Zhang, W Chen International conference on machine learning, 1245-1253, 2016	140	2016
Privacy-preserving q-learning with functional noise in continuous spaces B Wang, N Hegde Advances in Neural Information Processing Systems 32, 2019	56	2019
Shapley counterfactual credits for multi-agent reinforcement learning J Li, K Kuang, B Wang, F Liu, L Chen, F Wu, J Xiao Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data …, 2021	49	2021
Paid: Prioritizing app issues for developers by tracking user reviews over versions C Gao, B Wang, P He, J Zhu, Y Zhou, MR Lyu 2015 IEEE 26th international symposium on software reliability engineering …, 2015	48	2015
Metatrace Actor-Critic: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control K Young, B Wang, ME Taylor International Joint Conference on Artificial Intelligence (IJCAI) 2019, 2018	26*	2018
Deconfounded value decomposition for multi-agent reinforcement learning J Li, K Kuang, B Wang, F Liu, L Chen, C Fan, F Wu, J Xiao International Conference on Machine Learning, 12843-12856, 2022	15	2022
Multilinear extension of -submodular functions B Wang, H Zhou arXiv e-prints, arXiv: 2107.07103, 2021	15	2021
Beyond winning and losing: modeling human motivations and behaviors using inverse reinforcement learning B Wang, T Sun, SX Zheng Artificial Intelligence and Interactive Digital Entertainment (AIIDE) 2019., 2018	15*	2018
Semantically aligned task decomposition in multi-agent reinforcement learning W Li, D Qiao, B Wang, X Wang, B Jin, H Zha arXiv preprint arXiv:2305.10865, 2023	12	2023
Improved regret bounds for linear adversarial mdps via linear optimization F Kong, X Zhang, B Wang, S Li arXiv preprint arXiv:2302.06834, 2023	9	2023
Online policy optimization for robust MDP J Dong, J Li, B Wang, J Zhang arXiv preprint arXiv:2209.13841, 2022	9	2022
Learning from good trajectories in offline multi-agent reinforcement learning Q Tian, K Kuang, F Liu, B Wang Proceedings of the AAAI Conference on Artificial Intelligence 37 (10), 11672 …, 2023	7	2023
Learning adversarial linear mixture markov decision processes with bandit feedback and unknown transition C Zhao, R Yang, B Wang, S Li The Eleventh International Conference on Learning Representations, 2022	6	2022
Learning fair representations via distance correlation minimization D Guo, C Wang, B Wang, H Zha IEEE Transactions on Neural Networks and Learning Systems, 2022	6	2022
Algorithms and theory for supervised gradual domain adaptation J Dong, S Zhou, B Wang, H Zhao arXiv preprint arXiv:2204.11644, 2022	6	2022
Combinatorial bandits under strategic manipulations J Dong, K Li, S Li, B Wang Proceedings of the Fifteenth ACM International Conference on Web Search and …, 2022	5	2022
Policy optimization with second-order advantage information J Li, B Wang International Joint Conference on Artificial Intelligence (IJCAI) 2018 …, 2018	4	2018
Online Influence Maximization under Decreasing Cascade Model F Kong, J Xie, B Wang, T Yao, S Li arXiv preprint arXiv:2305.15428, 2023	3	2023
Private Q-Learning with Functional Noise in Continuous Spaces B Wang, N Hegde The Multi-disciplinary Conference on Reinforcement Learning and Decision …, 2019	3	2019
Learning adversarial low-rank markov decision processes with unknown transition and full-information feedback C Zhao, R Yang, B Wang, X Zhang, S Li Advances in Neural Information Processing Systems 36, 2024	2	2024

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors