Volgen
Hai Zhang
Titel
Geciteerd door
Geciteerd door
Jaar
How to Fine-tune the Model: Unified Model Shift and Model Bias Policy Optimization
H Zhang, H Yu, J Zhao, D Zhang, H Zhou, C Huang, C Ye
Advances in Neural Information Processing Systems 36, 2024
22024
Multi-agent Decision-making at Unsignalized Intersections with Reinforcement Learning from Demonstrations
C Huang, J Zhao, H Zhou, H Zhang, X Zhang, C Ye
2023 IEEE Intelligent Vehicles Symposium (IV), 1-6, 2023
12023
Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning
L Li*, H Zhang*, X Zhang, S Zhu, J Zhao, PA Heng
arXiv preprint arXiv:2402.02429, 2024
2024
Safe Reinforcement Learning with Dead-Ends Avoidance and Recovery
X Zhang, H Zhang, H Zhou, C Huang, D Zhang, C Ye, J Zhao
IEEE Robotics and Automation Letters 9 (1), 491 - 498, 2023
2023
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–4