Hui Wang
PhD Candidate, Leiden Institute of Advanced Computer Science
Verified email at liacs.leidenuniv.nl
Title
Cited by
Year
Alternative loss functions in AlphaZero-like self-play
H Wang, M Emmerich, M Preuss, A Plaat
2019 IEEE Symposium Series on Computational Intelligence (SSCI), 155-162, 2019
Cited by: 9, Year: 2019
Hyper-parameter sweep on AlphaZero General
H Wang, M Emmerich, M Preuss, A Plaat
arXiv preprint arXiv:1903.08129, 2019
Cited by: 8, Year: 2019
Assessing the potential of classical Q-learning in general game playing
H Wang, M Emmerich, A Plaat
Benelux Conference on Artificial Intelligence, 138-150, 2018
Cited by: 8, Year: 2018
Monte Carlo Q-learning for general game playing
H Wang, M Emmerich, A Plaat
arXiv preprint arXiv:1802.05944, 2018
Cited by: 8, Year: 2018
Analysis of Hyper-Parameters for Small Games: Iterations or Epochs in Self-Play?
H Wang, M Emmerich, M Preuss, A Plaat
arXiv preprint arXiv:2003.05988, 2020
Cited by: 6, Year: 2020
Tackling Morpion Solitaire with AlphaZero-like Ranked Reward Reinforcement Learning
H Wang, M Preuss, M Emmerich, A Plaat
Proceedings of 2020 22nd International Symposium on Symbolic and Numeric …, 2020
Cited by: 4, Year: 2020
Warm-Start AlphaZero Self-Play Search Enhancements
H Wang, M Preuss, A Plaat
Parallel Problem Solving from Nature – PPSN XVI. PPSN 2020. Lecture Notes in …, 2020
Cited by: 3, Year: 2020
A search optimization method for rule learning in board games
H Wang, Y Tang, J Liu, W Chen
Pacific Rim International Conference on Artificial Intelligence, 174-181, 2018
Cited by: 3, Year: 2018
A consensus value approach for influence maximization in social networks
F Yang, H Wang, Y Tang, J Liu, W Chen
2017 IEEE International Conference on Agents (ICA), 8-13, 2017
Cited by: 1, Year: 2017
Adaptive Warm-Start MCTS in AlphaZero-like Deep Reinforcement Learning
H Wang, M Preuss, A Plaat
arXiv preprint arXiv:2105.06136, 2021
2021
An Improved Bounded Rational Negotiation Model Based on Answer Set Program
LY Gao, H Wang, W Chen
Computer Science and Artificial Intelligence: Proceedings of the …, 2018
2018
A Complex Networked Method of Sorting Negotiation Demand Based on Answer Set Programs
H Wang, L Li, L Gao, W Chen
Intelligent Automation & Soft Computing, 1-6, 2017
2017
Szabó, Péter 210 Szuhai, Iulia-Monica 169 Tajti, Tibor 84 Tamburri, Damian A. 17 Tarba, Ionut-Adrian 251
A Nair, A Naumowicz, V Negru, TT Nguyen, C Obreja, Z Onet-Marian, ...
Policy or Value? Loss Function and Playing Strength in AlphaZero-like Self-play
H Wang, M Emmerich, M Preuss, A Plaat