Hongming Zhang
Title
Cited by
Year
Deep Reinforcement Learning: Fundamentals, Research, and Applications
H Dong, Z Ding, S Zhang, H Yuan, H Zhang, J Zhang, Y Huang, T Yu, ...
Springer Singapore, 2020
Cited by 221, 2020
Taxonomy of reinforcement learning algorithms
H Zhang, T Yu
Deep Reinforcement Learning: Fundamentals, Research and Applications, 125-133, 2020
Cited by 68, 2020
AlphaZero
H Zhang, T Yu
Deep Reinforcement Learning: Fundamentals, Research and Applications, 391-415, 2020
Cited by 16, 2020
Efficient reinforcement learning development with RLzoo
Z Ding, T Yu, H Zhang, Y Huang, G Li, Q Guo, L Mai, H Dong
Proceedings of the 29th ACM International Conference on Multimedia, 3759-3762, 2021
Cited by 11*, 2021
PiCor: Multi-task deep reinforcement learning with policy correction
F Bai, H Zhang, T Tao, Z Wu, Y Wang, B Xu
Proceedings of the AAAI Conference on Artificial Intelligence 37 (6), 6728-6736, 2023
Cited by 5, 2023
Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay
H Zhang, C Xiao, H Wang, J Jin, B Xu, M Müller
The Eleventh International Conference on Learning Representations, 2023
Cited by 2, 2023
Provable Representation with Efficient Planning for Partially Observable Reinforcement Learning
H Zhang, T Ren, C Xiao, D Schuurmans, B Dai
Forty-first International Conference on Machine Learning, 2024
Cited by 1, 2024
A Simple Unified Framework for Anomaly Detection in Deep Reinforcement Learning
H Zhang, K Sun, B Xu, L Kong, M Müller
arXiv preprint arXiv:2109.09889, 2021
Cited by 1, 2021
Combine Deep Q-Networks with Actor-Critic
H Zhang, T Yu, R Huang
Deep Reinforcement Learning: Fundamentals, Research and Applications, 213-245, 2020
Cited by 1, 2020
A logarithmic barrier method for proximal policy optimization
C Zeng, H Zhang
arXiv preprint arXiv:1812.06502, 2018
Cited by 1, 2018
Monte Carlo Tree Search in the Presence of Transition Uncertainty
F Kohankhaki, K Aghakasiri, H Zhang, TH Wei, C Gao, M Müller
Proceedings of the AAAI Conference on Artificial Intelligence 38 (18), 20151 …, 2024
2024
Build generally reusable agent-environment interaction models
J Jin, H Zhang, J Luo
arXiv preprint arXiv:2211.08234, 2022
2022
Iterative Update and Unified Representation for Multi-Agent Reinforcement Learning
J Long, H Zhang, T Yu, B Xu
arXiv preprint arXiv:1908.06758, 2019
2019
RevCuT Tree Search Method in Complex Single-player Game with Continuous Search Space
H Zhang, F Cheng, B Xu, F Chen, J Liu, W Wu
2019 International Joint Conference on Neural Networks (IJCNN), 1-8, 2019
2019