Fully parameterized quantile function for distributional reinforcement learning D Yang, L Zhao, Z Lin, T Qin, J Bian, TY Liu
Advances in neural information processing systems 32, 2019
154 2019 Maniskill: Generalizable manipulation skill benchmark with large-scale demonstrations T Mu, Z Ling, F Xiang, D Yang, X Li, S Tao, Z Huang, Z Jia, H Su
arXiv preprint arXiv:2107.14483, 2021
90 2021 Individualized indicator for all: Stock-wise technical indicator optimization with stock embedding Z Li, D Yang, L Zhao, J Bian, T Qin, TY Liu
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge …, 2019
53 2019 Distributional reward decomposition for reinforcement learning Z Lin, L Zhao, D Yang, T Qin, TY Liu, G Yang
Advances in neural information processing systems 32, 2019
17 2019 RD : Reward Decomposition with Representation Decomposition Z Lin*, D Yang*, L Zhao, T Qin, G Yang, TY Liu
Advances in Neural Information Processing Systems 33, 2020
9 2020 RD2 reward decomposition with representation disentanglement Z Lin, D Yang, L Zhao, T Qin, G Yang, T Liu
Proceedings of the 34th International Conference on Neural Information …, 2020
6 2020 Individualized Indicator for All Z Li, D Yang, L Zhao, J Bian, T Qin, TY Liu
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge …, 2019
2 2019 Defensive Quantization Layer For Convolutional Network Against Adversarial Attack S Song, Q Wang, D Yang, Y Song, X Liu, T Zhang
2019