Volgen
Martha White
Martha White
Geverifieerd e-mailadres voor ualberta.ca - Homepage
Titel
Geciteerd door
Geciteerd door
Jaar
Off-Policy Actor-Critic
T Degris, M White, RS Sutton
Twenty-Ninth International Conference on Machine Learning, 2012
5732012
Meta-learning representations for continual learning
K Javed, M White
Advances in Neural Information Processing Systems 32, 2019
2962019
An emphatic approach to the problem of off-policy temporal-difference learning
RS Sutton, AR Mahmood, M White
The Journal of Machine Learning Research 17 (1), 2603-2631, 2016
2612016
Supervised autoencoders: Improving generalization performance with unsupervised regularizers
L Le, A Patterson, M White
Advances in neural information processing systems 31, 2018
2332018
Convex multi-view subspace learning
M White, X Zhang, D Schuurmans, Y Yu
Advances in neural information processing systems 25, 2012
1912012
Maxmin Q-learning: Controlling the Estimation Bias of Q-learning
Q Lan, Y Pan, A Fyshe, M White
International Conference on Learning Representations, 2020
1342020
Estimating the class prior and posterior from noisy positives and unlabeled data
S Jain, M White, P Radivojac
Advances in neural information processing systems 29, 2016
1252016
Unifying task specification in reinforcement learning
M White
International Conference on Machine Learning, 2016
1052016
Sim2Real in Robotics and Automation: Applications and Challenges
S Höfer, K Bekris, A Handa, JC Gamboa, M Mozifian, F Golemo, ...
IEEE Transactions on Automation Science and Engineering 18 (2), 398-400, 2021
852021
An off-policy policy gradient theorem using emphatic weightings
E Imani, E Graves, M White
Advances in Neural Information Processing Systems 31, 2018
762018
Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online
Y Pan, K Banman, M White
International Conference on Learning Representations, 2021
70*2021
Recovering true classifier performance in positive-unlabeled learning
S Jain, M White, P Radivojac
Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017
672017
Nonparametric semi-supervised learning of class proportions
S Jain, M White, MW Trosset, P Radivojac
arXiv preprint arXiv:1601.01944, 2016
582016
Optimizing for the Future in Non-Stationary MDPs
Y Chandak, G Theocharous, S Shankar, S Mahadevan, M White, ...
International Conference on Machine Learning, 2020
562020
Organizing experience: a deeper look at replay mechanisms for sample-based planning in continuous state domains
Y Pan, M Zaheer, A White, A Patterson, M White
International Joint Conference on Artificial Intelligence, 2018
542018
Relaxed clipping: A global training method for robust regression and classification
Y Yu, M Yang, L Xu, M White, D Schuurmans
Advances in Neural Information Processing Systems 23, 2011
532011
The utility of sparse representations for control in reinforcement learning
V Liu, R Kumaraswamy, L Le, M White
AAAI Conference on Artificial Intelligence, 2019
502019
A greedy approach to adapting the trace parameter for temporal difference learning
M White, A White
International Conference on Autonomous Agents & Multiagent Systems, 557-565, 2016
432016
Improving Regression Performance with Distributional Losses
E Imani, M White
Advances in Neural Information Processing Systems, 2018
422018
Investigating practical, linear temporal difference learning
A White, M White
Autonomous Agents and Multiagent Sytems, 2016
402016
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–20