Gal Dalal
Sr. Research Scientist, Nvidia
Verified email at nvidia.com - Homepage
Title
Cited by
Year
Safe exploration in continuous action spaces
G Dalal, K Dvijotham, M Vecerik, T Hester, C Paduraru, Y Tassa
arXiv preprint arXiv:1801.08757, 2018
Cited by: 254, Year: 2018
Finite Sample Analyses for TD(0) with Function Approximation
G Dalal, B Szörényi, G Thoppe, S Mannor
Association for the Advancement of Artificial Intelligence (AAAI) 2018, 2018
Cited by: 133, Year: 2018
Finite sample analysis of two-timescale stochastic approximation with applications to reinforcement learning
G Dalal, B Szörényi, G Thoppe, S Mannor
31st Annual Conference on Learning Theory (COLT) 75, 1-35, 2018
Cited by: 102, Year: 2018
A tale of two-timescale reinforcement learning with the tightest finite-time bound
G Dalal, B Szörényi, G Thoppe
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 3701-3708, 2020
Cited by: 34, Year: 2020
Beyond the one step greedy approach in reinforcement learning
Y Efroni, G Dalal, B Scherrer, S Mannor
Proceedings of The 35th International Conference on Machine Learning (ICML 2018), 2018
Cited by: 33, Year: 2018
Hierarchical Decision Making In Electricity Grid Management
G Dalal, E Gilboa, S Mannor
Proceedings of The 33rd International Conference on Machine Learning (ICML …, 2016
Cited by: 31, Year: 2016
Anomaly Detection in Large Databases Using Behavioral Patterning
H Mazzawi, G Dalal, D Rozenblat, L Ein-Dor, M Ninio, O Lavi
2017 IEEE 33rd International Conference on Data Engineering (ICDE 2017), 2017
Cited by: 29, Year: 2017
Chance-constrained outage scheduling using a machine learning proxy
G Dalal, E Gilboa, S Mannor, L Wehenkel
IEEE Transactions on Power Systems 34 (4), 2019
Cited by: 27, Year: 2019
Supervised Learning for Optimal Power Flow as a Real-Time Proxy
R Canyasse, G Dalal, S Mannor
IEEE PES Innovative Smart Grid Technologies (ISGT 2017) 8, 2017
Cited by: 27, Year: 2017
How to combine tree-search methods in reinforcement learning
Y Efroni, G Dalal, B Scherrer, S Mannor
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2019) 33 …, 2019
Cited by: 23, Year: 2019
Unit commitment using nearest neighbor as a short-term proxy
G Dalal, E Gilboa, S Mannor, L Wehenkel
20th Power Systems Computation Conference (PSCC'18), 2018
Cited by: 17, Year: 2018
Reinforcement learning for the unit commitment problem
G Dalal, S Mannor
2015 IEEE Eindhoven PowerTech, 1-6, 2015
Cited by: 16, Year: 2015
Multiple-step greedy policies in approximate and online reinforcement learning
Y Efroni, G Dalal, B Scherrer, S Mannor
Advances in Neural Information Processing Systems (NIPS 2018), 5238-5247, 2018
Cited by: 15, Year: 2018
Reinforcement learning for datacenter congestion control
C Tessler, Y Shpigelman, G Dalal, A Mandelbaum, D Haritan Kazakov, ...
ACM SIGMETRICS Performance Evaluation Review 49 (2), 43-46, 2022
Cited by: 9, Year: 2022
Multiple-step greedy policies in online and approximate reinforcement learning
Y Efroni, G Dalal, B Scherrer, S Mannor
arXiv preprint arXiv:1805.07956, 2018
Cited by: 9, Year: 2018
The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems
A Inci, E Bolotin, Y Fu, G Dalal, S Mannor, D Nellans, D Marculescu
EMC2 (The Sixth Workshop on Energy Efficient Machine Learning and Cognitive …, 2020
Cited by: 7, Year: 2020
Distributed Scenario-Based Optimization for Asset Management in a Hierarchical Decision Making Environment
G Dalal, E Gilboa, S Mannor
19th Power Systems Computation Conference (PSCC'16), 2016
Cited by: 7, Year: 2016
Acting in Delayed Environments with Non-Stationary Markov Policies
E Derman, G Dalal, S Mannor
International Conference on Learning Representations (ICLR), 2021
Cited by: 6, Year: 2021
Finite sample analysis for TD(0) with linear function approximation
G Dalal, B Szörényi, G Thoppe, S Mannor
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2018), 2018
Cited by: 6, Year: 2018
Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
G Dalal, A Hallak, S Dalton, S Mannor, G Chechik
Advances in Neural Information Processing Systems 34, 5518-5530, 2021
Cited by: 3, Year: 2021
Articles 1–20