Doina Precup

Geciteerd door

	Alles	Sinds 2019
Citaties	32679	23598
h-index	63	54
i10-index	234	184

6000

3000

1500

4500

20022003200420052006200720082009201020112012201320142015201620172018201920202021202220232024118 119 179 217 246 307 327 331 320 380 408 477 590 610 875 1085 1896 2611 3405 4322 5251 5919 2062

Openbare toegang

Alles bekijken

61 artikelen

5 artikelen

beschikbaar

niet beschikbaar

Op basis van financieringsmachtigingen

Medeauteurs

Joelle PineauSchool of Computer Science, McGill University; FAIR, Meta AI; MilaGeverifieerd e-mailadres voor cs.mcgill.ca
Satinder SinghGoogle DeepMind / U. of MichiganGeverifieerd e-mailadres voor umich.edu
Prakash PanangadenProfessor of Computer Science, McGill UniversityGeverifieerd e-mailadres voor cs.mcgill.ca
Tal ArbelProfessor of Electrical & Computer Engineering, McGill UniversityGeverifieerd e-mailadres voor cim.mcgill.ca
Riashat IslamResearch ScientistGeverifieerd e-mailadres voor dreamfold.ai
Andre BarretoResearch Scientist, Google DeepMindGeverifieerd e-mailadres voor google.com
Emmanuel BengioMcGill UniversityGeverifieerd e-mailadres voor mail.mcgill.ca
Yoshua BengioProfessor of computer science, University of Montreal, Mila, IVADO, CIFARGeverifieerd e-mailadres voor umontreal.ca
Shie MannorProfessor of Electrical Engineering @ Technion & Researcher @ Nvidia ResearchGeverifieerd e-mailadres voor technion.ac.il
David SilverDeepMind, UCLGeverifieerd e-mailadres voor google.com
Jean HarbOpenAIGeverifieerd e-mailadres voor openai.com
Guilherme Sant AnnaProfessor (Full) of Pediatrics, McGill UniversityGeverifieerd e-mailadres voor mcgill.ca
Philip WarrickPerigen Inc.Geverifieerd e-mailadres voor perigen.com
Csaba SzepesvariDeepMind & University of AlbertaGeverifieerd e-mailadres voor cs.ualberta.ca
Norm FernsGeverifieerd e-mailadres voor normferns.com
Jordan FrankSoftware Engineer, FacebookGeverifieerd e-mailadres voor cs.mcgill.ca
Amir-massoud FarahmandUniversity of TorontoGeverifieerd e-mailadres voor cs.toronto.edu
Pablo Samuel CastroGoogleGeverifieerd e-mailadres voor google.com
Hamid MaeiNetflixGeverifieerd e-mailadres voor netflix.com
Borja BalleDeepMindGeverifieerd e-mailadres voor google.com

Volgen

Doina Precup

DeepMind and McGill University

Geverifieerd e-mailadres voor cs.mcgill.ca

Artificial Intelligence machine learning reinforcement learning


Titel Sorteren op citaties Sorteren op jaar Sorteren op titel	Geciteerd door Geciteerd door	Jaar
The multimodal brain tumor image segmentation benchmark (BRATS) BH Menze, A Jakab, S Bauer, J Kalpathy-Cramer, K Farahani, J Kirby, ... IEEE transactions on medical imaging 34 (10), 1993-2024, 2014	5439	2014
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning RS Sutton, D Precup, S Singh Artificial intelligence 112 (1-2), 181-211, 1999	4311	1999
Deep reinforcement learning that matters P Henderson, R Islam, P Bachman, J Pineau, D Precup, D Meger Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	2247	2018
Off-policy deep reinforcement learning without exploration S Fujimoto, D Meger, D Precup International conference on machine learning, 2052-2062, 2019	1383	2019
The option-critic architecture PL Bacon, J Harb, D Precup Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017	1190	2017
Eligibility traces for off-policy policy evaluation D Precup Computer Science Department Faculty Publication Series, 80, 2000	922	2000
Fast gradient-descent methods for temporal-difference learning with linear function approximation RS Sutton, HR Maei, D Precup, S Bhatnagar, D Silver, C Szepesvári, ... Proceedings of the 26th annual international conference on machine learning …, 2009	699	2009
Learning with pseudo-ensembles P Bachman, O Alsharif, D Precup Advances in neural information processing systems 27, 2014	632	2014
Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction RS Sutton, J Modayil, M Delp, T Degris, PM Pilarski, A White, D Precup The 10th International Conference on Autonomous Agents and Multiagent …, 2011	579	2011
Algorithms for multi-armed bandit problems V Kuleshov, D Precup arXiv preprint arXiv:1402.6028, 2014	534	2014
Reward is enough D Silver, S Singh, D Precup, RS Sutton Artificial Intelligence 299, 103535, 2021	515	2021
Off-policy temporal-difference learning with function approximation D Precup, RS Sutton, S Dasgupta ICML, 417-424, 2001	458	2001
Learning options in reinforcement learning M Stolle, D Precup Abstraction, Reformulation, and Approximation: 5th International Symposium …, 2002	451	2002
Exploring uncertainty measures in deep networks for multiple sclerosis lesion detection and segmentation T Nair, D Precup, DL Arnold, T Arbel Medical image analysis 59, 101557, 2020	446	2020
Temporal abstraction in reinforcement learning D Precup University of Massachusetts Amherst, 2000	388	2000
Metrics for Finite Markov Decision Processes. N Ferns, P Panangaden, D Precup UAI 4, 162-169, 2004	336	2004
Convergent temporal-difference learning with arbitrary smooth function approximation H Maei, C Szepesvari, S Bhatnagar, D Precup, D Silver, RS Sutton Advances in neural information processing systems 22, 2009	329	2009
Conditional computation in neural networks for faster models E Bengio, PL Bacon, J Pineau, D Precup arXiv preprint arXiv:1511.06297, 2015	328	2015
Reproducibility of benchmarked deep reinforcement learning tasks for continuous control R Islam, P Henderson, M Gomrokchi, D Precup arXiv preprint arXiv:1708.04133, 2017	303	2017
Gradient starvation: A learning proclivity in neural networks M Pezeshki, O Kaba, Y Bengio, AC Courville, D Precup, G Lajoie Advances in Neural Information Processing Systems 34, 1256-1272, 2021	236	2021

Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.

Artikelen 1–20

Citaties per jaar

Dubbele citaties

Samengevoegde citaties

Medeauteurs toevoegenMedeauteurs

Volgen

Geciteerd door

Medeauteurs