W Bradley Knox

Geciteerd door

	Alles	Sinds 2019
Citaties	3733	2379
h-index	23	18
i10-index	33	25

580

290

145

435

200920102011201220132014201520162017201820192020202120222023202423 27 54 69 119 147 139 214 215 294 298 404 466 462 562 182

Openbare toegang

Alles bekijken

4 artikelen

0 artikelen

beschikbaar

niet beschikbaar

Op basis van financieringsmachtigingen

Medeauteurs

Peter StoneProfessor of Computer Science, The University of Texas at AustinGeverifieerd e-mailadres voor cs.utexas.edu
Cynthia BreazealProfessor Media Arts and Sciences, MIT Media LabGeverifieerd e-mailadres voor media.mit.edu
Maya CakmakUniversity of WashingtonGeverifieerd e-mailadres voor cs.washington.edu
Bradley C. LoveProfessor of Cognitive and Decision Sciences, University College LondonGeverifieerd e-mailadres voor ucl.ac.uk
Todd KuleszaUser Experience Researcher, GoogleGeverifieerd e-mailadres voor google.com
Saleema AmershiMicrosoft ResearchGeverifieerd e-mailadres voor microsoft.com
Alessandro AllieviImperial College LondonGeverifieerd e-mailadres voor imperial.ac.uk
Scott NiekumAssociate Professor, University of Massachusetts AmherstGeverifieerd e-mailadres voor cs.umass.edu
Hayley HungAssociate Professor, Delft University of TechnologyGeverifieerd e-mailadres voor tudelft.nl
Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, WaymoGeverifieerd e-mailadres voor cs.ox.ac.uk
Guangliang LiAssociate Professor, College of Electrical Engineering, Ocean University of China, Qingdao, ChinaGeverifieerd e-mailadres voor ouc.edu.cn
Ross OttoDepartment of Psychology, McGill UniversityGeverifieerd e-mailadres voor mcgill.ca
Jin Joo Lee, PhDAmazon Lab126Geverifieerd e-mailadres voor amazon.com
W. Todd MaddoxWayne Holtzman Chair and Professor of Psychology, University of TexasGeverifieerd e-mailadres voor utexas.edu
Serena BoothMITGeverifieerd e-mailadres voor mit.edu
Felix SchmittBosch Center for Artificial IntelligenceGeverifieerd e-mailadres voor de.bosch.com
Jolie Baumann WormwoodUniversity of New HampshireGeverifieerd e-mailadres voor unh.edu
David DeStenoNortheastern UniversityGeverifieerd e-mailadres voor northeastern.edu
Brian GlassPostdoctoral Researcher of Psychology and Computer Science, University College London, University ofGeverifieerd e-mailadres voor qmul.ac.uk
Samuel SpauldingMedia Lab, Massachusetts Institute of TechnologyGeverifieerd e-mailadres voor media.mit.edu

Volgen

W Bradley Knox

Research Scientist at UT Austin

Geverifieerd e-mailadres voor cs.utexas.edu - Homepage

Reward functions Alignment RLHF Reinforcement Learning Human-Robot Interaction


Titel Sorteren op citaties Sorteren op jaar Sorteren op titel	Geciteerd door Geciteerd door	Jaar
Power to the people: The role of humans in interactive machine learning S Amershi, M Cakmak, WB Knox, T Kulesza AI Magazine 35 (4), 105-120, 2014	1126	2014
Interactively shaping agents via human reinforcement: The TAMER framework WB Knox, P Stone Proceedings of the 5th International Conference on Knowledge Capture (K-CAP …, 2009	580	2009
Combining manual feedback with subsequent MDP reward signals for reinforcement learning WB Knox, P Stone Proceedings of the 9th International Conference on Autonomous Agents and …, 2010	266	2010
Reinforcement learning from simultaneous human and MDP reward WB Knox, P Stone Proceedings of the 11th International Conference on Autonomous Agents and …, 2012	253*	2012
Tamer: Training an agent manually via evaluative reinforcement WB Knox, P Stone 2008 7th IEEE international conference on development and learning, 292-297, 2008	200	2008
Training a robot via human feedback: A case study WB Knox, P Stone, C Breazeal International Conference on Social Robotics (ICSR), 460-470, 2013	170	2013
Computationally modeling interpersonal trust JJ Lee, B Knox, J Baumann, C Breazeal, D DeSteno Frontiers in psychology 4, 56004, 2013	123	2013
The nature of belief-directed exploratory choice in human decision-making WB Knox, AR Otto, P Stone, B Love Frontiers in Psychology 2, 2012	96	2012
How humans teach agents: A new experimental perspective WB Knox, BD Glass, BC Love, WT Maddox, P Stone International Journal of Social Robotics 4 (4), 409-421, 2012	95	2012
Framing reinforcement learning from human reward: Reward positivity, temporal discounting, episodicity, and performance WB Knox, P Stone Artificial Intelligence 225, 24-50, 2015	76	2015
Reinforcement Learning from Human Reward: Discounting in Episodic Tasks WB Knox, P Stone 21st IEEE International Symposium on Robot and Human Interactive …, 2012	70	2012
Reward (Mis)design for Autonomous Driving WB Knox, A Allievi, H Banzhaf, F Schmitt, P Stone arXiv preprint arXiv:2104.13906, 2021	66	2021
The EMPATHIC Framework for Task Learning from Implicit Human Feedback Y Cui, Q Zhang, A Allievi, P Stone, S Niekum, WB Knox Conference on Robot Learning (CoRL), 2020	56	2020
Learning from Human-Generated Reward WB Knox University of Texas at Austin, 2012	56	2012
Know thine enemy: A champion RoboCup coach agent G Kuhlmann, WB Knox, P Stone Proceedings of the National Conference on Artificial Intelligence 21 (2), 1463, 2006	48	2006
Using informative behavior to increase engagement in the tamer framework G Li, H Hung, S Whiteson, WB Knox Proceedings of the 2013 international conference on autonomous agents and …, 2013	42	2013
Learning non-myopically from human-generated reward WB Knox, P Stone Proceedings of the 2013 international conference on Intelligent user …, 2013	42	2013
Design Principles for Creating Human-Shapable Agents. WB Knox, IR Fasel, P Stone AAAI Spring Symposium: Agents that Learn from Human Teachers, 79-86, 2009	34	2009
Physiological and behavioral signatures of reflective exploratory choice AR Otto, WB Knox, AB Markman, BC Love Cognitive, Affective, & Behavioral Neuroscience 14, 1167-1183, 2014	27	2014
The perils of trial-and-error reward design: misdesign through overfitting and invalid task specifications S Booth, WB Knox, J Shah, S Niekum, P Stone, A Allievi Proceedings of the AAAI Conference on Artificial Intelligence 37 (5), 5920-5929, 2023	25	2023

Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.

Artikelen 1–20

Citaties per jaar

Dubbele citaties

Samengevoegde citaties

Medeauteurs toevoegenMedeauteurs

Volgen

Geciteerd door

Medeauteurs