Volgen
Michael Littman
Michael Littman
Geverifieerd e-mailadres voor brown.edu - Homepage
Titel
Geciteerd door
Geciteerd door
Jaar
Reinforcement learning: A survey
LP Kaelbling, ML Littman, AW Moore
Journal of artificial intelligence research 4, 237-285, 1996
120111996
Planning and acting in partially observable stochastic domains
LP Kaelbling, ML Littman, AR Cassandra
Artificial intelligence 101 (1-2), 99-134, 1998
58061998
Markov games as a framework for multi-agent reinforcement learning
ML Littman
Machine learning proceedings 1994, 157-163, 1994
40611994
Measuring praise and criticism: Inference of semantic orientation from association
PD Turney, ML Littman
acm Transactions on Information Systems (tois) 21 (4), 315-346, 2003
24192003
Activity recognition from accelerometer data
N Ravi, N Dandekar, P Mysore, ML Littman
Aaai 5 (2005), 1541-1546, 2005
22992005
Packet routing in dynamically changing networks: A reinforcement learning approach
J Boyan, M Littman
Advances in neural information processing systems 6, 1993
12091993
Learning policies for partially observable environments: Scaling up
ML Littman, AR Cassandra, LP Kaelbling
Machine Learning Proceedings 1995, 362-370, 1995
10321995
Acting optimally in partially observable stochastic domains
AR Cassandra, LP Kaelbling, ML Littman
Aaai 94, 1023-1028, 1994
10251994
Convergence results for single-step on-policy reinforcement-learning algorithms
S Singh, T Jaakkola, ML Littman, C Szepesvári
Machine learning 38, 287-308, 2000
10232000
Friend-or-foe Q-learning in general-sum games
ML Littman
ICML 1 (2001), 322-328, 2001
9302001
Graphical models for game theory
M Kearns, ML Littman, S Singh
arXiv preprint arXiv:1301.2281, 2013
8162013
On the complexity of solving Markov decision problems
ML Littman, TL Dean, LP Kaelbling
arXiv preprint arXiv:1302.4971, 2013
7502013
Interactions between learning and evolution
D Ackley, M Littman
Artificial life II 10, 487-509, 1991
7401991
Predictive representations of state
M Littman, RS Sutton
Advances in neural information processing systems 14, 2001
7252001
Incremental pruning: A simple, fast, exact method for partially observable Markov decision processes
AR Cassandra, ML Littman, NL Zhang
arXiv preprint arXiv:1302.1525, 2013
6962013
Computerized cross-language document retrieval using latent semantic indexing
TK Landauer, ML Littman
US Patent 5,301,109, 1994
6611994
An analysis of model-based interval estimation for Markov decision processes
AL Strehl, ML Littman
Journal of Computer and System Sciences 74 (8), 1309-1331, 2008
6422008
PAC model-free reinforcement learning
AL Strehl, L Li, E Wiewiora, J Langford, ML Littman
Proceedings of the 23rd international conference on Machine learning, 881-888, 2006
6422006
Towards a unified theory of state abstraction for MDPs.
L Li, TJ Walsh, ML Littman
AI&M 1 (2), 3, 2006
6202006
Algorithms for sequential decision-making
ML Littman
Brown University, 1996
6021996
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–20