Leslie Kaelbling
Unknown affiliation
Verified email at csail.mit.edu
Title
Cited by
Year
Reinforcement learning: A survey
LP Kaelbling, ML Littman, AW Moore
Journal of artificial intelligence research 4, 237-285, 1996
Cited by: 8911, Year: 1996
Planning and acting in partially observable stochastic domains
LP Kaelbling, ML Littman, AR Cassandra
Artificial intelligence 101 (1-2), 99-134, 1998
Cited by: 4474, Year: 1998
Learning in embedded systems
LP Kaelbling
MIT press, 1993
Cited by: 887, Year: 1993
Acting optimally in partially observable stochastic domains
AR Cassandra, LP Kaelbling, ML Littman
AAAI 94, 1023-1028, 1994
Cited by: 874, Year: 1994
Learning policies for partially observable environments: Scaling up
ML Littman, AR Cassandra, LP Kaelbling
Machine Learning Proceedings 1995, 362-370, 1995
Cited by: 860, Year: 1995
Acting under uncertainty: Discrete Bayesian models for mobile-robot navigation
AR Cassandra, LP Kaelbling, JA Kurien
Proceedings of IEEE/RSJ International Conference on Intelligent Robots and …, 1996
Cited by: 715, Year: 1996
On the complexity of solving Markov decision problems
ML Littman, TL Dean, LP Kaelbling
arXiv preprint arXiv:1302.4971, 2013
Cited by: 662, Year: 2013
Hierarchical planning in the now
LP Kaelbling, T Lozano-Pérez
Workshops at the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010
Cited by: 581, Year: 2010
Effective reinforcement learning for mobile robots
WD Smart, LP Kaelbling
Proceedings 2002 IEEE International Conference on Robotics and Automation …, 2002
Cited by: 519, Year: 2002
An architecture for intelligent reactive systems
LP Kaelbling
Reasoning about actions and plans, 395-410, 1987
Cited by: 495, Year: 1987
The synthesis of digital machines with provable epistemic properties
SJ Rosenschein, LP Kaelbling
Theoretical aspects of reasoning about knowledge, 83-98, 1986
Cited by: 464, Year: 1986
To transfer or not to transfer
MT Rosenstein, Z Marx, LP Kaelbling, TG Dietterich
NIPS 2005 workshop on transfer learning 898, 1-4, 2005
Cited by: 458, Year: 2005
Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons.
D Chapman, LP Kaelbling
IJCAI 91, 726-731, 1991
Cited by: 383, Year: 1991
Hierarchical solution of Markov decision processes using macro-actions
M Hauskrecht, N Meuleau, LP Kaelbling, TL Dean, C Boutilier
arXiv preprint arXiv:1301.7381, 2013
Cited by: 381, Year: 2013
Learning to cooperate via policy search
L Peshkin, KE Kim, N Meuleau, LP Kaelbling
arXiv preprint cs/0105032, 2001
Cited by: 363, Year: 2001
Action and planning in embedded agents
LP Kaelbling, SJ Rosenschein
Robotics and autonomous systems 6 (1-2), 35-48, 1990
Cited by: 348, Year: 1990
Belief space planning assuming maximum likelihood observations
R Platt Jr, R Tedrake, L Kaelbling, T Lozano-Perez
Cited by: 335, Year: 2010
Practical reinforcement learning in continuous spaces
WD Smart, LP Kaelbling
ICML, 903-910, 2000
Cited by: 335, Year: 2000
Integrated task and motion planning in belief space
LP Kaelbling, T Lozano-Pérez
The International Journal of Robotics Research 32 (9-10), 1194-1227, 2013
Cited by: 333, Year: 2013
Planning under time constraints in stochastic domains
T Dean, LP Kaelbling, J Kirman, A Nicholson
Artificial Intelligence 76 (1-2), 35-74, 1995
Cited by: 332, Year: 1995