In this workwepresent amethodology for intelligent motion planning inanuncertain environment usinga non-local sensor, suchas a radar.This methodology isapplied toan unmannedhelicopter navigating a cluttered urbanenvironment. We show thattheproblemofmotionplanning ina uncertain environment, undercertain assumptions, canbeposedas theadaptive optimal control ofanuncertain Markov decision process, characterized by a known,control dependent system, andanunknown, control independent environment. Thestrategy formotionplanning then reduces tocomputing thecontrol policy basedonthe current estimate oftheenvironment, alsoknownasthe "certainty equivalence principle" intheadaptive control literature. Ourmethodology allows theinclusion ofa non-local sensor intotheproblemformulation, which significantly accelerates the convergence of the estimation andplanning algorithms. Further weshow thatthemotionplanning andestimation problems, as formulated inthispaperpossess special structure which can be exploited to significantly reducethe computational burdenoftheassociated algorithms. We applythismethodology totheproblemofmotion planning foran unmannedhelicopter ina partially knownmodeloftheTexasA&M campus.
[1]
John N. Tsitsiklis,et al.
Neuro-Dynamic Programming
,
1996,
Encyclopedia of Machine Learning.
[2]
Sebastian Thrun,et al.
Probabilistic Algorithms in Robotics
,
2000,
AI Mag..
[3]
Richard S. Sutton,et al.
Reinforcement Learning
,
1992,
Handbook of Machine Learning.
[4]
Jean-Claude Latombe,et al.
Robot motion planning
,
1970,
The Kluwer international series in engineering and computer science.
[5]
Wolfram Burgard,et al.
Sonar-Based Mapping of Large-Scale Mobile Robot Environments using EM
,
1999,
ICML.
[6]
Steven M. LaValle,et al.
Robot Motion Planning: A Game-Theoretic Foundation
,
2000,
Algorithmica.
[7]
R. Bellman.
Dynamic programming.
,
1957,
Science.