Temporal logic robot control based on automata learning of environmental dynamics

We develop a technique to automatically generate a control policy for a robot moving in an environment that includes elements with unknown, randomly changing behavior. The robot is required to achieve a surveillance mission, in which a certain request needs to be serviced repeatedly, while the expected time inbetween consecutive services is minimized and additional temporal logic constraints are satisfied. We define a fragment of linear temporal logic to describe such a mission and formulate the problem as a temporal logic game. Our approach is based on two main ideas. First, we extend results in automata learning to detect patterns of the unknown behavior of the elements in the environment. Second, we employ an automata–theoretic method to generate the control policy. We show that the obtained control policy converges to an optimal one when the partially unknown behavior patterns are fully learned. In addition, we illustrate the method in an experimental setup, in which an unmanned ground vehicle, with the help of a cooperating unmanned aerial vehicle (UAV), satisfies a temporal logic requirement in a partitioned environment whose regions are controlled by barriers with unknown behavior.

[1]  Thomas Wilke,et al.  Automata logics, and infinite games: a guide to current research , 2002 .

[2]  Thomas Wilke,et al.  Automata Logics, and Infinite Games , 2002, Lecture Notes in Computer Science.

[3]  Herbert G. Tanner,et al.  (Sub)regular robotic languages , 2011, 2011 19th Mediterranean Conference on Control & Automation (MED).

[4]  Enrique Vidal,et al.  Learning Locally Testable Languages in the Strict Sense , 1990, ALT.

[5]  Calin Belta,et al.  LTL Control in Uncertain Environments with Probabilistic Satisfaction Guarantees , 2011, ArXiv.

[6]  Christel Baier,et al.  Principles of Model Checking (Representation and Mind Series) , 2008 .

[7]  Bud Mishra,et al.  Discrete event models+temporal logic=supervisory controller: automatic synthesis of locomotion controllers , 1995, Proceedings of 1995 IEEE International Conference on Robotics and Automation.

[8]  Yushan Chen,et al.  LTL robot motion control based on automata learning of environmental dynamics , 2012, 2012 IEEE International Conference on Robotics and Automation.

[9]  Christel Baier,et al.  Experiments with deterministic omega-automata for formulas of linear temporal logic , 2006, Theor. Comput. Sci..

[10]  Calin Belta,et al.  A symbolic approach to controlling piecewise affine systems , 2010, 49th IEEE Conference on Decision and Control (CDC).

[11]  Lydia E. Kavraki,et al.  Sampling-based motion planning with temporal goals , 2010, 2010 IEEE International Conference on Robotics and Automation.

[12]  Ufuk Topcu,et al.  Receding horizon temporal logic planning for dynamical systems , 2009, Proceedings of the 48h IEEE Conference on Decision and Control (CDC) held jointly with 2009 28th Chinese Control Conference.

[13]  Christel Baier,et al.  Experiments with Deterministic w-Automata for Formulas of Linear Temporal Logic , 2006 .

[14]  Calin Belta,et al.  Dealing with Nondeterminism in Symbolic Control , 2008, HSCC.

[15]  E. Frazzoli,et al.  Complex mission optimization for Multiple-UAVs using Linear Temporal Logic , 2008, 2008 American Control Conference.

[16]  Calin Belta,et al.  Automatic Deployment of Distributed Teams of Robots From Temporal Logic Motion Specifications , 2010, IEEE Transactions on Robotics.

[17]  Steven M. LaValle,et al.  Planning algorithms , 2006 .

[18]  James Jay Horning,et al.  A study of grammatical inference , 1969 .

[19]  Christel Baier,et al.  Principles of model checking , 2008 .

[20]  Calin Belta,et al.  MDP optimal control under temporal logic constraints , 2011, IEEE Conference on Decision and Control and European Control Conference.

[21]  Krishnendu Chatterjee,et al.  Faster and dynamic algorithms for maximal end-component decomposition and related graph problems in probabilistic verification , 2011, SODA '11.

[22]  Dana Angluin,et al.  Inference of Reversible Languages , 1982, JACM.

[23]  Hadas Kress-Gazit,et al.  Courteous Cars , 2008, IEEE Robotics & Automation Magazine.

[24]  Hadas Kress-Gazit,et al.  Courteous Cars Decentralized Multiagent Traffic Coordination , 2008 .

[25]  Leonor Becerra-Bonache,et al.  Learning DFA from Correction and Equivalence Queries , 2006, ICGI.

[26]  Yushan Chen,et al.  Formal Approach to the Deployment of Distributed Robotic Teams , 2012, IEEE Transactions on Robotics.

[27]  Mihalis Yannakakis,et al.  The complexity of probabilistic verification , 1995, JACM.