An efficient algorithm for discrete-time hidden mode stochastic hybrid systems

In this paper, we propose an efficient algorithm to find an optimal control policy in a discrete-time hidden mode stochastic hybrid system, which is a special case of partially observable discrete-time stochastic hybrid systems in which only discrete states are hidden. Many human-centered systems can be modeled as such systems, in which the intent of the human operator is unknown and can be modeled as the hidden mode. In the literature, the optimal control problem of hidden mode stochastic hybrid system is known to have high computational complexity due to the continuous state space. In this paper, we will tackle this computational challenge by using local quadratic functions to approximate the optimal expected reward, which does not have a closed-form expression in general. We will show the efficacy of our proposed method, and the significant improvement in the computational time.

[1]  John Lygeros,et al.  Discrete time stochastic hybrid dynamical games: Verification & controller synthesis , 2011, IEEE Conference on Decision and Control and European Control Conference.

[2]  N. Roy,et al.  The Belief Roadmap: Efficient Planning in Belief Space by Factoring the Covariance , 2009, Int. J. Robotics Res..

[3]  Pascal Poupart,et al.  Point-Based Value Iteration for Continuous POMDPs , 2006, J. Mach. Learn. Res..

[4]  John Lygeros,et al.  Verification of discrete time stochastic hybrid systems: A stochastic reach-avoid decision problem , 2010, Autom..

[5]  Dana Kulic,et al.  Estimating intent for human-robot interaction , 2003 .

[6]  Domitilla Del Vecchio,et al.  Control of hybrid automata with hidden modes: Translation to a perfect state information problem , 2010, 49th IEEE Conference on Decision and Control (CDC).

[7]  M. Oishi,et al.  Computational Techniques for Reachability Analysis of Partially Observable Discrete Time Stochastic Hybrid Systems , 2014, 1404.5906.

[8]  Meeko M. K. Oishi,et al.  Reachability for partially observable discrete time stochastic hybrid systems , 2014, Autom..

[9]  Alex Pentland,et al.  Modeling and Prediction of Human Behavior , 1999, Neural Computation.

[10]  Ron Alterovitz,et al.  Motion planning under uncertainty using iterative local optimization in belief space , 2012, Int. J. Robotics Res..

[11]  Yiannis Demiris,et al.  Prediction of intent in robotics and multi-agent systems , 2007, Cognitive Processing.

[12]  Satoshi Kagami,et al.  A probabilistic model of human motion and navigation intent for mobile robot path planning , 2000, 2009 4th International Conference on Autonomous Robots and Agents.

[13]  Brian C. Williams,et al.  Mode Estimation of Probabilistic Hybrid Systems , 2002, HSCC.

[14]  Tetsuo Tomiyama,et al.  Human-Intent Detection and Physically Interactive Control of a Robot Without Force Sensors , 2010, IEEE Transactions on Robotics.

[15]  John Lygeros,et al.  Probabilistic reachability and safety for controlled discrete time stochastic hybrid systems , 2008, Autom..

[16]  A. Pentland,et al.  Towards real-time recognition of driver intentions , 1997, Proceedings of Conference on Intelligent Transportation Systems.

[17]  Domitilla Del Vecchio,et al.  Safety Control of Hidden Mode Hybrid Systems , 2012, IEEE Transactions on Automatic Control.

[18]  Claire J. Tomlin,et al.  Optimal control of partially observable discrete time stochastic hybrid systems for safety specifications , 2013, 2013 American Control Conference.

[19]  Pradip Sheth,et al.  User intent in a shared control framework for pedestrian mobility aids , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[20]  Emilio Frazzoli,et al.  Hidden mode tracking control for a class of hybrid systems , 2013, 2013 American Control Conference.

[21]  Nikos A. Vlassis,et al.  Perseus: Randomized Point-based Value Iteration for POMDPs , 2005, J. Artif. Intell. Res..

[22]  John Lygeros,et al.  Computational Approaches to Reachability Analysis of Stochastic Hybrid Systems , 2007, HSCC.