Robot-Control Based on Extended Markov Tracking: Initial Experiments

Extended Markov Tracking (EMT) is a computationally tractable method for the online estimation of Markovian system dynamics. In this paper, we present our initial experimentation with EMT-based control applied to robotic motion. In our experiments, a robot uses a predetermined mapping of the world onto an abstract model, over which EMT Control is applied; this dictates the choice of an abstract action, which in turn is mapped back into actual robot operation. Simulations in which a robot was constrained to follow a target show that although the abstract model was (intentionally) only weakly coherent with the real dynamics of the robot’s world, EMT Control was able to provide reasonable performance. We also demonstrate that EMT-provided data provides sensible information for action model calibration. We do so by constructing a calibration scheme based on a training technique and simple data statistics. The scheme is then validated by carrying out additional robot motion control simulations, using the calibrated abstract model.

[1]  Hung Hai Bui,et al.  A General Model for Online Probabilistic Plan Recognition , 2003, IJCAI.

[2]  Richard T. Vaughan,et al.  The Player/Stage Project: Tools for Multi-Robot and Distributed Sensor Systems , 2003 .

[3]  Jim Blythe,et al.  Planning Under Uncertainty in Dynamic Domains , 1998 .

[4]  Peter Stone,et al.  Simultaneous Calibration of Action and Sensor Models on a Mobile Robot , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[5]  Manuela M. Veloso,et al.  Recognizing Probabilistic Opponent Movement Models , 2001, RoboCup.

[6]  Sebastian Thrun,et al.  Online self-calibration for mobile robots , 1999, Proceedings 1999 IEEE International Conference on Robotics and Automation (Cat. No.99CH36288C).

[7]  Richard Fikes,et al.  STRIPS: A New Approach to the Application of Theorem Proving to Problem Solving , 1971, IJCAI.

[8]  Michael P. Wellman,et al.  Probabilistic State-Dependent Grammars for Plan Recognition , 2000, UAI.

[9]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[10]  Eugene Fink,et al.  Integrating planning and learning: the PRODIGY architecture , 1995, J. Exp. Theor. Artif. Intell..

[11]  Edmund H. Durfee,et al.  A Survey of Research in Distributed, Continual Planning , 1999, AI Mag..

[12]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[13]  Dana S. Nau,et al.  Control Strategies in HTN Planning: Theory Versus Practice , 1998, AAAI/IAAI.

[14]  Richard Washington,et al.  Markov tracking for agent coordination , 1998, AGENTS '98.

[15]  Michael P. Wellman,et al.  Planning and Control , 1991 .