Modeling Natural Language Controlled Robotic Operations

There are multiple ways to control a robotic system. Most of them require the users to have prior knowledge about robots. Natural language based control is a very promising method due to its versatility, ease of use, and without the need for extensive training for novice users. Since natural language instructions from users cannot be understood by the robots directly, the linguistic input has to be processed into a formal representation which captures the task specification and removes the ambiguity inherent in natural language. For most of existing natural language controlled robotic system, they assume the given language instructions are already in correct orders. However, it is very likely for untrained users to give commands in a mixed order based on their direct observation and intuitive thinking. Simply following the order of the commands can lead to failures of tasks. To provide a remedy for the problem, we propose a framework named Dependency Relation Matrix (DRM) to model and organize the linguistic input, in order to figure out a feasible subtask sequence for later execution. Besides, the proposed approach projects both linguistic input and sensory information into the same space, and use the difference between the goal specification and temporal sensory feedback to drive the system moving forward. It also helps to monitor the progress of task execution by comparing the temporal sensory feedback and the goal configuration. In this paper, we describe the DRM framework in detail, and illustrate the utility of this approach with experiment results.

[1]  W. M. Wonham,et al.  The control of discrete event systems , 1989 .

[2]  Yunyi Jia,et al.  Teaching Robots New Actions through Natural Language Instructions , 2014, The 23rd IEEE International Symposium on Robot and Human Interactive Communication.

[3]  Stéphane Lafortune,et al.  Active diagnosis of discrete event systems , 1997, Proceedings of the 36th IEEE Conference on Decision and Control.

[4]  Albert S. Huang,et al.  Generalized Grounding Graphs: A Probabilistic Framework for Understanding Grounded Commands , 2017, ArXiv.

[5]  A. Michel,et al.  Lyapunov Stability of a Class of Discrete Event Systems , 1991 .

[6]  Hadas Kress-Gazit,et al.  Provably correct reactive control from natural language , 2015, Auton. Robots.

[7]  Michael Beetz,et al.  Towards robots conducting chemical experiments , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[8]  Luke S. Zettlemoyer,et al.  Learning to Parse Natural Language Commands to a Robot Control System , 2012, ISER.

[9]  Matthias Scheutz,et al.  Tell me when and why to do it! Run-time planner model updates via natural language instruction , 2012, 2012 7th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[10]  Kevin Lee,et al.  Tell me Dave: Context-sensitive grounding of natural language to manipulation instructions , 2014, Int. J. Robotics Res..

[11]  T. Lozano-Perez,et al.  Robot programming , 1983, Proceedings of the IEEE.

[12]  Yoshihiko Nakamura,et al.  Motion-Language Association Model for Human-Robot Communication , 2010, ISER.

[13]  Hadas Kress-Gazit,et al.  Temporal-Logic-Based Reactive Mission and Motion Planning , 2009, IEEE Transactions on Robotics.

[14]  Peter E. Caines,et al.  Dynamical consistency in hierarchical supervisory control , 2002, IEEE Trans. Autom. Control..

[15]  Yunyi Jia,et al.  Saliency-Guided Detection of Unknown Objects in RGB-D Indoor Scenes , 2015, Sensors.

[16]  Manuela M. Veloso,et al.  Using dialog and human observations to dictate tasks to a learning robot assistant , 2008, Intell. Serv. Robotics.

[17]  Albert S. Huang,et al.  Generalized Grounding Graphs: A Probabilistic Framework for Understanding Grounded Language , 2013 .

[18]  Yunyi Jia,et al.  Back to the Blocks World: Learning New Actions through Situated Human-Robot Dialogue , 2014, SIGDIAL Conference.

[19]  Benjamin Kuipers,et al.  Walk the Talk: Connecting Language, Knowledge, and Action in Route Instructions , 2006, AAAI.

[20]  Guido Bugmann,et al.  Personal Robot Training via Natural-Language Instructions. , 2001 .

[21]  Yunyi Jia,et al.  Analytic approach for natural language based supervisory control of robotic manipulations , 2016, 2016 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[22]  Yoshihiko Nakamura,et al.  Generating action descriptions from statistically integrated representations of human motions and sentences , 2016, Neural Networks.

[23]  Geoffrey Biggs,et al.  A Survey of Robot Programming Systems , 2010 .