Understanding and executing instructions for everyday manipulation tasks from the World Wide Web

Service robots will have to accomplish more and more complex, open-ended tasks and regularly acquire new skills. In this work, we propose a new approach to the problem of generating plans for such household robots. Instead composing them from atomic actions — the common approach in robot planning — we propose to transform task descriptions on web sites like ehow.com into executable robot plans. We present methods for automatically converting the instructions from natural language into a formal, logic-based representation, for resolving the word senses using the WordNet database and the Cyc ontology, and for exporting the generated plans into the mobile robot's plan language RPL. We discuss the problem of inferring information that is missing in these descriptions and the problem of grounding the abstract task descriptions in the perception and action system, and we propose techniques for solving them. The whole system works autonomously without human interaction. It has successfully been tested with a set of about 150 natural language directives, of which up to 80% could be correctly transformed.

[1]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[2]  Armin Müller Transformational Planning for Autonomous Household Robots using Libraries of Robust and Flexible Plans , 2008 .

[3]  Michael Beetz Concurrent reactive plans: anticipating and forestalling execution failures , 2000 .

[4]  Rohit J. Kate,et al.  Learning to Transform Natural to Formal Languages , 2005, AAAI.

[5]  Michael J. Witbrock,et al.  An Introduction to the Syntax and Content of Cyc , 2006, AAAI Spring Symposium: Formalizing and Compiling Background Knowledge and Its Applications to Knowledge Representation and Question Answering.

[6]  John S. Zelek,et al.  Human-robot interaction with minimal spanning natural language template for autonomous and tele-operated control , 1997, Proceedings of the 1997 IEEE/RSJ International Conference on Intelligent Robot and Systems. Innovative Robotics for Real-World Applications. IROS '97.

[7]  Stefanie Tellex,et al.  Spatial routines for a simulated speech-controlled vehicle , 2006, HRI '06.

[8]  Michael Beetz,et al.  Plan-Based Control of Robotic Agents , 2002, Lecture Notes in Computer Science.

[9]  Matthai Philipose,et al.  Mining models of human activities from the web , 2004, WWW '04.

[10]  Dejan Pangercic,et al.  Visual Scene Detection and Interpretation using Encyclopedic Knowledge and Formal Description Logic , 2009 .

[11]  Drew McDermott,et al.  A reactive plan language , 1991 .

[12]  Nico Blodow,et al.  The Assistive Kitchen — A demonstration scenario for cognitive technical systems , 2007, RO-MAN 2008 - The 17th IEEE International Symposium on Robot and Human Interactive Communication.

[13]  Michael Beetz,et al.  Accurate Human Motion Capture Using an Ergonomics-Based Anthropometric Human Model , 2008, AMDO.

[14]  Michael Beetz,et al.  Using Physics- and Sensor-based Simulation for High-Fidelity Temporal Projection of Realistic Robot Behavior , 2009, ICAPS.

[15]  Masayuki Inaba,et al.  Cooking for humanoid robot, a task that needs symbolic and geometric reasonings , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[16]  Nico Blodow,et al.  Towards 3D Point cloud based object maps for household environments , 2008, Robotics Auton. Syst..

[17]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[18]  Rakesh Gupta,et al.  Common Sense Data Acquisition for Indoor Mobile Robots , 2004, AAAI.

[19]  Tamim Asfour,et al.  A cognitive architecture for a humanoid robot: a first approach , 2005, 5th IEEE-RAS International Conference on Humanoid Robots, 2005..

[20]  Alexandra Kirsch,et al.  Integration of programming and learning in a control language for autonomous robots performing everyday activities , 2008 .

[21]  Michael Beetz,et al.  3D model selection from an internet database for robotic vision , 2009, 2009 IEEE International Conference on Robotics and Automation.

[22]  Michael Beetz,et al.  GrAM: Reasoning with Grounded Action Models by Combining Knowledge Representation and Data Mining , 2006, Towards Affordance-Based Robot Control.

[23]  Maria Fox,et al.  Modelling Mixed Discrete-Continuous Domains for Planning , 2006, J. Artif. Intell. Res..

[24]  Moritz Tenorth,et al.  KNOWROB — knowledge processing for autonomous personal robots , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[25]  Deb Roy,et al.  Grounded Situation Models for Robots: Where words and percepts meet , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.