Learning to Follow Navigational Directions

We present a system that learns to follow navigational natural language directions. Where traditional models learn from linguistic annotation or word distributions, our approach is grounded in the world, learning by apprenticeship from routes through a map paired with English descriptions. Lacking an explicit alignment between the text and the reference path makes it difficult to determine what portions of the language describe which aspects of the route. We learn this correspondence with a reinforcement learning algorithm, using the deviation of the route we follow from the intended path as a reward signal. We demonstrate that our system successfully grounds the meaning of spatial terms like above and south into geometric properties of paths.

[1]  C. Tanz Studies in the acquisition of deictic terms , 1983 .

[2]  Leonard Talmy,et al.  How Language Structures Space , 1983 .

[3]  Anne H. Anderson,et al.  The Hcrc Map Task Corpus , 1991 .

[4]  Terry Regier,et al.  The Human Semantic Potential: Spatial Language and Constrained Connectionism , 1996 .

[5]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[6]  C. Fillmore Lectures on Deixis , 1997 .

[7]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[8]  Benjamin Kuipers,et al.  The Spatial Semantic Hierarchy , 2000, Artif. Intell..

[9]  S. Levinson Space in language and cognition: Explorations in cognitive diversity , 2003 .

[10]  Alan M. MacEachren,et al.  Communicating Vague Spatial Concepts in Human-GIS Interactions: A Collaborative Dialogue Approach , 2003, COSIT.

[11]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[12]  James Richard Curran,et al.  From distributional to semantic similarity , 2004 .

[13]  Pieter Abbeel,et al.  Apprenticeship learning via inverse reinforcement learning , 2004, ICML.

[14]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[15]  Deb Roy,et al.  Interpretation of Spatial Language in a Map Navigation Task , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[16]  Luke S. Zettlemoyer,et al.  Learning Context-Dependent Mappings from Sentences to Logical Form , 2009, ACL.

[17]  Luke S. Zettlemoyer,et al.  Reinforcement Learning for Mapping Instructions to Actions , 2009, ACL.

[18]  Nicholas Roy,et al.  Where to go: Interpreting natural directions using global inference , 2009, 2009 IEEE International Conference on Robotics and Automation.

[19]  Stefanie Tellex,et al.  Toward understanding natural language directions , 2010, 2010 5th ACM/IEEE International Conference on Human-Robot Interaction (HRI).