A Game-Theoretic Approach to Generating Spatial Descriptions

Language is sensitive to both semantic and pragmatic effects. To capture both effects, we model language use as a cooperative game between two players: a speaker, who generates an utterance, and a listener, who responds with an action. Specifically, we consider the task of generating spatial references to objects, wherein the listener must accurately identify an object described by the speaker. We show that a speaker model that acts optimally with respect to an explicit, embedded listener model substantially outperforms one that is trained to directly generate spatial descriptions.
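The idea of a speaker that acts optimally with respect to an embedded listener can be illustrated with a small sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the hand-written relation semantics in `RELATIONS`, the uniform `listener`, the `(relation, landmark)` utterance form, and the toy scene. The speaker simply chooses the utterance that maximizes the probability that the listener picks the intended object.

```python
from typing import Dict, Tuple

Point = Tuple[float, float]
Scene = Dict[str, Point]          # object name -> (x, y) position
Utterance = Tuple[str, str]       # (relation, landmark), e.g. ("right_of", "O3")

# Hypothetical hand-written semantics: each relation tests whether an
# object at position a stands in that relation to a landmark at position b.
RELATIONS = {
    "right_of": lambda a, b: a[0] > b[0],
    "left_of": lambda a, b: a[0] < b[0],
    "above": lambda a, b: a[1] > b[1],
    "below": lambda a, b: a[1] < b[1],
}


def listener(utterance: Utterance, scene: Scene) -> Dict[str, float]:
    """Toy listener: uniform guess over objects consistent with the utterance."""
    relation, landmark = utterance
    holds = RELATIONS[relation]
    matches = [
        name
        for name, pos in scene.items()
        if name != landmark and holds(pos, scene[landmark])
    ]
    # An utterance that matches nothing yields an empty distribution.
    return {name: 1.0 / len(matches) for name in matches}


def speaker(scene: Scene, target: str) -> Utterance:
    """Choose the utterance that maximizes the listener's chance of success."""
    candidates = [
        (relation, landmark)
        for relation in RELATIONS
        for landmark in scene
        if landmark != target
    ]
    return max(candidates, key=lambda u: listener(u, scene).get(target, 0.0))


scene = {"O1": (2.0, 0.0), "O2": (0.0, 0.0), "O3": (1.0, 1.0)}
best = speaker(scene, "O1")
print(best, listener(best, scene))
```

In this scene, "right of O2" is true of the target O1 but also of O3, so the listener would guess correctly only half the time. A speaker that consults the listener prefers "right of O3", which identifies O1 uniquely. A speaker that emitted any true description without consulting the listener would have no reason to prefer one over the other.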
