Learning linguistic constructions grounded in qualitative action models

Aiming at the design of adaptive artificial agents which are able to learn autonomously from experience and human tutoring, in this paper we present a system for learning syntactic constructions grounded in perception. These constructions are learned from examples of natural language utterances and parallel performances of actions, i.e. their trajectories and involved objects. From the input, the system learns linguistic structures and qualitative action models. Action models are represented as Hidden Markov Models over sequences of qualitative relations between a trajector and a landmark and abstract away from concrete action trajectories. Learning of action models is driven by linguistic observations, and linguistic patterns are, in turn, grounded in learned action models. The proposed system is applicable for both language understanding and language generation. We present empirical results, showing that the learned action models generalize well over concrete instances of the same action and also to novel performers, while allowing accurate discrimination between different actions. Further, we show that the system is able to describe novel dynamic scenes and to understand novel utterances describing such scenes.

[1]  Tiago V. Maia,et al.  Learning Grammatical Constructions , 2001 .

[2]  Mark Steedman,et al.  A Probabilistic Model of Syntactic and Semantic Acquisition from Child-Directed Utterances and their Meanings , 2012, EACL.

[3]  Raúl Rojas,et al.  Theorie der neuronalen Netze , 1993 .

[4]  Peter Ford Dominey,et al.  Exploring the acquisition and production of grammatical constructions through human-robot interaction with echo state networks , 2014, Front. Neurorobot..

[5]  Katsushi Ikeuchi,et al.  Modeling manipulation interactions by hidden Markov models , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[6]  Biing-Hwang Juang,et al.  The segmental K-means algorithm for estimating parameters of hidden Markov models , 1990, IEEE Trans. Acoust. Speech Signal Process..

[7]  Stefan Wermter,et al.  Embodied Language Understanding with a Multiple Timescale Recurrent Neural Network , 2013, ICANN.

[8]  Afsaneh Fazly,et al.  A Probabilistic Computational Model of Cross-Situational Word Learning , 2010, Cogn. Sci..

[9]  Nico Van de Weghe,et al.  A Qualitative Trajectory Calculus and the Composition of Its Relations , 2005, GeoS.

[10]  Peter Ford Dominey,et al.  Learning to talk about events from narrated video in a construction grammar framework , 2005, Artif. Intell..

[11]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[12]  Philipp Cimiano,et al.  A Computational Model for the Item-Based Induction of Construction Networks , 2014, Cogn. Sci..

[13]  J. Siskind A computational study of cross-situational techniques for learning word-to-meaning mappings , 1996, Cognition.

[14]  Luc Steels,et al.  Fluid Construction Grammar: The New Kid on the Block , 2012, EACL.

[15]  Alex Pentland,et al.  Learning words from sights and sounds: a computational model , 2002, Cogn. Sci..

[16]  Friedrich M. Wahl,et al.  Acquisition of statistical motion patterns in dynamic environments and their application to mobile robot motion planning , 1997, Proceedings of the 1997 IEEE/RSJ International Conference on Intelligent Robot and Systems. Innovative Robotics for Real-World Applications. IROS '97.

[17]  Jerome A. Feldman,et al.  From Molecule to Metaphor - A Neural Theory of Language , 2006 .

[18]  Satoshi Nakamura,et al.  Learning, Generation and Recognition of Motions by Reference-Point-Dependent Probabilistic Models , 2011, Adv. Robotics.

[19]  Stefanie Tellex,et al.  Object schemas for grounding language in a responsive robot , 2008, Connect. Sci..

[20]  Nancy Chang,et al.  STRUCTURED CONNECTIONIST MODELS OF LANGUAGE, COGNITION AND ACTION , 2005 .

[21]  Andreas Stolcke,et al.  Hidden Markov Model} Induction by Bayesian Model Merging , 1992, NIPS.

[22]  Olivier Sigaud,et al.  Learning a repertoire of actions with deep neural networks , 2014, 4th International Conference on Development and Learning and on Epigenetic Robotics.