Symbolization and Imitation Learning of Motion Sequence Using Competitive Modules

In this research the authors evaluate a new method for control using several prediction models and recognition of movement series. In MOSAIC (MOdule Selection And Identification for Control), which uses a prediction model with several modules as proposed by Wolpert and Kawato (1998), a module that pairs a prediction model which predicts the future state to be controlled and a controller are switched and assembled based on the size of the prediction error in the prediction model. The authors propose a method using MOSAIC to divide continuous time patterns for human or robot movement into their constituent parts as several series of movement elements. Moreover, the authors evaluate a method to recognize movement patterns of another person using one's own module and imitation learning based on this method. From the results of simulations of acrobot control, the authors show that symbolization of movement patterns and imitation learning based on that are possible. © 2006 Wiley Periodicals, Inc. Electron Comm Jpn Pt 3, 89(9): 42–53, 2006; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/ecjc.20267

[1]  G. Rizzolatti,et al.  Premotor cortex and the recognition of motor actions. , 1996, Brain research. Cognitive brain research.

[2]  Mitsuo Kawato,et al.  Multiple Paired Forward-Inverse Models for Human Motor Learning and Control , 1998, NIPS.

[3]  M. Arbib,et al.  Language within our grasp , 1998, Trends in Neurosciences.

[4]  A. Goldman,et al.  Mirror neurons and the simulation theory of mind-reading , 1998, Trends in Cognitive Sciences.

[5]  W.A. Gruver,et al.  Design limitations of PD versus fuzzy controllers for the Acrobot , 1997, Proceedings of International Conference on Robotics and Automation.

[6]  D M Wolpert,et al.  Multiple paired forward and inverse models for motor control , 1998, Neural Networks.

[7]  Mitsuo Kawato,et al.  MOSAIC Model for Sensorimotor Learning and Control , 2001, Neural Computation.

[8]  M. Kawato,et al.  Cerebro-cerebellar functional connectivity revealed by the laterality index in tool-use learning. , 1999, Neuroreport.

[9]  Mark W. Spong,et al.  The swing up control problem for the Acrobot , 1995 .

[10]  Mitsuo Kawato,et al.  Multiple Model-Based Reinforcement Learning , 2002, Neural Computation.

[11]  Hiroshi Imamizu,et al.  Human cerebellar activity reflecting an acquired internal model of a new tool , 2000, Nature.

[12]  Gary Boone,et al.  Efficient reinforcement learning: model-based Acrobot control , 1997, Proceedings of International Conference on Robotics and Automation.

[13]  G. Rizzolatti,et al.  Understanding motor events: a neurophysiological study , 2004, Experimental Brain Research.

[14]  W. Fleming,et al.  Controlled Markov processes and viscosity solutions , 1992 .