Learning to Count by Think Aloud Imitation

Although necessary, learning to discover new solutions is often long and difficult, even for supposedly simple tasks such as counting. Learning by imitation, on the other hand, provides a simple way to acquire knowledge by watching other agents act. To learn by imitation tasks more complex than mere sequences of actions, a Think Aloud protocol is introduced, together with a new neuro-symbolic network. The network uses time in the same way as a Time Delay Neural Network and is augmented with basic first-order logic capabilities. Tested on a benchmark counting task, it learns very quickly and generalizes accurately, even though it has no initial bias toward counting.
