Exploratory learning structures in artificial cognitive systems

The major goal of the COSPAL project is to develop an artificial cognitive system architecture, with the ability to autonomously extend its capabilities. Exploratory learning is one strategy that allows an extension of competences as provided by the environment of the system. Whereas classical learning methods aim at best for a parametric generalization, i.e., concluding from a number of samples of a problem class to the problem class itself, exploration aims at applying acquired competences to a new problem class, and to apply generalization on a conceptual level, resulting in new models. Incremental or online learning is a crucial requirement to perform exploratory learning. In the COSPAL project, we mainly investigate reinforcement-type learning methods for exploratory learning, and in this paper we focus on the organization of cognitive systems for efficient operation. Learning is used over the entire system. It is organized in the form of four nested loops, where the outermost loop reflects the user-reinforcement-feedback loop, the intermediate two loops switch between different solution modes at symbolic respectively sub-symbolic level, and the innermost loop performs the acquired competences in terms of perception-action cycles. We present a system diagram which explains this process in more detail. We discuss the learning strategy in terms of learning scenarios provided by the user. This interaction between user ('teacher') and system is a major difference to classical robotics systems, where the system designer places his world model into the system. We believe that this is the key to extendable robust system behavior and successful interaction of humans and artificial cognitive systems. We furthermore address the issue of bootstrapping the system, and, in particular, the visual recognition module. We give some more in-depth details about our recognition method and how feedback from higher levels is implemented. The described system is however work in progress and no final results are available yet. The available preliminary results that we have achieved so far, clearly point towards a successful proof of the architecture concept.

[1]  R. Passingham,et al.  Objects automatically potentiate action: an fMRI study of implicit processing , 2003, The European journal of neuroscience.

[2]  Hans-Hellmut Nagel,et al.  Cognitive Vision Systems, Sampling the Spectrum of Approaches [based on a Dagstuhl seminar] , 2006, Cognitive Vision Systems.

[3]  Hans Knutsson,et al.  Signal processing for computer vision , 1994 .

[4]  Giulio Sandini,et al.  A Survey of Artificial Cognitive Systems: Implications for the Autonomous Development of Mental Capabilities in Computational Agents , 2007, IEEE Transactions on Evolutionary Computation.

[5]  Keiji Tanaka,et al.  Neuronal Correlates of Goal-Based Motor Selection in the Prefrontal Cortex , 2003, Science.

[6]  Ronald C. Arkin,et al.  An Behavior-based Robotics , 1998 .

[7]  Gösta H. Granlund,et al.  Organization of Architectures for Cognitive Vision Systems , 2006, Cognitive Vision Systems.

[8]  David Windridge,et al.  Unsupervised Symbol Grounding and Cognitive Bootstrapping in Cognitive Vision , 2005, ICIAP.

[9]  Václav Hlaváč A Novel Algorithm for Learning Support Vector Machines with Structured Output Spaces , 2006 .

[10]  Thomas S. Huang,et al.  Learning to extract temporal signal patterns from temporal signal sequence , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[11]  David Windridge,et al.  Open-Ended Inference of Relational Representations in the COSPAL Perception-Action Architecture , 2007 .

[12]  Gerald Sommer,et al.  On-line Learning with Dynamic Cell Structures , 2004 .

[13]  Alan D. Christiansen,et al.  Learning reliable manipulation strategies without initial physical models , 1990, Proceedings., IEEE International Conference on Robotics and Automation.

[14]  Rodney A. Brooks,et al.  A Robust Layered Control Syste For A Mobile Robot , 2022 .

[15]  Ralf Salomon Improving the DAC architecture by using proprioceptive sensors , 1998 .

[16]  Ehud Rivlin,et al.  The AV-Shell , 1995 .

[17]  Nicholas R. Jennings,et al.  Intelligent agents: theory and practice , 1995, The Knowledge Engineering Review.

[18]  Yiannis Aloimonos,et al.  Exploratory active vision: theory , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[19]  Jiri Matas,et al.  Linear Predictors for Fast Simultaneous Modeling and Tracking , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[20]  David Windridge,et al.  Visual Bootstrapping for Unsupervised Symbol Grounding , 2006, ACIVS.

[21]  David Windridge,et al.  Cognitive Bootstrapping: A Survey of Bootstrap Mechanisms for Emergent Cognition , 2005 .

[22]  G. Aschersleben,et al.  The Theory of Event Coding (TEC): a framework for perception and action planning. , 2001, The Behavioral and brain sciences.

[23]  Y. Demiris,et al.  From motor babbling to hierarchical learning by imitation: a robot developmental pathway , 2005 .

[24]  Florian Hoppe Local learning for visual robotic systems , 2007 .

[25]  R. Held,et al.  MOVEMENT-PRODUCED STIMULATION IN THE DEVELOPMENT OF VISUALLY GUIDED BEHAVIOR. , 1963, Journal of comparative and physiological psychology.

[26]  Rajesh P. N. Rao,et al.  An Active Vision Architecture Based on Iconic Representations , 1995, Artif. Intell..

[27]  Ulf Ziemann Sensory‐motor integration in human motor cortex at the pre‐motoneurone level: beyond the age of simple MEP measurements , 2001, The Journal of physiology.

[28]  Alexander Riegler,et al.  When is a cognitive system embodied? , 2002, Cognitive Systems Research.

[29]  Linda G. Shapiro,et al.  Computer and Robot Vision , 1991 .

[30]  D. Stuss,et al.  Cognitive neuroscience. , 1993, Current opinion in neurobiology.

[31]  Á. Pascual-Leone,et al.  The role of the dorsolateral prefrontal cortex during sequence learning is specific for spatial information. , 2001, Cerebral cortex.

[32]  Salvatore Gaglio,et al.  A Cognitive Architecture for Artificial Vision , 1997, Artif. Intell..

[33]  M. Felsberg,et al.  Correspondence-free Associative Learning , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[34]  Michael Felsberg,et al.  Channel smoothing: efficient robust smoothing of low-level signal features , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Michael S. Gazzaniga,et al.  The Social Brain: Discovering the Networks of the Mind , 1985 .

[36]  Richard Bowden,et al.  Learning Distances for Arbitrary Visual Features , 2006, BMVC.

[37]  C. Frith The social brain? , 2007, Philosophical Transactions of the Royal Society B: Biological Sciences.

[38]  Michael Felsberg,et al.  Efficient computation of channel-coded feature maps through piecewise polynomials , 2009, Image Vis. Comput..

[39]  Jiri Matas,et al.  Geometric Hashing with Local Affine Frames , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[40]  Bernhard Schölkopf,et al.  View-Based Cognitive Mapping and Path Planning , 1995, Adapt. Behav..

[41]  Ronen Basri,et al.  Recognition by Linear Combinations of Models , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[42]  Harry Wechsler,et al.  Integrating perception, action and learning , 1991, SGAR.

[43]  A. Schwartz,et al.  Differential Representation of Perception and Action in the Frontal Cortex , 2004, Science.

[44]  Tai Sing Lee,et al.  Hierarchical Bayesian inference in the visual cortex. , 2003, Journal of the Optical Society of America. A, Optics, image science, and vision.

[45]  Michael Felsberg,et al.  P-Channels: Robust Multivariate M-Estimation of Large Datasets , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[46]  Björn Johansson,et al.  The application of an oblique-projected Landweber method to a model of supervised learning , 2006, Math. Comput. Model..

[47]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[48]  Peter Redgrave,et al.  Layered Control Architectures in Robots and Vertebrates , 1999, Adapt. Behav..

[49]  Michael Felsberg,et al.  Real-Time Visual Recognition of Objects and Scenes Using P-Channel Matching , 2007, SCIA.

[50]  Jiří Matas,et al.  Wald's Sequential Analysis for Time-constrained Vision Problems , 2008 .

[51]  Emanuele Trucco,et al.  Computer and Robot Vision , 1995 .

[52]  Gösta H. Granlund,et al.  A Cognitive Vision Architecture Integrating Neural Networks with Symbolic Processing , 2005, Künstliche Intell..

[53]  J. Fuster,et al.  From perception to action: temporal integrative functions of prefrontal and parietal neurons. , 1999, Cerebral cortex.

[54]  Michael Felsberg,et al.  Accurate Interpolation in Appearance-Based Pose Estimation , 2007, SCIA.

[55]  Gerald Sommer,et al.  An Adaptive Classification Algorithm Using Robust Incremental Clustering , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[56]  Dirk Timmermann,et al.  Role-based Strategies at Example of RoboCup , 2005, Artificial Intelligence and Applications.

[57]  Gösta H. Granlund,et al.  An Associative Perception-Action Structure Using a Localized Space Variant Information Representation , 2000, AFPAC.

[58]  Ben J. A. Kröse,et al.  Distributed adaptive control: The self-organization of structured behavior , 1992, Robotics Auton. Syst..

[59]  Tomaso Poggio,et al.  Image Representations for Visual Learning , 1996, Science.

[60]  Per-Erik Forssén,et al.  Channel Associative Networks for Multiple Valued Mappings , 2006 .

[61]  Friedemann Pulvermüller,et al.  Brain mechanisms linking language and action , 2005, Nature Reviews Neuroscience.

[62]  Giulio Sandini,et al.  Developmental robotics: a survey , 2003, Connect. Sci..

[63]  S. Anderson,et al.  Attentional processes link perception and action , 2002, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[64]  Norbert Krüger,et al.  Three Dilemmas of Signal- and Symbol-Based Representations in Computer Vision , 2005, BVAI.

[65]  T. Poggio,et al.  A network that learns to recognize three-dimensional objects , 1990, Nature.

[66]  Ralf Der,et al.  From Motor Babbling to Purposive Actions: Emerging Self-exploration in a Dynamical Systems Approach to Early Robot Development , 2006, SAB.

[67]  Ruzena Bajcsy,et al.  The Problem of Signal and Symbol Integration: A Study of Cooperative Mobile Autonomous Agent Behaviors , 1995, KI.

[68]  R. A. Brooks,et al.  Intelligence without Representation , 1991, Artif. Intell..

[69]  Roger W. Sperry,et al.  Science and moral priority: Merging mind, brain, and human values , 1983 .

[70]  Michael Felsberg,et al.  Visual Servoing for Floppy Robots using LWPR , 2007 .

[71]  Tony R. Martinez,et al.  Reduction Techniques for Instance-Based Learning Algorithms , 2000, Machine Learning.

[72]  Per-Erik Forssén,et al.  Autonomous Learning of Object Appearances using Colour Contour Frames , 2006, The 3rd Canadian Conference on Computer and Robot Vision (CRV'06).

[73]  Michael Felsberg,et al.  Real-time view-based pose recognition and interpolation for tracking initialization , 2007, Journal of Real-Time Image Processing.

[74]  Michael Felsberg,et al.  A COSPAL Subsystem : Solving a Shape-Sorter Puzzle , 2005, AAAI 2005.

[75]  Richard Bowden,et al.  Problem solving through imitation , 2009, Image Vis. Comput..

[76]  Stephan Al-Zubi,et al.  Imitation Learning and Transferring of Human Movement and Hand Grasping to Adapt to Environment Changes , 2006, Human Motion.

[77]  Ehud Rivlin,et al.  AV-Shell, an Environment for Autonomous Robotic Applications Using Active Vision , 1999, Auton. Robots.

[78]  Gerald Sommer,et al.  Fusion Algorithm for Locally Arranged Linear Models , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[79]  J. PaulF.M.,et al.  Distributed Adaptive Control : Explorations in robotics andthe biology of learning , 1998 .

[80]  S. P. Springer,et al.  Left brain, right brain , 1981 .

[81]  Richard Bowden,et al.  Learning Responses to Visual Stimuli: A Generic Approach , 2007 .

[82]  Domenico Parisi,et al.  Using Motor Babbling and Hebb Rules for Modeling the Development of Reaching with Obstacles and Grasping , 2008 .

[83]  William Grimson,et al.  Object recognition by computer - the role of geometric constraints , 1991 .

[84]  Gösta H. Granlund,et al.  The complexity of vision , 1999, Signal Process..

[85]  J. Kevin O'Regan,et al.  Is There Something Out There? Inferring Space from Sensorimotor Dependencies , 2003, Neural Computation.

[86]  Ehud Rivlin,et al.  Functional 3D Object Classification Using Simulation of Embodied Agent , 2006, BMVC.

[87]  M. Jeannerod The cognitive neuroscience of action , 1997, Trends in Cognitive Sciences.

[88]  Gösta H. Granlund Does Vision Inevitably Have to be Active , 1998 .

[89]  C. Shatz,et al.  Synaptic Activity and the Construction of Cortical Circuits , 1996, Science.

[90]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[91]  B. Hochner,et al.  Control of Octopus Arm Extension by a Peripheral Motor Program , 2001, Science.

[92]  Tomaso Poggio,et al.  Computational Models of Object Recognition in Cortex: A Review , 2000 .

[93]  David Vernon,et al.  Cognitive Vision: the Case for Embodied Perception , 2005 .