An integrated neural-symbolic cognitive agent architecture for training and assessment in simulators

Training and assessment of complex tasks has always been a complex task in itself. Training simulators can be used for training and assessment of low-order skills. High-order skills (e.g. safe driving, leadership, tactical manoeuvring, etc.) are generally trained and assessed by human experts, due to its complex nature (i.e. many temporal relations, biased behaviour and poorly documented). This paper proposes a new cognitive agent architecture that is able to model this complex behaviour and use it for the assessment and training of both low- and high-order skills. Therefore the agent integrates learning from observation, temporal logic and probalistic reasoning in a unified architecture that is based on Neural-Symbolic Learning and Reasoning. This so-called Neural Symbolic Cognitive Agent (NSCA) architecture combines encoding temporal logic based expert knowledge and learning new knowledge by observing experts and trainees during task execution in a simulator. The learned knowledge can be extracted in temporal logic rules for validation. Learning and reasoning is done using a Recurrent Temporal Restricted Boltzmann Machine (RTRBM). For training organizations, this provides a quicker, cost-saving and more objective evaluation of the trainee in simulation-based training. A prototype NSCA has been developed and tested as part of a three-year research project on assessment in driving simulators for training and certification, and will be tested in various other domains, such as jetfighter pilot training and strategic command and control training.

[1]  Michael E. Bratman,et al.  Intention, Plans, and Practical Reason , 1991 .

[2]  Geoffrey E. Hinton Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.

[3]  Artur S. d'Avila Garcez,et al.  A Connectionist Cognitive Model for Temporal Synchronisation and Learning , 2007, AAAI.

[4]  Dov M. Gabbay,et al.  Neural-Symbolic Cognitive Reasoning , 2008, Cognitive Technologies.

[5]  Eduardo Salas,et al.  Scaled Worlds: Development, Validation and Applications , 2004 .

[6]  Anand S. Rao,et al.  BDI Agents: From Theory to Practice , 1995, ICMAS.

[7]  H.L.H. de Penning,et al.  Integrating Training Simulations and e-Learning Systems: The SimSCORM Platform (Integratie van Training Simulaties en e-Learning Systemen: Het SimSCORM Platform) , 2008 .

[8]  Geoffrey E. Hinton,et al.  Deep Boltzmann Machines , 2009, AISTATS.

[9]  Bart Kosko,et al.  Neural networks and fuzzy systems: a dynamical systems approach to machine intelligence , 1991 .

[10]  Alan F. Murray,et al.  Continuous restricted Boltzmann machine with an implementable training algorithm , 2003 .

[11]  Gadi Pinkas,et al.  Reasoning, Nonmonotonicity and Learning in Connectionist Networks that Capture Propositional Knowledge , 1995, Artif. Intell..

[12]  Geoffrey E. Hinton,et al.  The Recurrent Temporal Restricted Boltzmann Machine , 2008, NIPS.

[13]  Ron Sun A neural network model of causality , 1994, IEEE Trans. Neural Networks.

[14]  Avelino J. Gonzalez,et al.  Learning tactical human behavior through observation of human performance , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[15]  MSc DIC PhD Artur S. d’Avila Garcez MEng,et al.  Neural-Symbolic Learning Systems , 2002, Perspectives in Neural Computing.