Learning and control in virtual reality for machine intelligence

This paper presents a Virtual Reality (VR) interactive platform for learning and control for machine intelligence based on Adaptive Dynamic Programming (ADP). Recent research results have provided strong evidences that ADP could be a key technique for brain-like intelligent systems design, and VR is a powerful human-computer interface which can provide a three-dimensional (3D) active virtual environment. Converge these two subjects, we design an interactive system to facilitate and demonstrate the learning and control in VR environment with ADP. Our main goal in this paper is two-fold. First, we demonstrate that VR could be a useful platform to demonstrate and visualize machine intelligence research through the simulated 3D environment. Second, the integration of VR into machine intelligence research can provide a powerful platform to simulate, validate and facilitate the real-time interaction between the intelligent system and an unstructured environment. We discuss the detailed design strategy of the VR platform, and also demonstrated the interactive system performance based on the triple link inverted pendulum benchmark.

[1]  Huaguang Zhang,et al.  Adaptive Dynamic Programming: An Introduction , 2009, IEEE Computational Intelligence Magazine.

[2]  C. Y. Kuo,et al.  Real time stabilisation of a triple link inverted pendulum using single control input , 1997 .

[3]  K. G. Eltohamy,et al.  Nonlinear optimal control of a triple link inverted pendulum with single control input , 1998 .

[4]  Richard S. Sutton,et al.  Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[5]  Akira Kageyama,et al.  Introduction to Virtual Reality Visualization by the CAVE system , 2005 .

[6]  Hilary McLellan Virtual reality : a selected bibliography , 1992 .

[7]  Jozef Novak-Marcincin Application of the Virtual Reality Modeling Language for Design of Automated Workplaces , 2007 .

[8]  Philippe Coiffet,et al.  Virtual Reality Technology , 1992 .

[9]  Haibo He,et al.  A three-network architecture for on-line learning and optimization based on adaptive dynamic programming , 2012, Neurocomputing.

[10]  F. Nagi,et al.  MODELING AND SIMULATION OF DIFFERENTIAL DRIVE MOBILE ROBOTICS SYSTEM , 2006 .

[11]  Haibo He,et al.  Adaptive Learning and Control for MIMO System Based on Adaptive Dynamic Programming , 2011, IEEE Transactions on Neural Networks.

[12]  Ricki G. Ingalls,et al.  Introduction to simulation , 2013, 2013 Winter Simulations Conference (WSC).

[13]  Jennie Si,et al.  Online learning control by association and reinforcement , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.