Deep learning of biomimetic visual perception for virtual humans

Future generations of advanced, autonomous virtual humans will likely require artificial vision systems that more accurately model the human biological vision system. With this in mind, we propose a strongly biomimetic model of visual perception within a novel framework for human sensorimotor control. Our framework features a biomechanically simulated, musculoskeletal human model actuated by numerous skeletal muscles, with two human-like eyes whose retinas have spatially nonuniform distributions of photoreceptors not unlike biological retinas. The retinal photoreceptors capture the scene irradiance that reaches them, which is computed using ray tracing. Within the sensory subsystem of our model, which continuously operates on the photoreceptor outputs, are 10 automatically-trained, deep neural networks (DNNs). A pair of DNNs drive eye and head movements, while the other 8 DNNs extract the sensory information needed to control the arms and legs. Thus, exclusively by means of its egocentric, active visual perception, our biomechanical virtual human learns, by synthesizing its own training data, efficient, online visuomotor control of its eyes, head, and limbs to perform tasks involving the foveation and visual pursuit of target objects coupled with visually-guided reaching actions to intercept the moving targets.

[1]  Carol O'Sullivan,et al.  Synthetic Vision and Memory for Autonomous Virtual Humans , 2002, Comput. Graph. Forum.

[2]  António Lucas Soares,et al.  An Efficient Synthetic Vision System for 3D Multi-character Systems , 2003, IVA.

[3]  J M Miller,et al.  Visual acuity modeling using optical raytracing of schematic eyes. , 1995, American journal of ophthalmology.

[4]  Dinesh K. Pai,et al.  Eyecatch: simulating visuomotor coordination for object interception , 2012, ACM Trans. Graph..

[5]  Dana H. Ballard,et al.  Modeling embodied visual behaviors , 2007, TAP.

[6]  Eftychios Sifakis,et al.  Comprehensive biomechanical modeling and simulation of the upper body , 2009, TOGS.

[7]  J. Pettré,et al.  A synthetic-vision based steering approach for crowd simulation , 2010, ACM Trans. Graph..

[8]  Daniel Thalmann,et al.  Navigation for digital actors based on synthetic vision, memory, and learning , 1995, Comput. Graph..

[9]  Daniel Thalmann,et al.  Autonomous Virtual Actors Based on Virtual Sensors , 1997, Creating Personalities for Synthetic Actors.

[10]  Razvan Pascanu,et al.  Theano: A CPU and GPU Math Compiler in Python , 2010, SciPy.

[11]  Craig W. Reynolds Flocks, herds, and schools: a distributed behavioral model , 1987, SIGGRAPH.

[12]  Demetri Terzopoulos,et al.  Active Perception in Virtual Humans , 2000 .

[13]  J. Koenderink,et al.  Visual detection of spatial contrast; Influence of location in the visual field, target extent and illuminance level , 1978, Biological Cybernetics.

[14]  Michael F. Deering,et al.  A photon accurate model of the human eye , 2005, SIGGRAPH '05.

[15]  Stewart W. Wilson On the Retino-Cortical Mapping , 1983, Int. J. Man Mach. Stud..

[16]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[17]  Jean-Claude Latombe,et al.  Fast synthetic vision, memory, and learning models for virtual humans , 1999, Proceedings Computer Animation 1999.

[18]  Dinesh K. Pai,et al.  Fast ray-tracing of human eye optics on Graphics Processing Units , 2014, Comput. Methods Programs Biomed..

[19]  E. L. Schwartz,et al.  Spatial mapping in the primate sensory projection: Analytic structure and relevance to perception , 1977, Biological Cybernetics.

[20]  Demetri Terzopoulos,et al.  Autonomous pedestrians , 2005, SCA '05.

[21]  E. Schwartz,et al.  Space-variant computer vision: a graph-theoretic approach , 2004 .

[22]  Demetri Terzopoulos,et al.  Animat vision: Active vision in artificial animals , 1995, Proceedings of IEEE International Conference on Computer Vision.

[23]  Bruno Arnaldi,et al.  A new application for saliency maps: synthetic vision of autonomous actors , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[24]  Demetri Terzopoulos,et al.  Artificial fishes: physics, locomotion, perception, behavior , 1994, SIGGRAPH.

[25]  Tao Zhou,et al.  Deep learning of biomimetic sensorimotor control for biomechanical human animation , 2018, ACM Trans. Graph..

[26]  Daniel Thalmann,et al.  A vision-based approach to behavioural animation , 1990, Comput. Animat. Virtual Worlds.

[27]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.