Visual gaze analysis of robotic pedestrians moving in urban space

This study is founded on the idea that an analysis of the visual gaze dynamics of pedestrians can increase our understanding of how important architectural features in urban environments are perceived by pedestrians. The results of such an analysis can lead to improvements in urban design. However, a technical challenge arises when trying to determine the gaze direction of pedestrians recorded on video. High ‘noise’ levels and the subtlety of human gaze dynamics hamper precise calculations. However, as robots can be programmed and analysed more efficiently than humans, this study uses them for developing and training a gaze analysis system with the aim to later apply it to human video data using the machine learning technique of manifold alignment. For this study, a laboratory was set up to become a model street scene in which autonomous humanoid robots of approximately 55 cm in height simulate the behaviour of human pedestrians. The experiments compare the inputs from several cameras as the robot walks down the model street and changes its behaviour upon encountering ‘visually attractive objects’. Overhead recordings and the robot's internal joint signals are analysed after filtering to provide ‘true’ data against which the recorded data can be compared for accuracy testing. A central component of the research is the calculation of a torus-like manifold that represents all the different three-dimensional (3D) head directions of a robot head and allows the ordering of extracted 3D gaze vectors obtained from video sequences. We briefly describe how the obtained multidimensional trajectory data can be analysed by using a temporal behaviour analysis technique based on support vector machines that was developed separately.

[1]  Sethuraman Panchanathan,et al.  Biased Manifold Embedding: A Framework for Person-Independent Head Pose Estimation , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Jacky Baltes,et al.  RoboCup 2009: Robot Soccer World Cup XIII [papers from the 13th annual RoboCup International Symposium, Graz, Austria, June 29 - July 5, 2009] , 2010, RoboCup.

[3]  Luc Van Gool,et al.  Populating Ancient Pompeii with Crowds of Virtual Romans , 2007, VAST.

[4]  Stefania Bandini,et al.  Modeling, Simulating, and Visualizing Crowd Dynamics with Computational Tools Based on Situated Cellular Agents , 2009 .

[5]  Daniel Thalmann,et al.  Virtual humans: thirty years of research, what next? , 2005, The Visual Computer.

[6]  Manuela M. Veloso,et al.  SSL-Vision: The Shared Vision System for the RoboCup Small Size League , 2009, RoboCup.

[7]  Nasser Kehtarnavaz,et al.  Real-Time Head Pose Estimation on Mobile Platforms , 2009 .

[8]  Stephan K. Chalup,et al.  Architectural evaluation of simulated pedestrian spatial behaviour , 2011 .

[9]  Mohan M. Trivedi,et al.  Head Pose Estimation in Computer Vision: A Survey , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Amy H Auchincloss,et al.  A new tool for epidemiology: the usefulness of dynamic-agent models in understanding place effects on health. , 2008, American journal of epidemiology.

[11]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[12]  Nicu Sebe,et al.  Webcam-Based Visual Gaze Estimation , 2009, ICIAP.

[13]  Jean-Philippe Vert,et al.  Consistency and Convergence Rates of One-Class SVMs and Related Algorithms , 2006, J. Mach. Learn. Res..

[14]  Bernhard Schölkopf,et al.  Estimating the Support of a High-Dimensional Distribution , 2001, Neural Computation.

[15]  Wen Gao,et al.  Manifold Alignment via Corresponding Projections , 2010, BMVC.

[16]  W. Whyte The social life of small urban spaces , 1980 .

[17]  Alexandra Willis,et al.  The effects of a distractor on the visual gaze behavior of children at signalized road crossings , 2010 .

[18]  Takahiro Okabe,et al.  Gaze Estimation from Low Resolution Images , 2006, PSIVT.

[19]  Lakhmi C. Jain,et al.  New Advances in Virtual Humans: Artificial Intelligence Environment , 2008 .

[20]  Henrik I. Christensen,et al.  Computational visual attention systems and their cognitive foundations: A survey , 2010, TAP.

[21]  Daniel Thalmann,et al.  Level of Autonomy for Virtual Human Agents , 1999, ECAL.

[22]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[23]  Joshua B. Tenenbaum,et al.  Global Versus Local Methods in Nonlinear Dimensionality Reduction , 2002, NIPS.

[24]  S. C. Van der Spek,et al.  Urbanism on Track: Application of tracking technologies in urbanism , 2008 .

[25]  Stefania Bandini,et al.  Crowd Behavior Modeling: From Cellular Automata to Multi-Agent Systems , 2009, Multi-Agent Systems.

[26]  Jae-Eun Kim,et al.  The Built Environment and Health: Introducing Individual Space-Time Behavior , 2009, International journal of environmental research and public health.

[27]  J. M. Oakes,et al.  Invited commentary: rescuing Robinson Crusoe. , 2008, American journal of epidemiology.

[28]  Stefania Bandini,et al.  Crowd Behaviour Modeling: From Cellular Automata to Multi-Agent Systems , 2009 .

[29]  Pierre Blazevic,et al.  Mechatronic design of NAO humanoid , 2009, 2009 IEEE International Conference on Robotics and Automation.

[30]  D. Berrigan,et al.  Health and community design: the impact of the built environment on physical activity , 2005 .

[31]  Ronald L. Rivest,et al.  Introduction to Algorithms, third edition , 2009 .

[32]  George Drettakis,et al.  Bimodal perception of audio-visual material properties for virtual environments , 2010, TAP.

[33]  Ning Gu,et al.  Complexity, Human Agents, and Architectural Design , 2009 .

[34]  Luc Van Gool,et al.  Evaluation of 3D City Models Using Automatic Placed Urban Agents , 2009 .

[35]  Stephan K. Chalup,et al.  Humanoid robots for modelling and analysing visual gaze dynamics of pedestrians moving in urban space , 2011 .

[36]  Péter Molnár,et al.  Control of distributed autonomous robotic systems using principles of pattern formation in nature and pedestrian behavior , 2001, IEEE Trans. Syst. Man Cybern. Part B.

[37]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[38]  I ChristensenHenrik,et al.  Computational visual attention systems and their cognitive foundations , 2010, TAP 2010.