Optimal Planning for Redirected Walking Based on Reinforcement Learning in Multi-user Environment with Irregularly Shaped Physical Space

Redirected Walking (RDW) enables users to walk in both virtual and physical tracking spaces simultaneously, which is an effective method to increase presence in Virtual Reality (VR). Recently, RDW technologies have been developed in a multi-user environment where multiple users share the same physical tracking space and simultaneously explore the same virtual space. Meanwhile, in the Steer-To-Optimal-Target (S2OT) method, user actions are planned in RDW by introducing machine learning models such as reinforcement learning. In this paper, we propose a new predictive RDW algorithm "Multiuser-Steer-to-Optimal-Target (MS2OT)" that extends the S2OT method into an environment with multiple users and various types of tracking space. In addition to the steering actions used in S2OT, MS2OT considers pre-reset actions and uses more steering targets and an improved reward function. The locations of multiple users and tracking space information are treated as visual information to be the state of the reinforcement learning model in MS2OT. Hence, the artificial neural network of a multilayer three-dimensional convolutional neural network with a dueling double deep network architecture is learned through Q-Learning. MS2OT significantly reduces the total number of resets compared to the conventional RDW algorithms such as S2C and APF-RDW in a multi-user environment and improves the total distance and average distance between resets during the same period. Experimental results show that MS2OT can process up to 32 users in real-time.

[1]  Dang-Yang Lee,et al.  Real-time Optimal Planning for Redirected Walking Using Deep Q-Learning , 2019, 2019 IEEE Conference on Virtual Reality and 3D User Interfaces (VR).

[2]  Evan Suma Rosenberg,et al.  An evaluation of strategies for two-user redirected walking in shared physical spaces , 2017, 2017 IEEE Virtual Reality (VR).

[3]  Rajiv V. Dubey,et al.  Point & Teleport Locomotion Technique for Virtual Reality , 2016, CHI PLAY.

[4]  Michael A. Zmuda,et al.  Collision prediction and prevention in a simultaneous two-user immersive virtual environment , 2013, 2013 IEEE Virtual Reality (VR).

[5]  Philip W. Fink,et al.  Obstacle avoidance during walking in real and virtual environments , 2007, TAP.

[6]  Andreas M. Kunz,et al.  Planning redirection techniques for optimal free walking experience using model predictive control , 2014, 2014 IEEE Symposium on 3D User Interfaces (3DUI).

[7]  Alec Radford,et al.  Proximal Policy Optimization Algorithms , 2017, ArXiv.

[8]  E. Hodgson,et al.  Optimizing Constrained-Environment Redirected Walking Instructions Using Search Techniques , 2013, IEEE Transactions on Visualization and Computer Graphics.

[9]  Alex Graves,et al.  Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[10]  Evan Suma Rosenberg,et al.  A General Reactive Algorithm for Redirected Walking Using Artificial Potential Functions , 2019, 2019 IEEE Conference on Virtual Reality and 3D User Interfaces (VR).

[11]  Timothy P. McNamara,et al.  Exploring large virtual environments with an HMD when physical space is limited , 2007, APGV.

[12]  Ming Yang,et al.  3D Convolutional Neural Networks for Human Action Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Evan Suma Rosenberg,et al.  Redirected Walking Strategies in Irregularly Shaped and Dynamic Physical Environments , 2018 .

[14]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[15]  Gerd Bruder,et al.  A taxonomy for deploying redirection techniques in immersive virtual environments , 2012, 2012 IEEE Virtual Reality Workshops (VRW).

[16]  Mary C. Whitton,et al.  15 Years of Research on Redirected Walking in Immersive Virtual Environments , 2018, IEEE Computer Graphics and Applications.

[17]  Gerd Bruder,et al.  Application of redirected walking in room-scale VR , 2017, 2017 IEEE Virtual Reality (VR).

[18]  Sebastian Scherer,et al.  VoxNet: A 3D Convolutional Neural Network for real-time object recognition , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[19]  Jennifer L. Campos,et al.  Vection and visually induced motion sickness: how are they related? , 2015, Front. Psychol..

[20]  Markus Lappe,et al.  Subliminal Reorientation and Repositioning in Immersive Virtual Environments using Saccadic Suppression , 2015, IEEE Transactions on Visualization and Computer Graphics.

[21]  Peter Stone,et al.  Deep Recurrent Q-Learning for Partially Observable MDPs , 2015, AAAI Fall Symposia.

[22]  Eric R. Bachmann,et al.  Multi-User Redirected Walking and Resetting Using Artificial Potential Fields , 2019, IEEE Transactions on Visualization and Computer Graphics.

[23]  Jennifer E. Fowlkes,et al.  Simulator Sickness Is Polygenic and polysymptomatic: Implications for Research , 1992 .

[24]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[25]  Tom Schaul,et al.  Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.

[26]  Gerd Bruder,et al.  Bending the Curve: Sensitivity to Bending of Curved Paths and Application in Room-Scale VR , 2017, IEEE Transactions on Visualization and Computer Graphics.

[27]  Wynne W. Chin The partial least squares approach for structural equation modeling. , 1998 .

[28]  Eduardo F. Morales,et al.  An Introduction to Reinforcement Learning , 2011 .

[29]  Mark T. Bolas,et al.  The redirected walking toolkit: a unified development platform for exploring large virtual environments , 2016, 2016 IEEE 2nd Workshop on Everyday Virtual Reality (WEVR).

[30]  Eric R. Bachmann,et al.  Comparing Four Approaches to Generalized Redirected Walking: Simulation and Live User Data , 2013, IEEE Transactions on Visualization and Computer Graphics.

[31]  Sharif Razzaque,et al.  Redirected Walking , 2001, Eurographics.

[32]  Klaus H. Hinrichs,et al.  Real Walking through Virtual Environments by Redirection Techniques , 2009, J. Virtual Real. Broadcast..

[33]  Eric R. Bachmann,et al.  Effects of Tracking Area Shape and Size on Artificial Potential Field Redirected Walking , 2019, 2019 IEEE Conference on Virtual Reality and 3D User Interfaces (VR).