Sensor Fusion for Learning-based Tracking of Controller Movement in Virtual Reality

Inside-out pose tracking of hand-held controllers is an important problem in virtual reality devices. Current state-of-the-art combines a constellation of light-emitting diodes on controllers with a stereo pair of cameras on the head-mounted display (HMD) to track pose. These vision-based systems are unable to track controllers when they move out of the camera’s field-of-view (out-of-FOV). To overcome this limitation, we employ sensor fusion and a learning-based model. Specifically, we employ ultrasound sensors on the HMD and controllers to obtain ranging information. We combine this information with predictions from an auto-regressive forecasting model that is built with a recurrent neural network. The combination is achieved via a Kalman filter across different positional states (including out-of-FOV). With the proposed approach, we demonstrate near-isotropic accuracy levels ($\sim$ 1.23 cm error) in estimating controller position, which was not possible to achieve before with camera-alone tracking.

[1]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[2]  Klaus Dorfmüller,et al.  Real-Time Hand and Head Tracking for Virtual Environments Using Infrared Beacons , 1998, CAPTECH.

[3]  Eric Foxlin,et al.  Inertial head-tracker sensor fusion by a complementary separate-bias Kalman filter , 1996, Proceedings of the IEEE 1996 Virtual Reality Annual International Symposium.

[4]  Klaus Dorfmüller,et al.  Robust tracking for augmented reality using retroreflective markers , 1999, Comput. Graph..

[5]  Anthony M. Cook The helmet-mounted visual system in flight simulation , 1988 .

[6]  Klaus Dorfmüller An Optical Tracking System for VR/AR-Applications , 1999, EGVE.

[7]  Ronald Azuma,et al.  Tracking requirements for augmented reality , 1993, CACM.

[8]  Michael J. Black,et al.  On Human Motion Prediction Using Recurrent Neural Networks , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Markus König,et al.  Low-Cost Virtual Reality Environment for Engineering and Construction , 2015 .

[10]  Timo Teräsvirta,et al.  Smooth transition autoregressive models - A survey of recent developments , 2000 .

[11]  F. Raab,et al.  Magnetic Position and Orientation Tracking System , 1979, IEEE Transactions on Aerospace and Electronic Systems.

[12]  Niels Henze,et al.  Gesture recognition with a Wii controller , 2008, TEI.

[13]  Robert B. McGhee,et al.  An extended Kalman filter for quaternion-based orientation estimation using MARG sensors , 2001, Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No.01CH37180).

[14]  Gang Wang,et al.  Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition , 2016, ECCV.

[15]  Hirokazu Kato,et al.  Marker tracking and HMD calibration for a video-based augmented reality conferencing system , 1999, Proceedings 2nd IEEE and ACM International Workshop on Augmented Reality (IWAR'99).

[16]  Jürgen Schmidhuber,et al.  Learning to Forget: Continual Prediction with LSTM , 2000, Neural Computation.

[17]  Dieter Kranzlmuller,et al.  State of the art of virtual reality technology , 2016, 2016 IEEE Aerospace Conference.