Where Am I Looking: Localizing Gaze In Reconstructed 3D Space

We propose a method for estimating an observer's 3D gaze on a scene using a portable eye tracker with a monocular camera. We reconstruct the scene with Structure from Motion (SfM) and use the camera pose and the 3D reconstruction to localize the observer's position and pose in the reconstructed space. Alongside this position, the eye tracker provides the observer's 2D gaze. Different people may observe the same 3D object from different positions and therefore have different perspectives of it. We fuse these multiple perspectives in 3D space to better understand how each observer perceives the same scene compared with others. Within the system, we developed a convolutional neural network to detect and track eye movement and a camera re-localization method to localize the camera in the 3D environment. Based on these novel eye tracking and camera re-localization methods, we can accurately localize gaze in the reconstructed 3D environment.
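The geometric step that links a 2D gaze point to the reconstructed scene can be illustrated with a minimal sketch. It assumes a pinhole camera with known intrinsics, a camera pose already recovered by re-localization, and a sparse SfM point cloud; the function names, the parameter layout, and the nearest-point-to-ray selection rule are illustrative assumptions, not the method described above.

```python
import math

def gaze_ray(gaze_px, intrinsics, R_wc, cam_center):
    """Back-project a 2D gaze pixel into a world-space ray.

    gaze_px:    (u, v) gaze position in the scene-camera image.
    intrinsics: (fx, fy, cx, cy) pinhole parameters (assumed known).
    R_wc:       3x3 rotation mapping camera coordinates to world coordinates.
    cam_center: camera centre in world coordinates.
    """
    fx, fy, cx, cy = intrinsics
    u, v = gaze_px
    # Direction in the camera frame: K^-1 applied to the homogeneous pixel.
    d_cam = ((u - cx) / fx, (v - cy) / fy, 1.0)
    # Rotate into the world frame and normalise to unit length.
    d_world = [sum(R_wc[i][j] * d_cam[j] for j in range(3)) for i in range(3)]
    norm = math.sqrt(sum(c * c for c in d_world))
    return tuple(cam_center), tuple(c / norm for c in d_world)

def nearest_point_on_ray(origin, direction, points):
    """Return the reconstructed point closest to the ray and its distance.

    Points behind the camera are ignored. Returns (None, inf) if no point
    lies in front of the camera.
    """
    best, best_dist = None, float("inf")
    for p in points:
        rel = [p[i] - origin[i] for i in range(3)]
        t = sum(rel[i] * direction[i] for i in range(3))  # depth along the ray
        if t <= 0:
            continue
        closest = [origin[i] + t * direction[i] for i in range(3)]
        dist = math.dist(p, closest)
        if dist < best_dist:
            best, best_dist = tuple(p), dist
    return best, best_dist

# Hypothetical example values: camera at the origin, looking along +Z.
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
cloud = [(0.0, 0.0, 2.0), (1.0, 0.0, 2.0), (0.0, 0.0, -1.0)]
origin, direction = gaze_ray((320.0, 240.0), (500.0, 500.0, 320.0, 240.0),
                             identity, (0.0, 0.0, 0.0))
gaze_point, offset = nearest_point_on_ray(origin, direction, cloud)
```

With a sparse point cloud the ray rarely passes exactly through a reconstructed point, so a practical system would likely apply a distance threshold or intersect the ray with a surface mesh instead.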
