Low-bandwidth 3D visual telepresence system

We present a methodology to develop a low-cost, low-bandwidth visual telepresence system using commodity depth sensors. To obtain a precise representation of the participants, we fuse together multiple views extracted using a deep background subtraction method. We build a proof-of-concept display composed of a video projector and a quadrangular pyramid made of acrylic, to demonstrate the visualization of a remote person without the need for head-mounted displays. Our system represents an attempt to democratize high-fidelity 3D telepresence using off-the-shelf components.

[1]  Juan R. Terven,et al.  Kin2. A Kinect 2 toolbox for MATLAB , 2016, Sci. Comput. Program..

[2]  Juan R. Terven,et al.  A multiple camera calibration and point cloud fusion tool for Kinect V2 , 2017, Sci. Comput. Program..

[3]  Mohamed Sedky,et al.  Image Processing: Object Segmentation Using Full-Spectrum Matching of Albedo Derived from Colour Images , 2010 .

[4]  Guillaume-Alexandre Bilodeau,et al.  A Self-Adjusting Approach to Change Detection Based on Background Word Consensus , 2015, 2015 IEEE Winter Conference on Applications of Computer Vision.

[5]  Rufael Mekuria,et al.  Emerging MPEG Standards for Point Cloud Compression , 2019, IEEE Journal on Emerging and Selected Topics in Circuits and Systems.

[6]  Rob Fergus,et al.  Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-scale Convolutional Architecture , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[7]  Cristian Ruz,et al.  2015 IEEE Winter Conference on Applications of Computer Vision, WACV 2015, Waikoloa, HI, USA, January 5-9, 2015 , 2015, WACV.

[8]  Simon J. D. Prince,et al.  Computer Vision: Models, Learning, and Inference , 2012 .

[9]  Hasan Sajid,et al.  Background subtraction for static & moving camera , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[10]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Gerhard Rigoll,et al.  A deep convolutional neural network for video sequence background subtraction , 2018, Pattern Recognit..

[12]  Dragi Tiro,et al.  The possibility of the hologram pyramid applying in the rapid prototyping , 2015, 2015 4th Mediterranean Conference on Embedded Computing (MECO).

[13]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[14]  Huiyu Zhou,et al.  Region-based Mixture of Gaussians modelling for foreground detection in dynamic scenes , 2015, Pattern Recognit..

[15]  Fatih Murat Porikli,et al.  Changedetection.net: A new change detection benchmark dataset , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[16]  Eyal Ofek,et al.  Room2Room: Enabling Life-Size Telepresence in a Projected Augmented Reality Environment , 2016, CSCW.

[17]  Bernd Fröhlich,et al.  Immersive Group-to-Group Telepresence , 2013, IEEE Transactions on Visualization and Computer Graphics.

[18]  Henry Fuchs,et al.  Encumbrance-free telepresence system with real-time 3D capture and display using commodity depth cameras , 2011, 2011 10th IEEE International Symposium on Mixed and Augmented Reality.

[19]  Duane C. Brown,et al.  Close-Range Camera Calibration , 1971 .

[20]  Irfan Siddavatam,et al.  3D Holographic Projections using Prism and Hand Gesture Recognition , 2015, ICARCSET '15.

[21]  Henry Fuchs,et al.  General-purpose telepresence with head-worn optical see-through displays and projector-based lighting , 2013, 2013 IEEE Virtual Reality (VR).

[22]  Guillaume-Alexandre Bilodeau,et al.  SuBSENSE: A Universal Change Detection Method With Local Adaptive Sensitivity , 2015, IEEE Transactions on Image Processing.

[23]  Ermal Dreshaj Holosuite : an exploration into interactive holographic telepresence , 2015 .

[24]  Hojun Lee,et al.  A hologram based tele-existence platform for emotional exchange among a group of users in both real and virtual environments , 2016, VRST.

[25]  Ling Shao,et al.  End-to-end video background subtraction with 3d convolutional neural networks , 2017, Multimedia Tools and Applications.

[26]  Ahmed K. Noor,et al.  Potential of multimodal and multiuser interaction with virtual holography , 2015, Adv. Eng. Softw..

[27]  Rodolfo Romero Herrera,et al.  Projection's Panel of models for touch screen , 2013 .

[28]  Gerhard Rigoll,et al.  Background segmentation with feedback: The Pixel-Based Adaptive Segmenter , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[29]  Ruigang Yang,et al.  3D Tele-Collaboration Over Internet2 , 2002 .

[30]  Ju Shen,et al.  An Immersive Telepresence System Using RGB-D Sensors and Head Mounted Display , 2015, 2015 IEEE International Symposium on Multimedia (ISM).

[31]  Charles T. Loop,et al.  Holoportation: Virtual 3D Teleportation in Real-time , 2016, UIST.

[32]  Rufael Mekuria,et al.  MP3DG-PCC, Open Source Software Framework for Implementation and Evaluation of Point Cloud Compression , 2016, ACM Multimedia.

[33]  D. Marquardt An Algorithm for Least-Squares Estimation of Nonlinear Parameters , 1963 .

[34]  Hyeyoung Yoo On Study of the Volumetric Display Techniques In Interactive Media Arts Proceedings , 2014 .

[35]  P. Blanche,et al.  Holographic three-dimensional telepresence using large-area photorefractive polymer , 2010, Nature.

[36]  Kenneth Levenberg A METHOD FOR THE SOLUTION OF CERTAIN NON – LINEAR PROBLEMS IN LEAST SQUARES , 1944 .

[37]  Hojun Lee,et al.  A mixed reality tele-presence platform to exchange emotion and sensory information based on MPEG-V standard , 2017, 2017 IEEE Virtual Reality (VR).

[38]  S. A. Khadar Proceedings of the 2015 International Conference on Advanced Research in Computer Science Engineering & Technology (ICARCSET 2015) , 2015 .

[39]  Henry Fuchs,et al.  Real-time volumetric 3D capture of room-sized scenes for telepresence , 2012, 2012 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[40]  D. C. Brown,et al.  Lens distortion for close-range photogrammetry , 1986 .

[41]  Simon J. D. Prince,et al.  Computer Vision: Index , 2012 .