MARS: parallelism-based metrically accurate 3D reconstruction system in real-time

Due to the increasing application demands, lightweight device-based 3D recovery draws many attentions from a wide group of researchers in both academic and industrial fields. The current 3D reconstruction solutions are commonly achieved either using depth data or RGB data. The depth data usually come from a deliberately designed hardware for specific tasks, while the RGB data-based solutions only employ a single RGB camera with vision-based computing algorithms. Limitations are expected from both. Depth sensors are commonly either bulky or relatively expensive compared to RGB cameras, thus of less flexibility. Normal RGB cameras usually have better mobility but less accuracy in 3D sensing than depth sensors. Recently, machine learning based depth estimation has also been presented. However, its accuracy is still limited. To improve the flexibility of the 3D reconstruction system without loss in accuracy, this paper presents a solution of unconstrained Metrically Accurate 3D Reconstruction System (MARS) for 3D sensing based on a consumer-grade camera. With a simple initialization from a depth map, the system can achieve incremental 3D reconstruction with a stable metric scale. Experiments are conducted using both real-world data and public datasets. Competitive results are obtained using the proposed system compared with several existing methods.

[1]  Andrew J. Davison,et al.  A benchmark for RGB-D visual odometry, 3D reconstruction and SLAM , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[2]  Noah Snavely,et al.  Unsupervised Learning of Depth and Ego-Motion from Video , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Rong Xie,et al.  Real-time UHD video super-resolution and transcoding on heterogeneous hardware , 2019, Journal of Real-Time Image Processing.

[4]  Jitendra Malik,et al.  Shape Estimation from Shading, Defocus, and Correspondence Using Light-Field Angular Coherence , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Ludovic Magerand,et al.  Revisiting Projective Structure from Motion: A Robust and Efficient Incremental Solution , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Daniel Cremers,et al.  Dense visual SLAM for RGB-D cameras , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[7]  Sebastian Nowozin,et al.  Bayesian Time-of-Flight for Realtime Shape, Illumination and Albedo , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Stefan Leutenegger,et al.  ElasticFusion: Dense SLAM Without A Pose Graph , 2015, Robotics: Science and Systems.

[9]  Albert S. Huang,et al.  Visual Odometry and Mapping for Autonomous Flight Using an RGB-D Camera , 2011, ISRR.

[10]  Houbing Song,et al.  Digital image watermarking method based on DCT and fractal encoding , 2017, IET Image Process..

[11]  Changhe Zhou,et al.  Colorful 3-D Imaging Using an Infrared Dammann Grating , 2016, IEEE Transactions on Industrial Informatics.

[12]  Houbing Song,et al.  A 2-D Discrete-Time Model of Physical Impairments in Wavelength-Division Multiplexing Systems , 2012, Journal of Lightwave Technology.

[13]  Andreas Zell,et al.  Multi-camera visual SLAM for autonomous navigation of micro aerial vehicles , 2017, Robotics Auton. Syst..

[14]  Houbing Song,et al.  Range of Influence and Impact of Physical Impairments in Long-Haul DWDM Systems , 2013, Journal of Lightwave Technology.

[15]  Lei Shi,et al.  A Compressive Sensing-Based Approach to End-to-End Network Traffic Reconstruction , 2020, IEEE Transactions on Network Science and Engineering.

[16]  Junyu Dong,et al.  Dense 3D facial reconstruction from a single depth image in unconstrained environment , 2017, Virtual Reality.

[17]  Gary R. Bradski,et al.  ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.

[18]  Tomislav Petković,et al.  An evaluation of real-time RGB-D visual odometry algorithms on mobile devices , 2017, Journal of Real-Time Image Processing.

[19]  Houbing Song,et al.  Internet of Things and Big Data Analytics for Smart and Connected Communities , 2016, IEEE Access.

[20]  Olivier Stasse,et al.  MonoSLAM: Real-Time Single Camera SLAM , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Wolfram Burgard,et al.  An evaluation of the RGB-D SLAM system , 2012, 2012 IEEE International Conference on Robotics and Automation.

[22]  Roy Kelner,et al.  Three-Dimensional Imaging by Self-Reference Single-Channel Digital Incoherent Holography , 2016, IEEE Transactions on Industrial Informatics.

[23]  G. Klein,et al.  Parallel Tracking and Mapping for Small AR Workspaces , 2007, 2007 6th IEEE and ACM International Symposium on Mixed and Augmented Reality.

[24]  Matthias Nießner,et al.  BundleFusion , 2016, TOGS.

[25]  Zhanyi Hu,et al.  Learning Depth From Single Images With Deep Neural Network Embedding Focal Length , 2018, IEEE Transactions on Image Processing.

[26]  Vladlen Koltun,et al.  Robust reconstruction of indoor scenes , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Andrew W. Fitzgibbon,et al.  KinectFusion: Real-time dense surface mapping and tracking , 2011, 2011 10th IEEE International Symposium on Mixed and Augmented Reality.

[28]  Diego González-Aguilera,et al.  Feasibility Study of a Structured Light System Applied to Welding Inspection Based on Articulated Coordinate Measure Machine Data , 2017, IEEE Sensors Journal.

[29]  Jörg Stückler,et al.  Multi-resolution surfel maps for efficient dense 3D modeling and tracking , 2014, J. Vis. Commun. Image Represent..

[30]  John J. Leonard,et al.  Robust real-time visual odometry for dense RGB-D mapping , 2013, 2013 IEEE International Conference on Robotics and Automation.

[31]  Andrew J. Davison,et al.  DTAM: Dense tracking and mapping in real-time , 2011, 2011 International Conference on Computer Vision.

[32]  Hideo Saito,et al.  Real-time scene reconstruction and triangle mesh generation using multiple RGB-D cameras , 2017, Journal of Real-Time Image Processing.

[33]  Pablo Bauszat,et al.  Indoor Scene Reconstruction Using Near-Light Photometric Stereo , 2017, IEEE Transactions on Image Processing.

[34]  Max Mulder,et al.  Obstacle Avoidance Strategy using Onboard Stereo Vision on a Flapping Wing MAV , 2016, IEEE Transactions on Robotics.

[35]  Ian D. Reid,et al.  Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Zhihan Lv,et al.  Virtual Reality Smart City Based on WebVRGIS , 2016, IEEE Internet of Things Journal.

[37]  V. Lepetit,et al.  EPnP: An Accurate O(n) Solution to the PnP Problem , 2009, International Journal of Computer Vision.