Exploring RGB-D Cameras for 3D Reconstruction of Cultural Heritage

RGB-D cameras have a great potential to solve several problems arising during the digitization of objects, such as cultural heritage. Three-dimensional (3D) digital preservation is usually performed with the use of high-end 3D scanners, as the 3D points generated by this type of equipment are in average millimeter up to sub-millimeter accurate. The downside of 3D scanners, in addition to the high cost, is the infrastructure requirements. It requires its own source of energy, a large workspace with tripods, special training to calibrate and operate the equipment, and high acquisition time, potentially taking several minutes for capturing a single image. An alternative is the use of low-cost depth cameras that are easy to operate and only require connection to a laptop and a source of energy. There are several recent studies showing the potential of RGB-D sensors. However, they often exhibit errors when applied to a full 360 degrees 3D reconstruction setup, known as the loop closure problem. This kind of error accumulation is intensified by the lower accuracy and large volume of data generated by RGB-D cameras. This article proposes a complete methodology for 3D reconstruction based on RGB-D sensors. To mitigate the loop closure effect, a pairwise alignment method was developed. The proposed approach expands the connectivity graph connections in a pairwise alignment system, by automatically discovering new pairs of meshes with overlapping regions. Then the alignment is more evenly distributed over the aligned pairs, avoiding the loop closure problem of full 3D reconstructions. The experiments were performed on a collection of 30 artworks made by the Baroque artist Antonio Francisco Lisboa, known as Aleijadinho, as part of the Aleijadinho Digital project conducted in partnership with IPHAN (Brazilian National Institute for Cultural and Artistic Heritage) and United Nations Educational, Scientific and Cultural Organization (UNESCO). Experimental results show 3D models that are favorably compared to state-of-the-art methods available in the literature using RGD-D sensors. The main contributions of this work are: a new method for 3D alignment dedicated to attenuate the RGB-D camera loop closure problem; the development and disclosure of a complete, practical solution for 3D reconstruction of artworks; and the construction of 3D digital models of an important and challenging collection of Brazilian cultural heritage, made accessible by a virtual museum.

[1]  Juho Kannala,et al.  Joint Depth and Color Camera Calibration with Distortion Correction , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Gérard G. Medioni,et al.  Object modelling by registration of multiple range images , 1992, Image Vis. Comput..

[3]  Ozgur Yilmaz,et al.  Stereo and kinect fusion for continuous 3D reconstruction and visual odometry , 2013, 2013 International Conference on Electronics, Computer and Computation (ICECCO).

[4]  Yutaka Ohtake,et al.  A comparison of mesh smoothing methods , 2003 .

[5]  TheobaltChristian,et al.  Coherent Spatiotemporal Filtering, Upsampling and Rendering of RGBZ Videos , 2012 .

[6]  Scott Cohen,et al.  Stereo+Kinect for High Resolution Stereo Correspondences , 2013, 2013 International Conference on 3D Vision.

[7]  Radu Bogdan Rusu,et al.  3D is here: Point Cloud Library (PCL) , 2011, 2011 IEEE International Conference on Robotics and Automation.

[8]  Guido M. Cortelazzo,et al.  Handheld scanning with 3D cameras , 2013, 2013 IEEE 15th International Workshop on Multimedia Signal Processing (MMSP).

[9]  Vladlen Koltun,et al.  Dense scene reconstruction with points of interest , 2013, ACM Trans. Graph..

[10]  Yang Bai,et al.  A Study in 3D-Reconstruction Using Kinect Sensor , 2012 .

[11]  Shahram Izadi,et al.  MonoFusion: Real-time 3D reconstruction of small scenes with a single web camera , 2013, 2013 IEEE International Symposium on Mixed and Augmented Reality (ISMAR).

[12]  Ian D. Reid,et al.  STAR3D: Simultaneous Tracking and Reconstruction of 3D Objects Using RGB-D Data , 2013, 2013 IEEE International Conference on Computer Vision.

[13]  Andrew W. Fitzgibbon,et al.  Real-time non-rigid reconstruction using an RGB-D camera , 2014, ACM Trans. Graph..

[14]  Sander Oude Elberink,et al.  Accuracy and Resolution of Kinect Depth Data for Indoor Mapping Applications , 2012, Sensors.

[15]  David Fofi,et al.  A review of recent range image registration methods with accuracy evaluation , 2007, Image Vis. Comput..

[16]  Didier Stricker,et al.  Algorithms for 3D Shape Scanning with a Depth Camera , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Michael M. Kazhdan,et al.  Poisson surface reconstruction , 2006, SGP '06.

[18]  Minh N. Do,et al.  Registration and integration of multiple depth images using signed distance function , 2012, Electronic Imaging.

[19]  Derek D. Lichti,et al.  Photogrammetric Bundle Adjustment With Self-Calibration of the PrimeSense 3D Camera Technology: Microsoft Kinect , 2013, IEEE Access.

[20]  Tomás Pajdla,et al.  3D with Kinect , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[21]  Luciano Silva,et al.  3D reconstruction methods for digital preservation of cultural heritage: A survey , 2014, Pattern Recognit. Lett..

[22]  Matthias Nießner,et al.  Real-time 3D reconstruction at scale using voxel hashing , 2013, ACM Trans. Graph..

[23]  Olga R. P. Bellon,et al.  Real-time acquisition and super-resolution techniques on 3D reconstruction , 2013, 2013 IEEE International Conference on Image Processing.

[24]  Masatoshi Ishikawa,et al.  High-resolution surface reconstruction based on multi-level implicit surface from multiple range images , 2013, 2013 IEEE International Conference on Image Processing.

[25]  Gary K. L. Tam,et al.  Registration of 3D Point Clouds and Meshes: A Survey from Rigid to Nonrigid , 2013, IEEE Transactions on Visualization and Computer Graphics.

[26]  Weihai Chen,et al.  Fast 3D modeling in complex environments using a single Kinect sensor , 2014 .

[27]  Paolo Cignoni,et al.  Pushing time-of-flight scanners to the limit , 2009, VAST'09.

[28]  Joachim Hertzberg,et al.  An Explicit Loop Closing Technique for 6D SLAM , 2009, ECMR.

[29]  Guido M. Cortelazzo,et al.  3D scanning of cultural heritage with consumer depth cameras , 2016, Multimedia Tools and Applications.

[30]  Marc Levoy,et al.  Efficient variants of the ICP algorithm , 2001, Proceedings Third International Conference on 3-D Digital Imaging and Modeling.

[31]  Petros Daras,et al.  Real-Time, Full 3-D Reconstruction of Moving Foreground Objects From Multiple Consumer Depth Cameras , 2013, IEEE Transactions on Multimedia.

[32]  Kari Pulli,et al.  Multiview registration for large data sets , 1999, Second International Conference on 3-D Digital Imaging and Modeling (Cat. No.PR00062).

[33]  Maurício Pamplona Segundo,et al.  Automating 3D reconstruction pipeline by surf-based alignment , 2012, 2012 19th IEEE International Conference on Image Processing.

[34]  Hans-Peter Seidel,et al.  Coherent Spatiotemporal Filtering, Upsampling and Rendering of RGBZ Videos , 2012, Comput. Graph. Forum.

[35]  Xuan Song,et al.  When 3D Reconstruction Meets Ubiquitous RGB-D Images , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[36]  Luciano Silva,et al.  3D reconstruction of cultural heritages: Challenges and advances on precise mesh integration , 2012, Comput. Vis. Image Underst..

[37]  A. N. Rajagopalan,et al.  Range map superresolution-inpainting, and reconstruction from sparse data , 2012, Comput. Vis. Image Underst..

[38]  Bo Jiang,et al.  A divide-and-conquer approach to large scene reconstruction with interactive scene analysis and segmentation , 2013, VRCAI '13.

[39]  John J. Leonard,et al.  Deformation-based loop closure for large scale dense RGB-D SLAM , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[40]  Yutaka Ishibashi,et al.  Weighted Joint Bilateral Filter with Slope Depth Compensation Filter for Depth Map Refinement , 2013, VISAPP.

[41]  Kai Liu,et al.  A Two-Stage Framework for 3D FaceReconstruction from RGBD Images , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  John J. Leonard,et al.  Kintinuous: Spatially Extended KinectFusion , 2012, AAAI 2012.

[43]  Yasushi Makihara,et al.  Dynamic scene reconstruction using asynchronous multiple Kinects , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[44]  Jiawen Chen,et al.  Scalable real-time volumetric surface reconstruction , 2013, ACM Trans. Graph..

[45]  Ian D. Reid,et al.  Dense Reconstruction Using 3D Object Shape Priors , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[47]  Christian Siegl,et al.  Low-Cost Real-Time 3D Reconstruction of Large-Scale Excavation Sites using an RGB-D Camera , 2014, GCH.

[48]  Olaf Kähler,et al.  A Framework for the Volumetric Integration of Depth Images , 2014, ArXiv.

[49]  Kurt Konolige,et al.  Change Their Perception: RGB-D for 3-D Modeling and Recognition , 2013, IEEE Robotics & Automation Magazine.

[50]  Venu Madhav Govindu,et al.  A pipeline for building 3D models using depth cameras , 2012, ICVGIP '12.

[51]  Özgür Yilmaz,et al.  Stereo and KinectFusion for continuous 3D reconstruction and visual odometry , 2016 .

[52]  Andrew W. Fitzgibbon,et al.  KinectFusion: real-time 3D reconstruction and interaction using a moving depth camera , 2011, UIST.

[53]  Daniel Cremers,et al.  Large-Scale Multi-resolution Surface Reconstruction from RGB-D Sequences , 2013, 2013 IEEE International Conference on Computer Vision.

[54]  Luciano Silva,et al.  3D preserving xviii century barroque masterpiece: Challenges and results on the digital preservation of Aleijadinho’s sculpture of the Prophet Joel , 2012 .

[55]  Paul Newman,et al.  Detecting Loop Closure with Scene Sequences , 2007, International Journal of Computer Vision.

[56]  Shahram Izadi,et al.  Real-time shading-based refinement for consumer depth cameras , 2014, ACM Trans. Graph..

[57]  Kang Chen,et al.  Automatic semantic modeling of indoor scenes from low-quality RGB-D data using contextual information , 2014, ACM Trans. Graph..

[58]  In-So Kweon,et al.  High Quality Shape from a Single RGB-D Image under Uncalibrated Natural Illumination , 2013, 2013 IEEE International Conference on Computer Vision.

[59]  Elise Lachat,et al.  Assessment and Calibration of a RGB-D Camera (Kinect v2 Sensor) Towards a Potential Use for Close-Range 3D Modeling , 2015, Remote. Sens..