Learning-based feature extraction for active 3D scan with reducing color crosstalk of multiple pattern projections

3D reconstruction methods based on active stereo technique have been widely used for many practical systems. Many of these systems are configured with a single camera and a single projector. Since such systems can only capture one side of the target object, several attempts have been conducted to enlarge the captured area, especially multi-projector systems attract many researchers. For multi-projector based systems, overlap between multiple pattern projections is a serious problem. Even if different color channels are used for each projector, complete separation is not possible because of color crosstalks. Another open problem is decoding errors of the projected patterns, which causes a failure on extracting positional information of the projected pattern form the captured image. Among several reasons for such errors, color crosstalks are crucial because their features are similar to the main signal and difficult to be decomposed. In this paper, we solve these problems by utilizing machine learning techniques where a convolutional neural network is trained to extract low dimensional pattern features for each projector. In addition, it is trained to suppress the color crosstalks from different projectors. Using this new technique, we succeeded in reconstructing 3D shapes from images where multiple patterns are overlapped.

[1]  Yasushi Yagi,et al.  Dynamic scene shape reconstruction using a single structured light pattern , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Yann LeCun,et al.  Computing the stereo matching cost with a convolutional neural network , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Shinsaku Hiura,et al.  Active One-Shot Scan for Wide Depth Range Using a Light Field Projector Based on Coded Aperture , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[4]  Young Min Kim,et al.  Design and calibration of a multi-view TOF sensor fusion system , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[5]  Sergio Orts,et al.  HyperDepth: Learning Depth from Structured Light without Matching , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Yasushi Makihara,et al.  Dynamic scene reconstruction using asynchronous multiple Kinects , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[7]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[8]  Pedro F. Felzenszwalb,et al.  Efficient belief propagation for early vision , 2004, CVPR 2004.

[9]  Mark R. Pickering,et al.  Dense depth estimation using adaptive structured light and cooperative algorithm , 2011, CVPR 2011 WORKSHOPS.

[10]  Nicolas Martin,et al.  Unstructured light scanning to overcome interreflections , 2011, 2011 International Conference on Computer Vision.

[11]  Nozomu Kasuya,et al.  One-Shot Entire Shape Scanning by Utilizing Multiple Projector-Camera Constraints of Grid Patterns , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[12]  Mark R. Pickering,et al.  Accurate depth estimation using structured light and passive stereo disparity estimation , 2011, 2011 18th IEEE International Conference on Image Processing.

[13]  Qionghai Dai,et al.  Fusing Multiview and Photometric Stereo for 3D Reconstruction under Uncalibrated Illumination , 2011, IEEE Transactions on Visualization and Computer Graphics.

[14]  Joaquim Salvi,et al.  A state of the art in structured light patterns for surface profilometry , 2010, Pattern Recognit..

[15]  Cem Ünsalan,et al.  A Color Invariant Based Binary Coded Structured Light Range Scanner for Shiny Objects , 2010, 2010 20th International Conference on Pattern Recognition.

[16]  Xu Zhang,et al.  Color code identification in coded structured light. , 2012, Applied optics.

[17]  Suming Tang,et al.  Fuzzy decoding in color-coded structured light , 2014 .

[18]  Ryo Furukawa,et al.  Single colour one-shot scan using modified Penrose tiling pattern , 2013, IET Comput. Vis..

[19]  Gabriel Taubin,et al.  Robust one-shot 3D scanning using loopy belief propagation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[20]  Nikos Komodakis,et al.  Learning to compare image patches via convolutional neural networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Jovan Popović,et al.  Dynamic shape capture using multi-view photometric stereo , 2009, SIGGRAPH 2009.