Automatic feature extraction using CNN for robust active one-shot scanning

Active one-shot scanning techniques have been widely used for various applications. Stereo-based active one-shot scanning embeds a positional information regarding the image plane of a projector onto a projected pattern to retrieve correspondences entirely from a captured image. Many combinations of patterns and decoding algorithms for active one-shot scanning have been proposed. If the capturing environment lacks the assumed conditions, such as the absence of strong external lights, then reconstruction using those methods is degraded, because the pattern decoding fails. In this paper, we propose a general reconstruction algorithm that can be used for any kind of patterns without strict assumptions. The technique is based on an efficient feature extraction function that can drastically reduce redundant information from the raw pixel values of patches of captured images. Shapes are reconstructed by efficiently finding correspondences between a captured image and the pattern using low-dimensional feature vectors. Such a function is created automatically by a convolutional neural network using a large database of pattern images that are efficiently synthesized by using GPU with wide variation of depth and surface orientation. Experimental results show that our technique can be used for several existing patterns without any ad hoc algorithm or information regarding the scene or the sensor.

[1]  Gabriel Taubin,et al.  Robust one-shot 3D scanning using loopy belief propagation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[2]  Nikos Komodakis,et al.  Learning to compare image patches via convolutional neural networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Zhiwei Xiong,et al.  Depth Acquisition from Density Modulated Binary Patterns , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  日向 俊二 Kinect for Windowsアプリを作ろう , 2012 .

[5]  Joaquim Salvi,et al.  A state of the art in structured light patterns for surface profilometry , 2010, Pattern Recognit..

[6]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[7]  Yasushi Yagi,et al.  Grid-Based Active Stereo with Single-Colored Wave Pattern for Dense One-shot 3D Scan , 2012, 2012 Second International Conference on 3D Imaging, Modeling, Processing, Visualization & Transmission.

[8]  Suming Tang,et al.  Fuzzy decoding in color-coded structured light , 2014 .

[9]  Ryo Furukawa,et al.  Single colour one-shot scan using modified Penrose tiling pattern , 2013, IET Comput. Vis..

[10]  Pedro F. Felzenszwalb,et al.  Efficient belief propagation for early vision , 2004, CVPR 2004.

[11]  Cem Ünsalan,et al.  A Color Invariant Based Binary Coded Structured Light Range Scanner for Shiny Objects , 2010, 2010 20th International Conference on Pattern Recognition.

[12]  Xu Zhang,et al.  Color code identification in coded structured light. , 2012, Applied optics.

[13]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[14]  Yasushi Yagi,et al.  Dynamic scene shape reconstruction using a single structured light pattern , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Yann LeCun,et al.  Computing the stereo matching cost with a convolutional neural network , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  André Oosterlinck,et al.  Range Image Acquisition with a Single Binary-Encoded Light Pattern , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Hiroshi Murase,et al.  Visual learning and recognition of 3-d objects from appearance , 2005, International Journal of Computer Vision.

[18]  Shinsaku Hiura,et al.  Active One-Shot Scan for Wide Depth Range Using a Light Field Projector Based on Coded Aperture , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[19]  Hiroshi Murase,et al.  Learning and recognition of 3D objects from appearance , 1993, [1993] Proceedings IEEE Workshop on Qualitative Vision.

[20]  Mark R. Pickering,et al.  Accurate depth estimation using structured light and passive stereo disparity estimation , 2011, 2011 18th IEEE International Conference on Image Processing.

[21]  Nicolas Martin,et al.  Unstructured light scanning to overcome interreflections , 2011, 2011 International Conference on Computer Vision.

[22]  Takeo Kanade,et al.  Use of Fourier and Karhunen-Loeve decomposition for fast pattern matching with a large set of templates , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Patrick J. Flynn,et al.  A survey of approaches and challenges in 3D and multi-modal 3D + 2D face recognition , 2006, Comput. Vis. Image Underst..

[24]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[25]  Mark R. Pickering,et al.  Dense depth estimation using adaptive structured light and cooperative algorithm , 2011, CVPR 2011 WORKSHOPS.