Auto-Tuning Structured Light by Optical Stochastic Gradient Descent

We consider the problem of optimizing the performance of an active imaging system by automatically discovering the illuminations it should use, and the way to decode them. Our approach tackles two seemingly incompatible goals: (1) ''tuning'' the illuminations and decoding algorithm precisely to the devices at hand---to their optical transfer functions, non-linearities, spectral responses, image processing pipelines---and (2) doing so without modeling or calibrating the system; without modeling the scenes of interest; and without prior training data. The key idea is to formulate a stochastic gradient descent (SGD) optimization procedure that puts the actual system in the loop: projecting patterns, capturing images, and calculating the gradient of expected reconstruction error. We apply this idea to structured-light triangulation to ''auto-tune'' several devices---from smartphones and laser projectors to advanced computational cameras. Our experiments show that despite being model-free and automatic, optical SGD can boost system 3D accuracy substantially over state-of-the-art coding schemes.

[1]  Pierre Ambs,et al.  A short history of optical computing: rise, decline, and evolution , 2009, International Conference on Correlation Optics.

[2]  Qi Tian,et al.  DisturbLabel: Regularizing CNN on the Loss Layer , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Yoav Y. Schechner,et al.  Illumination Multiplexing within Fundamental Limits , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Samy Bengio,et al.  Understanding deep learning requires rethinking generalization , 2016, ICLR.

[5]  Felix Heide,et al.  Hyperparameter optimization in black-box image processing using differentiable proxies , 2019, ACM Trans. Graph..

[6]  Gordon Wetzstein,et al.  Snapshot difference imaging using correlation time-of-flight sensors , 2017, ACM Trans. Graph..

[7]  Matthew O'Toole,et al.  Temporal frequency probing for 5D transient analysis of global light transport , 2014, ACM Trans. Graph..

[8]  Gordon Wetzstein,et al.  Deep End-to-End Time-of-Flight Imaging , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[9]  David A. Clausi,et al.  Simultaneous Projector-Camera Self-Calibration for Three-Dimensional Reconstruction and Projection Mapping , 2017, IEEE Transactions on Computational Imaging.

[10]  Gordon Wetzstein,et al.  Hybrid optical-electronic convolutional neural networks with optimized diffractive optics for image classification , 2018, Scientific Reports.

[11]  Kiriakos N. Kutulakos,et al.  Optimal Structured Light a la Carte , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[12]  Kiriakos N. Kutulakos,et al.  Coded Two-Bucket Cameras for Computer Vision , 2018, ECCV.

[13]  Zhengyou Zhang,et al.  Parameter estimation techniques: a tutorial with application to conic fitting , 1997, Image Vis. Comput..

[14]  Roarke Horstmeyer,et al.  Convolutional neural networks that teach microscopes how to image , 2017, ArXiv.

[15]  Shree K. Nayar,et al.  Multiplexing for Optimal Lighting , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Jian Sun,et al.  Identity Mappings in Deep Residual Networks , 2016, ECCV.

[17]  Joaquim Salvi,et al.  ' s personal copy Efficient multiple phase shift patterns for dense 3 D acquisition in structured light scanning , 2010 .

[18]  Mohit Gupta,et al.  A Geometric Perspective on Structured Light Coding , 2018, ECCV.

[19]  Andreas Velten,et al.  High-depth-resolution range imaging with multiple-wavelength superheterodyne interferometry using 1550-nm lasers. , 2017, Applied optics.

[20]  Hans-Peter Seidel,et al.  Modulated phase-shifting for 3D scanning , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Wolfgang Heidrich,et al.  Low-budget transient imaging using photonic mixer devices , 2013, ACM Trans. Graph..

[22]  Ruigang Yang,et al.  Simultaneous Time-of-Flight sensing and photometric stereo with a single ToF sensor , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Junping Du,et al.  Noisy Softmax: Improving the Generalization Ability of DCNN via Postponing the Early Softmax Saturation , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Stephen P. Boyd,et al.  End-to-end optimization of optics and image processing for achromatic extended depth of field and super-resolution imaging , 2018, ACM Trans. Graph..

[25]  Marc Levoy,et al.  Synthetic aperture confocal imaging , 2004, ACM Trans. Graph..

[26]  Matthew O'Toole,et al.  Optical computing for fast light transport analysis , 2010, ACM Trans. Graph..

[27]  Ramesh Raskar,et al.  Resolving Multi-path Interference in Time-of-Flight Imaging via Modulation Frequency Diversity and Sparse Regularization , 2014, Optics letters.

[28]  Shuntaro Yamazaki,et al.  Exploiting DLP Illumination Dithering for Reconstruction and Photography of High-Speed Scenes , 2011, International Journal of Computer Vision.

[29]  Joaquim Salvi,et al.  Efficient and Low-Cost 3D Structured Light System Based on a Modified Number-Theoretic Approach , 2009, EURASIP J. Adv. Signal Process..

[30]  Gordon Wetzstein,et al.  Computational imaging with multi-camera time-of-flight systems , 2016, ACM Trans. Graph..

[31]  Nagarajan Natarajan,et al.  Learning with Noisy Labels , 2013, NIPS.

[32]  Xiang Peng,et al.  Structured light field 3D imaging. , 2016, Optics express.

[33]  Matthew O'Toole,et al.  Homogeneous codes for energy-efficient illumination and imaging , 2015, ACM Trans. Graph..

[34]  J. Goodman Introduction to Fourier optics , 1969 .

[35]  Ayan Chakrabarti,et al.  Learning Sensor Multiplexing Design through Back-propagation , 2016, NIPS.

[36]  Pietro Perona,et al.  3D photography on your desk , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[37]  Ramesh Raskar,et al.  Coded time of flight cameras , 2013, ACM Trans. Graph..

[38]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[39]  Gabriel Taubin,et al.  Embedded phase shifting: Robust phase shifting with embedded signals , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[41]  Nahum Kiryati,et al.  Toward optimal structured light patterns , 1997, Proceedings. International Conference on Recent Advances in 3-D Digital Imaging and Modeling (Cat. No.97TB100134).

[42]  Gordon Wetzstein,et al.  Doppler time-of-flight imaging , 2015, ACM Trans. Graph..

[43]  E. Leith,et al.  The evolution of information optics , 2000, IEEE Journal of Selected Topics in Quantum Electronics.

[44]  Shree K. Nayar,et al.  What Are Optimal Coding Functions for Time-of-Flight Imaging? , 2018, ACM Trans. Graph..

[45]  Quoc V. Le,et al.  Adding Gradient Noise Improves Learning for Very Deep Networks , 2015, ArXiv.

[46]  Baoxin Li,et al.  Automatic keystone correction for smart projectors with embedded camera , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[47]  William Whittaker,et al.  Epipolar time-of-flight imaging , 2017, ACM Trans. Graph..

[48]  Hao He,et al.  Exposure , 2017, ACM Trans. Graph..

[49]  Richard G. Baraniuk,et al.  Coherent inverse scattering via transmission matrices: Efficient phase retrieval algorithms and a public dataset , 2017, 2017 IEEE International Conference on Computational Photography (ICCP).

[50]  Ramesh Raskar,et al.  Rethinking Machine Vision Time of Flight With GHz Heterodyning , 2017, IEEE Access.

[51]  Ashok Veeraraghavan,et al.  A Practical Approach to 3D Scanning in the Presence of Interreflections, Subsurface Scattering and Defocus , 2013, International Journal of Computer Vision.

[52]  Wolfgang Heidrich,et al.  High Brightness HDR Projection Using Dynamic Freeform Lensing , 2016, ACM Trans. Graph..

[53]  Yuandong Tian,et al.  A Combined Theory of Defocused Illumination and Global Light Transport , 2012, International Journal of Computer Vision.

[54]  Tomislav Pribanic,et al.  On Tablet 3D Structured Light Reconstruction and Registration , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[55]  H. Robbins A Stochastic Approximation Method , 1951 .

[56]  Daniel G. Aliaga,et al.  Robust pixel classification for 3D modeling with structured light , 2007, GI '07.

[57]  Sebastian Nowozin,et al.  Bayesian Time-of-Flight for Realtime Shape, Illumination and Albedo , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[58]  Shree K. Nayar,et al.  Multiplexed illumination for scene recovery in the presence of global illumination , 2011, 2011 International Conference on Computer Vision.

[59]  Aswin C. Sankaranarayanan,et al.  KRISM—Krylov Subspace-based Optical Computing of Hyperspectral Images , 2018, ACM Trans. Graph..

[60]  Jie Zhang,et al.  Compact all-CMOS spatiotemporal compressive sensing video camera with pixel-wise coded exposure. , 2016, Optics express.

[61]  Jeffrey A. Davis,et al.  Complete polarization control of light from a liquid crystal spatial light modulator. , 2012, Optics express.

[62]  Gabriel Taubin,et al.  Unsynchronized structured light , 2015, ACM Trans. Graph..

[63]  Joaquim Salvi,et al.  Pattern codification strategies in structured light systems , 2004, Pattern Recognit..

[64]  Shahram Izadi,et al.  UltraStereo: Efficient Learning-Based Matching for Active Stereo Systems , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[65]  Syed Azer Reza,et al.  Practical Coding Function Design for Time-Of-Flight Imaging , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[66]  Ashok Veeraraghavan,et al.  High spatial resolution time-of-flight imaging , 2018, Commercial + Scientific Sensing and Imaging.

[67]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[68]  Song Wu,et al.  3 D ShapeNets : A Deep Representation for Volumetric Shape Modeling , 2015 .

[69]  Shree K. Nayar,et al.  Micro Phase Shifting , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[70]  Joaquim Salvi,et al.  A state of the art in structured light patterns for surface profilometry , 2010, Pattern Recognit..

[71]  Richard Szeliski,et al.  High-accuracy stereo depth maps using structured light , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..