Physics-based Neural Networks for Shape from Polarization

How should prior knowledge from physics inform a neural network solution? We study the blending of physics and deep learning in the context of Shape from Polarization (SfP). The classic SfP problem recovers an object's shape from polarized photographs of the scene. The SfP problem is special because the physical models are only approximate. Previous attempts to solve SfP have been purely model-based, and are susceptible to errors when real-world conditions deviate from the idealized physics. In our solution, there is a subtlety to combining physics and neural networks. Our final solution blends deep learning with synthetic renderings (derived from physics) in the framework of a two-stage encoder. The lessons learned from this exemplary problem foreshadow the future impact of physics-based learning.

[1]  Gordon Wetzstein,et al.  Single-photon 3D imaging with deep sensor fusion , 2018, ACM Trans. Graph..

[2]  O. Drbohlav,et al.  Unambiguous determination of shape from photometric stereo with unknown light sources , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[3]  Min H. Kim,et al.  Simultaneous acquisition of polarimetric SVBRDF and normals , 2018, ACM Trans. Graph..

[4]  Takanori Maehara,et al.  Neural Inverse Rendering for General Reflectance Photometric Stereo , 2018, ICML.

[5]  George Barbastathis,et al.  Low Photon Count Phase Retrieval Using Deep Learning. , 2018, Physical review letters.

[6]  Masashi Baba,et al.  Surface normal estimation of black specular objects from multiview polarization images , 2016 .

[7]  Ping Tan,et al.  A Benchmark Dataset and Evaluation for Non-Lambertian and Uncalibrated Photometric Stereo , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Aly A. Farag,et al.  Direct method for shape recovery from polarization and shading , 2012, 2012 19th IEEE International Conference on Image Processing.

[9]  Kalyan Sunkavalli,et al.  Learning to reconstruct shape and spatially-varying reflectance from a single image , 2018, ACM Trans. Graph..

[10]  Kalyan Sunkavalli,et al.  Materials for Masses: SVBRDF Acquisition with a Single Mobile Phone Image , 2018, ECCV.

[11]  Katsushi Ikeuchi,et al.  Polarization-based inverse rendering from a single view , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[12]  Stefano Ermon,et al.  Label-Free Supervision of Neural Networks with Physics and Domain Knowledge , 2016, AAAI.

[13]  Edwin R. Hancock,et al.  Multi-view surface reconstruction using polarization , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[14]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[15]  Rin-ichiro Taniguchi,et al.  Shape and light directions from shading and polarization , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Katsushi Ikeuchi,et al.  Transparent surface modeling from a pair of polarization images , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Yasuyuki Matsushita,et al.  Deep Photometric Stereo Network , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[18]  William A. P. Smith,et al.  Linear Depth Estimation from an Uncalibrated, Monocular Polarisation Image , 2016, ECCV.

[19]  Gordon Wetzstein,et al.  Deep End-to-End Time-of-Flight Imaging , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[20]  Ramesh Raskar,et al.  Data-Driven Non-Line-of-Sight Imaging With A Traditional Camera , 2018 .

[21]  Ramesh Raskar,et al.  Depth Sensing Using Geometrically Constrained Polarization Normals , 2017, International Journal of Computer Vision.

[22]  Shuang Zhao,et al.  Inverse Transport Networks , 2018, ArXiv.

[23]  Yang Liu,et al.  Physics-Based Generative Adversarial Models for Image Restoration and Beyond , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Satoshi Ikehata,et al.  CNN-PS: CNN-based Photometric Stereo for General Non-Convex Surfaces , 2018, ECCV.

[25]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[26]  Cong Phuoc Huynh,et al.  Shape and Refractive Index from Single-View Spectro-Polarimetric Images , 2012, International Journal of Computer Vision.

[27]  Jan Kautz,et al.  Polarimetric Multi-view Stereo , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Ramesh Raskar,et al.  Dynamic heterodyne interferometry , 2018, 2018 IEEE International Conference on Computational Photography (ICCP).

[29]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[30]  Lawrence B. Wolff,et al.  Polarization vision: a new sensory approach to image understanding , 1997, Image Vis. Comput..

[31]  Zhe Wu,et al.  A Benchmark Dataset and Evaluation for Non-Lambertian and Uncalibrated Photometric Stereo , 2019, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  Paul E. Debevec,et al.  Multiview face capture using polarized spherical gradient illumination , 2011, ACM Trans. Graph..

[33]  Ramesh Raskar,et al.  Object classification through scattering media with deep learning on time resolved measurement. , 2017, Optics express.

[34]  Pieter Peers,et al.  Rapid Acquisition of Specular and Diffuse Normal Maps from Polarized Spherical Gradient Illumination , 2007 .

[35]  Paul E. Debevec,et al.  Rendering synthetic objects into real scenes: bridging traditional and image-based graphics with global illumination and high dynamic range photography , 1998, SIGGRAPH '08.

[36]  Pieter Peers,et al.  Circularly polarized spherical illumination reflectometry , 2010, ACM Trans. Graph..

[37]  Ping Tan,et al.  Polarimetric Dense Monocular SLAM , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[38]  Ramesh Raskar,et al.  Polarized 3D: High-Quality Depth Sensing with Polarization Cues , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[39]  Xiao Li,et al.  Single Image Surface Appearance Modeling with Self‐augmented CNNs and Inexact Supervision , 2018, Comput. Graph. Forum.

[40]  Michael Unser,et al.  Deep Convolutional Neural Network for Inverse Problems in Imaging , 2016, IEEE Transactions on Image Processing.

[41]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[42]  William A. P. Smith,et al.  Height-from-Polarisation with Unknown Lighting or Albedo , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  Andrew L. Maas Rectifier Nonlinearities Improve Neural Network Acoustic Models , 2013 .

[44]  Gordon Wetzstein,et al.  Unrolled Optimization with Deep Priors , 2017, ArXiv.

[45]  Jan Kautz,et al.  Reblur2Deblur: Deblurring videos via self-supervised learning , 2018, 2018 IEEE International Conference on Computational Photography (ICCP).

[46]  Gary A. Atkinson,et al.  High-sensitivity analysis of polarization by surface reflection , 2018, Machine Vision and Applications.

[47]  Xiao Li,et al.  Modeling surface appearance from a single photograph using self-augmented convolutional neural networks , 2017, ACM Trans. Graph..

[48]  Eugenio Culurciello,et al.  LinkNet: Exploiting encoder representations for efficient semantic segmentation , 2017, 2017 IEEE Visual Communications and Image Processing (VCIP).

[49]  Larry H. Matthies,et al.  Depth from stereo polarization in specular scenes for urban robotics , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[50]  Adrien Bousseau,et al.  Single-image SVBRDF capture with a rendering-aware deep network , 2018, ACM Trans. Graph..

[51]  Ramesh Raskar,et al.  Flash Photography for Data-Driven Hidden Scene Recovery , 2018, ArXiv.

[52]  Gary A. Atkinson,et al.  Recovery of surface orientation from diffuse polarization , 2006, IEEE Transactions on Image Processing.

[53]  Gary A. Atkinson,et al.  Polarisation photometric stereo , 2017, Comput. Vis. Image Underst..

[54]  Ming-Hsuan Yang,et al.  Learning a Discriminative Prior for Blind Image Deblurring , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[55]  Anuj Karpatne,et al.  Physics-guided Neural Networks (PGNN): An Application in Lake Temperature Modeling , 2017, ArXiv.

[56]  Soon-Jo Chung,et al.  Neural Lander: Stable Drone Landing Control Using Learned Dynamics , 2018, 2019 International Conference on Robotics and Automation (ICRA).

[57]  Pieter Peers,et al.  Estimating Surface Normals from Spherical Stokes Reflectance Fields , 2012, ECCV Workshops.

[58]  Cong Phuoc Huynh,et al.  Shape and refractive index recovery from single-view polarisation images , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[59]  Edwin R. Hancock,et al.  Linear Differential Constraints for Photo-Polarimetric Height Estimation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[60]  George Barbastathis,et al.  High-resolution limited-angle phase tomography of dense layered objects using deep neural networks , 2018, Proceedings of the National Academy of Sciences.

[61]  Carlos D. Castillo,et al.  SfSNet: Learning Shape, Reflectance and Illuminance of Faces 'in the Wild' , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[62]  Kai Han,et al.  PS-FCN: A Flexible Learning Framework for Photometric Stereo , 2018, ECCV.

[63]  Yisong Yue,et al.  Coordinated Multi-Agent Imitation Learning , 2017, ICML.