Paper information - Modeling surface appearance from a single photograph using self-augmented convolutional neural networks

We present a convolutional neural network (CNN) based solution for modeling physically plausible spatially varying surface reflectance functions (SVBRDFs) from a single photograph of a planar material sample under unknown natural illumination. Gathering a sufficiently large set of labeled training pairs, each consisting of a photograph of an SVBRDF sample and its corresponding reflectance parameters, is a difficult and arduous process. To reduce the amount of required labeled training data, we propose to leverage the appearance information embedded in unlabeled images of spatially varying materials to self-augment the training process. Starting from an initial approximate network obtained from a small set of labeled training pairs, we estimate provisional model parameters for each unlabeled training exemplar. Given this provisional reflectance estimate, we then synthesize a novel temporary labeled training pair by rendering the exact corresponding image under a new lighting condition. After refining the network using these additional training samples, we re-estimate the provisional model parameters for the unlabeled data and repeat the self-augmentation process until convergence. We demonstrate the efficacy of the proposed network structure on spatially varying wood, metals, and plastics, and thoroughly validate the effectiveness of the self-augmentation training process.
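The self-augmentation loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the CNN is replaced by a linear least-squares regressor, the renderer by a toy linear image formation model, and the names `render`, `fit`, `self_augment`, and `basis` are hypothetical, introduced only for illustration.

```python
import numpy as np


def render(params, light, basis):
    """Toy stand-in for a renderer (assumption): image = light * (params @ basis)."""
    return light[:, None] * (params @ basis)


def fit(images, params):
    """Toy stand-in for network training (assumption): linear least squares."""
    weights, *_ = np.linalg.lstsq(images, params, rcond=None)
    return weights


def self_augment(labeled_images, labeled_params, unlabeled_images, basis,
                 rounds=10, tol=1e-6, seed=0):
    rng = np.random.default_rng(seed)
    # Initial approximate model from the small labeled set.
    weights = fit(labeled_images, labeled_params)
    for _ in range(rounds):
        # 1. Estimate provisional parameters for each unlabeled exemplar.
        provisional = unlabeled_images @ weights
        # 2. Render each estimate under a new random lighting condition.
        #    The rendered image and its provisional parameters form an
        #    exactly matching (temporary) labeled training pair.
        light = rng.uniform(0.5, 1.5, size=len(provisional))
        synthetic_images = render(provisional, light, basis)
        # 3. Refine the model on the labeled plus synthesized pairs.
        new_weights = fit(np.vstack([labeled_images, synthetic_images]),
                          np.vstack([labeled_params, provisional]))
        # 4. Stop once the model no longer changes noticeably.
        converged = np.linalg.norm(new_weights - weights) < tol
        weights = new_weights
        if converged:
            break
    return weights
```

The point of the sketch is step 2: even if the provisional estimate is wrong for the unlabeled photograph, the rendered image is by construction a correct observation of that estimate, so the synthesized pair is a valid training sample.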
