Surface reflectance recognition and real-world illumination statistics

Humans distinguish materials such as metal, plastic, and paper effortlessly at a glance. Traditional computer vision systems cannot solve this problem at all. Recognizing surface reflectance properties from a single photograph is difficult because the observed image depends heavily on the amount of light incident from every direction. A mirrored sphere, for example, produces a different image in every environment. To make matters worse, two surfaces with different reflectance properties could produce identical images. The mirrored sphere simply reflects its surroundings, so in the right artificial setting, it could mimic the appearance of a matte ping-pong ball. Yet, humans possess an intuitive sense of what materials typically “look like” in the real world. This thesis develops computational algorithms with a similar ability to recognize reflectance properties from photographs under unknown, real-world illumination conditions. Real-world illumination is complex, with light typically incident on a surface from every direction. We find, however, that real-world illumination patterns are not arbitrary. They exhibit highly predictable spatial structure, which we describe largely in the wavelet domain. Although they differ in several respects from the typical photographs, illumination patterns share much of the regularity described in the natural image statistics literature. These properties of real-world illumination lead to predictable image statistics for a surface with given reflectance properties. We construct a system that classifies a surface according to its reflectance from a single photograph under unknown illuminination. Our algorithm learns relationships between surface reflectance and certain statistics computed from the observed image. Like the human visual system, we solve the otherwise underconstrained inverse problem of reflectance estimation by taking advantage of the statistical regularity of illumination. For surfaces with homogeneous reflectance properties and known geometry, our system rivals human performance. Thesis Supervisors: Alan S. Willsky, Professor of Electrical Engineering Edward H. Adelson, Professor of Vision Science

[1]  P. Perona,et al.  Local analysis for 3D reconstruction of specular surfaces , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[2]  Pat Hanrahan,et al.  A signal-processing framework for inverse rendering , 2001, SIGGRAPH.

[3]  T. Inui,et al.  Group theory and its applications in physics , 1990 .

[4]  Jitendra Malik,et al.  Recovering photometric properties of architectural scenes from photographs , 1998, SIGGRAPH.

[5]  Paul E. Debevec,et al.  Acquiring the reflectance field of a human face , 2000, SIGGRAPH.

[6]  Daniel L. Ruderman,et al.  Origins of scaling in natural images , 1996, Vision Research.

[7]  André Gagalowicz,et al.  Image-based rendering of diffuse, specular and glossy surfaces from a single image , 2001, SIGGRAPH.

[8]  S K Nayar,et al.  Visual appearance of matte surfaces , 1995, Science.

[9]  Edward H. Adelson,et al.  Shiftable multiscale transforms , 1992, IEEE Trans. Inf. Theory.

[10]  D H Brainard,et al.  Bayesian color constancy. , 1997, Journal of the Optical Society of America. A, Optics, image science, and vision.

[11]  Katsushi Ikeuchi,et al.  Determining reflectance parameters and illumination distribution from a sparse set of images for view-dependent image synthesis , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[12]  D Marr,et al.  The computation of lightness by the primate retina. , 1974, Vision research.

[13]  Carl-Fredrik Westin,et al.  On the equivalence of normalized convolution and normalized differential convolution , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[14]  Roland W Fleming,et al.  Real-world illumination and the perception of surface reflectance properties. , 2003, Journal of vision.

[15]  Yair Weiss,et al.  Deriving intrinsic images from image sequences , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[16]  Edward H. Adelson,et al.  Noise removal via Bayesian wavelet coring , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[17]  Andrew Zisserman,et al.  Classifying Images of Materials: Achieving Viewpoint and Illumination Independence , 2002, ECCV.

[18]  Michael Bosse,et al.  Calibrated, Registered Images of an Extended Urban Area , 2003, International Journal of Computer Vision.

[19]  Tomaso Poggio,et al.  Everything old is new again: a fresh look at historical approaches in machine learning , 2002 .

[20]  D J Field,et al.  Relations between the statistics of natural images and the response properties of cortical cells. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[21]  Antonio Torralba,et al.  Contextual Priming for Object Detection , 2003, International Journal of Computer Vision.

[22]  Shoji Tominaga,et al.  Estimating Reflection Parameters from a Single Color Image , 2000, IEEE Computer Graphics and Applications.

[23]  L. Maloney,et al.  Color constancy: a method for recovering surface spectral reflectance. , 1986, Journal of the Optical Society of America. A, Optics and image science.

[24]  P. Hanrahan,et al.  On the relationship between radiance and irradiance: determining the illumination from images of a convex Lambertian object. , 2001, Journal of the Optical Society of America. A, Optics, image science, and vision.

[25]  S. Laughlin A Simple Coding Procedure Enhances a Neuron's Information Capacity , 1981, Zeitschrift fur Naturforschung. Section C, Biosciences.

[26]  William C. Snyder,et al.  Definition and invariance properties of structured surface BRDF , 2002, IEEE Trans. Geosci. Remote. Sens..

[27]  R. Love Surface reflection model estimation from naturally illuminated image sequences , 1997 .

[28]  Paul Debevec,et al.  Inverse global illumination: Recovering re?ectance models of real scenes from photographs , 1998 .

[29]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[30]  J. H. Hateren,et al.  Independent component filters of natural images compared with simple cells in primary visual cortex , 1998 .

[31]  Frank Canters,et al.  The World in Perspective: A Directory of World Map Projections , 1989 .

[32]  Pietro Perona,et al.  Local Analysis for 3D Reconstruction of Specular Surfaces - Part II , 2002, ECCV.

[33]  E. Land,et al.  Lightness and retinex theory. , 1971, Journal of the Optical Society of America.

[34]  Pat Hanrahan,et al.  An efficient representation for irradiance environment maps , 2001, SIGGRAPH.

[35]  Eero P. Simoncelli,et al.  Random Cascades on Wavelet Trees and Their Use in Analyzing and Modeling Natural Images , 2001 .

[36]  Bui Tuong Phong Illumination for computer generated pictures , 1975, Commun. ACM.

[37]  Shoji Tominaga,et al.  Parameter estimation of a reflection model from a multi-band image , 1999, Proceedings Workshop on Photometric Modeling for Computer Vision and Graphics (Cat. No.PR00271).

[38]  G. W. Larson,et al.  Rendering with radiance - the art and science of lighting visualization , 2004, Morgan Kaufmann series in computer graphics and geometric modeling.

[39]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[40]  Eero P. Simoncelli,et al.  A Parametric Texture Model Based on Joint Statistics of Complex Wavelet Coefficients , 2000, International Journal of Computer Vision.

[41]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[42]  David J. Field,et al.  What Is the Goal of Sensory Coding? , 1994, Neural Computation.

[43]  Samy Bengio,et al.  SVMTorch: Support Vector Machines for Large-Scale Regression Problems , 2001, J. Mach. Learn. Res..

[44]  Stéphane Mallat,et al.  The Texture Gradient Equation for Recovering Shape from Texture , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[45]  James D. Johnston,et al.  A filter family designed for use in quadrature mirror filter banks , 1980, ICASSP.

[46]  M. Landy,et al.  Shading Ambiguity: Reflectance and Illumination , 1991 .

[47]  Jitendra Malik,et al.  Recognizing surfaces using three-dimensional textons , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[48]  David Mumford,et al.  Statistics of natural images and models , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[49]  William T. Freeman,et al.  Exploiting the generic viewpoint assumption , 1996, International Journal of Computer Vision.

[50]  T. Poggio,et al.  Multiclass cancer diagnosis using tumor gene expression signatures , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[51]  J. V. van Hateren,et al.  Independent component filters of natural images compared with simple cells in primary visual cortex , 1998, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[52]  S. Nishida,et al.  Use of image-based information in judgments of surface-reflectance properties. , 1998, Journal of the Optical Society of America. A, Optics, image science, and vision.

[53]  Katsushi Ikeuchi,et al.  Determining Reflectance Parameters and Illumination Distribution from a Sparse Set of Images for View-dependent Image Synthesis , 2001, ICCV.

[54]  Shree K. Nayar,et al.  Bidirectional Reflection Distribution Function of Thoroughly Pitted Surfaces , 1999, International Journal of Computer Vision.

[55]  Satoshi Matsuoka,et al.  Teddy: A Sketching Interface for 3D Freeform Design , 1999, SIGGRAPH Courses.

[56]  Steven A. Shafer,et al.  Using color to separate reflection components , 1985 .

[57]  Carl-Johan Westelius,et al.  Focus of attention and gaze control for robot vision , 1995 .

[58]  Shree K. Nayar,et al.  Reflectance and texture of real-world surfaces , 1999, TOGS.

[59]  J. Beck,et al.  Highlights and the perception of glossiness , 1981, Perception & psychophysics.

[60]  E. Adelson,et al.  The Plenoptic Function and the Elements of Early Vision , 1991 .

[61]  Ronen Basri,et al.  Lambertian reflectance and linear subspaces , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[62]  A. Gilchrist,et al.  An anchoring theory of lightness perception. , 1999 .

[63]  Eero P. Simoncelli,et al.  Image compression via joint statistical characterization in the wavelet domain , 1999, IEEE Trans. Image Process..

[64]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[65]  Steve Marschner,et al.  A practical model for subsurface light transport , 2001, SIGGRAPH.

[66]  西野 恒 Photometric object modeling : Rendering from a dense/sparse set of images , 2002 .

[67]  Peter-Pike J. Sloan,et al.  The Lit Sphere: A Model for Capturing NPR Shading from Art , 2001, Graphics Interface.

[68]  William T. Freeman,et al.  Learning Low-Level Vision , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[69]  Tomaso A. Poggio,et al.  Regularization Networks and Support Vector Machines , 2000, Adv. Comput. Math..

[70]  Andrew S. Glassner,et al.  Principles of Digital Image Synthesis , 1995 .

[71]  Y. J. Tejwani,et al.  Robot vision , 1989, IEEE International Symposium on Circuits and Systems,.

[72]  Katsushi Ikeuchi,et al.  Object shape and reflectance modeling from observation , 1997, SIGGRAPH.

[73]  J. Beck Surface color perception , 1972 .

[74]  P Hanrahan,et al.  Digital materials and virtual weathering. , 2000, Scientific American.

[75]  S. Marschner,et al.  Inverse Rendering for Computer Graphics , 1998 .

[76]  Peter Schröder,et al.  Spherical wavelets: efficiently representing functions on the sphere , 1995, SIGGRAPH.

[77]  Kevin Amaratunga,et al.  Surface wavelets: a multiresolution signal processing tool for 3D computational modelling , 2001 .

[78]  Leonard McMillan,et al.  Plenoptic Modeling: An Image-Based Rendering System , 2023 .

[79]  Donald P. Greenberg,et al.  Toward a psychophysically-based light reflection model for image synthesis , 2000, SIGGRAPH.

[80]  David A. Forsyth,et al.  A novel algorithm for color constancy , 1990, International Journal of Computer Vision.

[81]  E. Adelson Lightness Perception and Lightness Illusions , 1999 .

[82]  Szymon Rusinkiewicz,et al.  A New Change of Variables for Efficient BRDF Representation , 1998, Rendering Techniques.

[83]  Paul E. Debevec,et al.  Rendering synthetic objects into real scenes: bridging traditional and image-based graphics with global illumination and high dynamic range photography , 1998, SIGGRAPH '08.

[84]  Richard Szeliski,et al.  Layer extraction from multiple images containing reflections and transparency , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[85]  Henrik Wann Jensen,et al.  A rapid hierarchical rendering technique for translucent materials , 2005, ACM Trans. Graph..

[86]  C. H. Edwards,et al.  Calculus and Analytic Geometry , 1982 .

[87]  C. R. Hoffman,et al.  Illumination and Reflection Maps : Simulated Objects in Simulated and Real Environments Gene , 1984 .

[88]  Eero P. Simoncelli,et al.  Natural image statistics and neural representation. , 2001, Annual review of neuroscience.

[89]  F. E. Nicodemus,et al.  Geometrical considerations and nomenclature for reflectance , 1977 .

[90]  Eero P. Simoncelli,et al.  Natural signal statistics and sensory gain control , 2001, Nature Neuroscience.

[91]  Marshall Friend Tappen Recovering shading and reflectance from a single image , 2002 .

[92]  James F. Blinn,et al.  Models of light reflection for computer synthesized pictures , 1977, SIGGRAPH.

[93]  B. Wandell,et al.  Color Constancy: From Physics to Appearance , 1993 .

[94]  Edward H. Adelson,et al.  How do Humans Determine Reflectance Properties under Unknown Illumination , 2001, CVPR 2001.

[95]  Andrew S. Glassner Principles of digital image synthesis. Volume 1 , 1995 .

[96]  Donald P. Greenberg,et al.  A comprehensive physical model for light reflection , 1991, SIGGRAPH.

[97]  Jitendra Malik,et al.  Computing Local Surface Orientation and Shape from Texture for Curved Surfaces , 1997, International Journal of Computer Vision.

[98]  D. Ruderman The statistics of natural images , 1994 .

[99]  D. Tolhurst,et al.  Amplitude spectra of natural images. , 1992, Ophthalmic & physiological optics : the journal of the British College of Ophthalmic Opticians.

[100]  A. Gilchrist,et al.  An anchoring theory of lightness perception. , 1999, Psychological review.

[101]  M. D'Zmura,et al.  Color constancy. I. Basic theory of two-stage linear recovery of spectral descriptions for lights and surfaces. , 1993, Journal of the Optical Society of America. A, Optics, image science, and vision.

[102]  G. Buchsbaum A spatial processor model for object colour perception , 1980 .

[103]  Kristin J. Dana,et al.  Recognition methods for 3D textured surfaces , 2001, IS&T/SPIE Electronic Imaging.

[104]  Shree K. Nayar,et al.  Generalization of Lambert's reflectance model , 1994, SIGGRAPH.

[105]  Gregory J. Ward,et al.  Measuring and modeling anisotropic reflection , 1992, SIGGRAPH.

[106]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[107]  Eero P. Simoncelli Modeling the joint statistics of images in the wavelet domain , 1999, Optics & Photonics.

[108]  Anuj Srivastava,et al.  Probability Models for Clutter in Natural Images , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[109]  Martin J. Wainwright,et al.  Adaptive Wiener denoising using a Gaussian scale mixture model in the wavelet domain , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[110]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[111]  Steve Marschner,et al.  Image-Based BRDF Measurement Including Human Skin , 1999, Rendering Techniques.