Statistical Modeling of 3-D Natural Scenes With Application to Bayesian Stereopsis

We studied the empirical distributions of luminance, range and disparity wavelet coefficients using a coregistered database of luminance and range images. The marginal distributions of range and disparity are observed to have high peaks and heavy tails, similar to the well-known properties of luminance wavelet coefficients. However, we found that the kurtosis of range and disparity coefficients is significantly larger than that of luminance coefficients. We used generalized Gaussian models to fit the empirical marginal distributions. We found that the marginal distribution of luminance coefficients have a shape parameter p between 0.6 and 0.8, while range and disparity coefficients have much smaller parameters p <; 0.32, corresponding to a much higher peak. We also examined the conditional distributions of luminance, range and disparity coefficients. The magnitudes of luminance and range (disparity) coefficients show a clear positive correlation, which means, at a location with larger luminance variation, there is a higher probability of a larger range (disparity) variation. We also used generalized Gaussians to model the conditional distributions of luminance and range (disparity) coefficients. The values of the two shape parameters (p,s) reflect the observed luminance-range (disparity) dependency. As an example of the usefulness of luminance statistics conditioned on range statistics, we modified a well-known Bayesian stereo ranging algorithm using our natural scene statistics models, which improved its performance.

[1]  D. Mumford,et al.  Stochastic models for generic images , 2001 .

[2]  M. H. Loew,et al.  A generalized Gaussian autoregressive model for chest X-ray images , 1989, Images of the Twenty-First Century. Proceedings of the Annual International Engineering in Medicine and Biology Society,.

[3]  David J. Field,et al.  What Is the Goal of Sensory Coding? , 1994, Neural Computation.

[4]  Oscar Firschein,et al.  Readings in computer vision: issues, problems, principles, and paradigms , 1987 .

[5]  G. Poggio,et al.  Binocular interaction and depth sensitivity in striate and prestriate cortex of behaving rhesus monkey. , 1977, Journal of neurophysiology.

[6]  B. Cumming,et al.  Testing quantitative models of binocular disparity selectivity in primary visual cortex. , 2003, Journal of neurophysiology.

[7]  Paul Scheunders,et al.  Wavelet Denoising of Multicomponent Images Using Gaussian Scale Mixture Models and a Noise-Free Image as Priors , 2007, IEEE Transactions on Image Processing.

[8]  J. Pokorny Foundations of Cyclopean Perception , 1972 .

[9]  M. Landy,et al.  Measurement and modeling of depth cue combination: in defense of weak fusion , 1995, Vision Research.

[10]  Tai Sing Lee,et al.  Scaling Laws in Natural Scenes and the Inference of 3D Shape , 2005, NIPS.

[11]  Dale Purves,et al.  Image/source statistics of surfaces in natural scenes , 2003, Network.

[12]  Ning Qian,et al.  Computing Stereo Disparity and Motion with Known Binocular Cell Properties , 1994, Neural Computation.

[13]  Eero P. Simoncelli,et al.  Image Modeling and Denoising With Orientation-Adapted Gaussian Scale Mixtures , 2008, IEEE Transactions on Image Processing.

[14]  Martin J. Wainwright,et al.  Image denoising using scale mixtures of Gaussians in the wavelet domain , 2003, IEEE Trans. Image Process..

[15]  Russell M. Mersereau,et al.  Adaptive subband video coding using bivariate generalized Gaussian distribution model , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[16]  David Mumford,et al.  Occlusion Models for Natural Images: A Statistical Study of a Scale-Invariant Dead Leaves Model , 2004, International Journal of Computer Vision.

[17]  Ian P. Howard,et al.  Binocular Vision and Stereopsis , 1996 .

[18]  Christopher Joseph Pal,et al.  Efficiently Learning Random Fields for Stereo Vision with Sparse Message Passing , 2008, ECCV.

[19]  W. PEDDIE,et al.  Helmholtz's Treatise on Physiological Optics , 1926, Nature.

[20]  Javier Portilla,et al.  Image Restoration Using Space-Variant Gaussian Scale Mixtures in Overcomplete Pyramids , 2008, IEEE Transactions on Image Processing.

[21]  C. Blakemore The range and scope of binocular depth discrimination in man , 1970, The Journal of physiology.

[22]  Nanning Zheng,et al.  Stereo Matching Using Belief Propagation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  A. Parker,et al.  Integration of depth modules: Stereopsis and texture , 1993, Vision Research.

[24]  William Bialek,et al.  Statistics of Natural Images: Scaling in the Woods , 1993, NIPS.

[25]  A. Parker,et al.  Range and mechanism of encoding of horizontal disparity in macaque V1. , 2002, Journal of neurophysiology.

[26]  Stephen T. Barnard,et al.  A Stochastic Approach to Stereo Vision , 1986, AAAI.

[27]  N BelhumeurPeter A Bayesian approach to binocular stereopsis , 1996 .

[28]  Richard A Eagle,et al.  Weighted directional energy model of human stereo correspondence , 2000, Vision Research.

[29]  Dale Purves,et al.  Range image statistics can explain the anomalous perception of length , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[30]  D J Field,et al.  Relations between the statistics of natural images and the response properties of cortical cells. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[31]  Tai Sing Lee,et al.  Statistical correlations between two-dimensional images and three-dimensional structures in natural scenes. , 2003, Journal of the Optical Society of America. A, Optics, image science, and vision.

[32]  Darius Burschka,et al.  Advances in Computational Stereo , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[33]  Fernando Pérez-González,et al.  Modeling Gabor Coefficients via Generalized Gaussian Distributions for Face Recognition , 2007, 2007 IEEE International Conference on Image Processing.

[34]  Eero P. Simoncelli,et al.  Image compression via joint statistical characterization in the wavelet domain , 1999, IEEE Trans. Image Process..

[35]  Yang Liu,et al.  Disparity statistics in natural scenes. , 2008, Journal of vision.

[36]  P. Moulin,et al.  Analysis of multiresolution image denoising schemes using generalized-Gaussian priors , 1998, Proceedings of the IEEE-SP International Symposium on Time-Frequency and Time-Scale Analysis (Cat. No.98TH8380).

[37]  Dale Purves,et al.  A statistical explanation of visual space , 2003, Nature Neuroscience.

[38]  David Mumford,et al.  Statistics of range images , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[39]  William T. Freeman,et al.  Presented at: 2nd Annual IEEE International Conference on Image , 1995 .

[40]  David C Knill,et al.  Mixture models and the probabilistic structure of depth cues , 2003, Vision Research.

[41]  Alan C. Bovik,et al.  Relationship between the Helmholtz shear of vertical meridians and disparity statistics in natural scenes , 2010 .

[42]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  D. Scharstein,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, Proceedings IEEE Workshop on Stereo and Multi-Baseline Vision (SMBV 2001).

[44]  Ken D. Sauer,et al.  Efficient ML estimation of the shape parameter for generalized Gaussian MRFs , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[45]  Asoke K. Nandi,et al.  Exponent parameter estimation for generalized Gaussian probability density functions with application to speech modeling , 2005, Signal Process..

[46]  Gregory C DeAngelis,et al.  Coding of horizontal disparity and velocity by MT neurons in the alert macaque. , 2003, Journal of neurophysiology.

[47]  Eero P. Simoncelli,et al.  A Parametric Texture Model Based on Joint Statistics of Complex Wavelet Coefficients , 2000, International Journal of Computer Vision.

[48]  Christopher Joseph Pal,et al.  Learning Conditional Random Fields for Stereo , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[49]  H. Bülthoff,et al.  3D shape perception from combined depth cues in human visual cortex , 2005, Nature Neuroscience.

[50]  Richard Szeliski,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, International Journal of Computer Vision.

[51]  Edward H. Adelson,et al.  The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[52]  Michael J. Black,et al.  On the Spatial Statistics of Optical Flow , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[53]  G C DeAngelis,et al.  The physiology of stereopsis. , 2001, Annual review of neuroscience.

[54]  David C Knill,et al.  Reaching for visual cues to depth: the brain combines depth cues differently for motor control and perception. , 2005, Journal of vision.

[55]  Alberto Leon-Garcia,et al.  Estimation of shape parameter for generalized Gaussian distributions in subband decompositions of video , 1995, IEEE Trans. Circuits Syst. Video Technol..

[56]  Nir A. Sochen,et al.  Stereo Matching with Mumford-Shah Regularization and Occlusion Handling , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[57]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[58]  A. Parker,et al.  Quantitative analysis of the responses of V1 neurons to horizontal disparity in dynamic random-dot stereograms. , 2002, Journal of neurophysiology.