Paper information - Natural Image Coding in V1: How Much Use Is Orientation Selectivity?

Orientation selectivity is the most striking feature of simple cell coding in V1 that has been shown to emerge from the reduction of higher-order correlations in natural images in a large variety of statistical image models. The most parsimonious one among these models is linear Independent Component Analysis (ICA), whereas second-order decorrelation transformations such as Principal Component Analysis (PCA) do not yield oriented filters. Because of this finding, it has been suggested that the emergence of orientation selectivity may be explained by higher-order redundancy reduction. To assess the tenability of this hypothesis, it is an important empirical question how much more redundancy can be removed with ICA in comparison to PCA or other second-order decorrelation methods. Although some previous studies have concluded that the amount of higher-order correlation in natural images is generally insignificant, other studies reported an extra gain for ICA of more than 100%. A consistent conclusion about the role of higher-order correlations in natural images can be reached only by the development of reliable quantitative evaluation methods. Here, we present a very careful and comprehensive analysis using three evaluation criteria related to redundancy reduction: In addition to the multi-information and the average log-loss, we compute complete rate–distortion curves for ICA in comparison with PCA. Without exception, we find that the advantage of the ICA filters is small. At the same time, we show that a simple spherically symmetric distribution with only two parameters can fit the data significantly better than the probabilistic model underlying ICA. This finding suggests that, although the amount of higher-order correlation in natural images can in fact be significant, the feature of orientation selectivity does not yield a large contribution to redundancy reduction within the linear filter bank models of V1 simple cells.
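For reference, two of the criteria named above have standard information-theoretic definitions. The sketch below uses notation that is assumed here rather than taken from the abstract: $x$ is an image patch with density $p$, $y = Wx$ is the vector of $n$ filter responses, $h[\cdot]$ is differential entropy, and $q$ is a model density.

```latex
% Multi-information: the redundancy remaining among the filter responses y_1, ..., y_n
I[Y] = \sum_{k=1}^{n} h[Y_k] - h[Y]
     = D_{\mathrm{KL}}\Big( p(y) \,\Big\|\, \prod_{k=1}^{n} p_k(y_k) \Big)

% Joint entropy under an invertible linear transform y = W x
h[Y] = h[X] + \log \lvert \det W \rvert

% Average log-loss of a model density q, evaluated on data drawn from p
\mathbb{E}_{p}\big[ -\log q(x) \big] = h[X] + D_{\mathrm{KL}}\big( p \,\|\, q \big)
```

Under these definitions, two whitening transforms that differ only by an orthogonal rotation share the same $\lvert \det W \rvert$, so the difference in their multi-information reduces to the difference in their summed marginal entropies. This is one way to read the ICA-versus-PCA comparison; the estimators actually used in the paper are not stated in the abstract.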
