In Praise of Artifice Reloaded: Caution With Natural Image Databases in Modeling Vision

Subjective image quality databases are a major source of raw data on how the visual system works in naturalistic environments. These databases describe the sensitivity of many observers to a wide range of distortions of different nature and intensity seen on top of a variety of natural images. Data of this kind seems to open a number of possibilities for the vision scientist to check the models in realistic scenarios. However, while these natural databases are great benchmarks for models developed in some other way (e.g., by using the well-controlled artificial stimuli of traditional psychophysics), they should be carefully used when trying to fit vision models. Given the high dimensionality of the image space, it is very likely that some basic phenomena are under-represented in the database. Therefore, a model fitted on these large-scale natural databases will not reproduce these under-represented basic phenomena that could otherwise be easily illustrated with well selected artificial stimuli. In this work we study a specific example of the above statement. A standard cortical model using wavelets and divisive normalization tuned to reproduce subjective opinion on a large image quality dataset fails to reproduce basic cross-masking. Here we outline a solution for this problem by using artificial stimuli and by proposing a modification that makes the model easier to tune. Then, we show that the modified model is still competitive in the large-scale database. Our simulations with these artificial stimuli show that when using steerable wavelets, the conventional unit norm Gaussian kernels in divisive normalization should be multiplied by high-pass filters to reproduce basic trends in masking. Basic visual phenomena may be misrepresented in large natural image datasets but this can be solved with model-interpretable stimuli. This is an additional argument in praise of artifice in line with Rust and Movshon (2005).

[1]  M. Bertalmío,et al.  Appropriate kernels for Divisive Normalization explained by Wilson-Cowan equations , 2018, 1804.05964.

[2]  Zhengfang Duanmu,et al.  End-to-End Blind Image Quality Assessment Using Deep Neural Networks , 2018, IEEE Transactions on Image Processing.

[3]  M Martinez-Garcia,et al.  Derivatives and inverse of cascaded linear+nonlinear neural models , 2017, PloS one.

[4]  Sebastian Bosse,et al.  Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment , 2016, IEEE Transactions on Image Processing.

[5]  Xavier Otazu,et al.  Which tone-mapping operator is the best? A comparative study of perceptual quality , 2016, Journal of the Optical Society of America. A, Optics, image science, and vision.

[6]  Valero Laparra,et al.  Eigen-Distortions of Hierarchical Representations , 2017, NIPS.

[7]  Marcelo Bertalmío,et al.  The Wilson-Cowan model describes Contrast Response and Subjective Distortion , 2017 .

[8]  R. VanRullen Perception Science in the Age of Deep Neural Networks , 2017, Front. Psychol..

[9]  Valero Laparra,et al.  Perceptually Optimized Image Rendering , 2017, Journal of the Optical Society of America. A, Optics, image science, and vision.

[10]  J. Bohannon The cyberscientist. , 2017, Science.

[11]  Davide Castelvecchi,et al.  Can we open the black box of AI? , 2016, Nature.

[12]  David Kane,et al.  System gamma as a function of image- and monitor-dynamic range. , 2016, Journal of vision.

[13]  Marcelo Bertalmío,et al.  Optimized Tone Curve for In-Camera Image Processing , 2016, IQSP.

[14]  Alan C. Bovik,et al.  Massive Online Crowdsourced Study of Subjective and Objective Picture Quality , 2015, IEEE Transactions on Image Processing.

[15]  Marius Pedersen,et al.  Evaluation of 60 full-reference image quality metrics on the CID:IQ , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[16]  Valero Laparra,et al.  Visual aftereffects and sensory nonlinearities from a single statistical framework , 2015, Front. Hum. Neurosci..

[17]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[18]  Touradj Ebrahimi,et al.  Subjective quality assessment database of HDR images compressed with JPEG XT , 2015, 2015 Seventh International Workshop on Quality of Multimedia Experience (QoMEX).

[19]  Nikolay N. Ponomarenko,et al.  Image database TID2013: Peculiarities, results and perspectives , 2015, Signal Process. Image Commun..

[20]  Kedarnath P Vilankar,et al.  Local masking in natural images: a database and analysis. , 2014, Journal of vision.

[21]  Marcelo Bertalmío,et al.  From image processing to computational neuroscience: a neural model based on histogram equalization , 2014, Front. Comput. Neurosci..

[22]  Christophe Charrier,et al.  Blind Prediction of Natural Video Quality , 2014, IEEE Transactions on Image Processing.

[23]  A. Hyvärinen,et al.  Spatio-Chromatic Adaptation via Higher-Order Canonical Correlation Analysis of Natural Images , 2014, PloS one.

[24]  Mahdi Nezamabadi,et al.  Color Appearance Models , 2014, J. Electronic Imaging.

[25]  Mark D. Fairchild,et al.  Color Appearance Models: Fairchild/Color Appearance Models , 2013 .

[26]  Valero Laparra,et al.  Nonlinearities and Adaptation of Color Vision from Sequential Principal Curves Analysis , 2016, Neural Computation.

[27]  Christophe Charrier,et al.  Blind Image Quality Assessment: A Natural Scene Statistics Approach in the DCT Domain , 2012, IEEE Transactions on Image Processing.

[28]  Xueliang Li,et al.  On a Relation Between , 2012 .

[29]  M. Carandini,et al.  Normalization as a canonical neural computation , 2011, Nature Reviews Neuroscience.

[30]  Alan C. Bovik,et al.  Blind Image Quality Assessment: From Natural Scene Statistics to Perceptual Quality , 2011, IEEE Transactions on Image Processing.

[31]  Valero Laparra,et al.  Psychophysically Tuned Divisive Normalization Approximately Factorizes the PDF of Natural Images , 2010, Neural Computation.

[32]  Valero Laparra,et al.  Divisive normalization image quality metric revisited. , 2010, Journal of the Optical Society of America. A, Optics, image science, and vision.

[33]  Alan C. Bovik,et al.  A Two-Step Framework for Constructing Blind Image Quality Indices , 2010, IEEE Signal Processing Letters.

[34]  Christophe Charrier,et al.  A DCT Statistics-Based Blind Image Quality Index , 2010, IEEE Signal Processing Letters.

[35]  Eric C. Larson,et al.  Most apparent distortion: full-reference image quality assessment and the role of strategy , 2010, J. Electronic Imaging.

[36]  Marcelo Bertalmío,et al.  Implementing the Retinex algorithm with Wilson–Cowan equations , 2009, Journal of Physiology-Paris.

[37]  Alan C. Bovik,et al.  Mean squared error: Love it or leave it? A new look at Signal Fidelity Measures , 2009, IEEE Signal Processing Magazine.

[38]  Nikolay N. Ponomarenko,et al.  Color image database for evaluation of image quality metrics , 2008, 2008 IEEE 10th Workshop on Multimedia Signal Processing.

[39]  David H. Brainard,et al.  The Relation Between Color Discrimination and Color Constancy: When Is Optimal Adaptation Task Dependent? , 2007, Neural Computation.

[40]  J. Malo,et al.  V1 non-linear properties emerge from local-to-global non-linear ICA , 2006, Network.

[41]  Eero P. Simoncelli,et al.  Nonlinear image representation for efficient perceptual coding , 2006, IEEE Transactions on Image Processing.

[42]  Francesc J. Ferri,et al.  Regularization operators for natural images based on nonlinear perception models , 2006, IEEE Transactions on Image Processing.

[43]  Nicole C. Rust,et al.  In praise of artifice , 2005, Nature Neuroscience.

[44]  Philip Corriveau,et al.  Video Quality Experts Group , 2005 .

[45]  David H Brainard,et al.  Do common mechanisms of adaptation mediate color discrimination and appearance? Uniform backgrounds. , 2005, Journal of the Optical Society of America. A, Optics, image science, and vision.

[46]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[47]  Nikolay N. Ponomarenko,et al.  TID2008 – A database for evaluation of full-reference visual quality assessment metrics , 2004 .

[48]  Jesús Malo,et al.  Video quality measures based on the standard spatial observer , 2002, Proceedings. International Conference on Image Processing.

[49]  Donald I. A. MacLeod,et al.  Color discrimination, color constancy and natural scene statistics , 2002 .

[50]  Francesc J. Ferri,et al.  Perceptual feedback in multigrid motion estimation using an improved DCT quantization , 2001, IEEE Trans. Image Process..

[51]  Eero P. Simoncelli,et al.  Natural signal statistics and sensory gain control , 2001, Nature Neuroscience.

[52]  Peter Dayan,et al.  Theoretical Neuroscience: Computational and Mathematical Modeling of Neural Systems , 2001 .

[53]  Jesús Malo,et al.  Importance of quantiser design compared to optimal multigrid motion estimation in video coding , 2000 .

[54]  Francesc J. Ferri,et al.  The role of perceptual contrast non-linearities in image transform quantization , 2000, Image Vis. Comput..

[55]  J A Solomon,et al.  Model of visual contrast gain control and pattern masking. , 1997, Journal of the Optical Society of America. A, Optics, image science, and vision.

[56]  J. M. Foley,et al.  Human luminance pattern-vision mechanisms: masking experiments require a new model. , 1994, Journal of the Optical Society of America. A, Optics, image science, and vision.

[57]  M. Carandini,et al.  Summation and division by neurons in primate visual cortex. , 1994, Science.

[58]  Patrick C. Teo,et al.  Perceptual image distortion , 1994, Proceedings of 1st International Conference on Image Processing.

[59]  Edward H. Adelson,et al.  Shiftable multiscale transforms , 1992, IEEE Trans. Inf. Theory.

[60]  S. Laughlin,et al.  Matching Coding to Scenes to Enhance Efficiency , 1983 .

[61]  G. Legge A power law for contrast discrimination , 1981, Vision Research.

[62]  D. Sakrison,et al.  On the Role of the Observer and a Distortion Measure in Image Transmission , 1977, IEEE Trans. Commun..

[63]  J. Cowan,et al.  Excitatory and inhibitory interactions in localized populations of model neurons. , 1972, Biophysical journal.

[64]  J. Robson,et al.  Application of fourier analysis to the visibility of gratings , 1968, The Journal of physiology.

[65]  F. Attneave Some informational aspects of visual perception. , 1954, Psychological review.

[66]  T. Smith,et al.  The C.I.E. colorimetric standards and their use , 1931 .