Visual Texture Perception with Feature Learning Models and Deep Architectures

Texture is an important property of images, and a key component for human visual perception. In this work, based on several feature learning models and deep architectures, we study the visual texture perception problem, which is helpful for understanding both the impact of texture itself and the basic mechanisms of human visual systems. Through a series of psychophysical experiments, we find that 12 perceptual features are significant to describe the texture images with regard to the human perceptions. Hence, we represent each texture image with a 12-dimensional vector, corresponding to the values of the 12 perceptual features. To improve the learnablity of existing feature learning models, we propose a set of deep architectures to learn compact representations of the texture perceptual features. Extensive experiments on texture images classification demonstrate the effectiveness of both the feature learning models and the deep architectures. In particular, the advantage of deep architectures over existing feature learning models is shown.

[1]  I. Jolliffe Principal Component Analysis , 2002 .

[2]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[3]  John K. Tsotsos,et al.  Parameterless Isomap with Adaptive Neighborhood Selection , 2006, DAGM-Symposium.

[4]  Wu-Jun Li,et al.  Gaussian Process Latent Random Field , 2010, AAAI.

[5]  Michael S. Landy,et al.  Visual perception of texture , 2002 .

[6]  Deli Zhao,et al.  Linear local tangent space alignment and application to face recognition , 2007, Neurocomputing.

[7]  Geoffrey E. Hinton,et al.  Deep Boltzmann Machines , 2009, AISTATS.

[8]  John W. Sammon,et al.  A Nonlinear Mapping for Data Structure Analysis , 1969, IEEE Transactions on Computers.

[9]  Michael E. Tipping,et al.  Probabilistic Principal Component Analysis , 1999 .

[10]  D. Field,et al.  Sensitivity to contrast histogram differences in synthetic wavelet-textures , 2001, Vision Research.

[11]  F. Durgin Texture contrast aftereffects are monocular; texture density aftereffects are binocular , 2001, Vision Research.

[12]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[13]  A. Ravishankar Rao,et al.  Towards a texture naming system: Identifying relevant dimensions of texture , 1993, Vision Research.

[14]  H. Zha,et al.  Principal manifolds and nonlinear dimensionality reduction via tangent space alignment , 2004, SIAM J. Sci. Comput..

[15]  Cheng-Lin Liu,et al.  Error-correcting output codes based ensemble feature extraction , 2013, Pattern Recognit..

[16]  Geoffrey E. Hinton,et al.  Stochastic Neighbor Embedding , 2002, NIPS.

[17]  S. S. Wolfson,et al.  PII: S0042-6989(97)00153-3 , 2003 .

[18]  Mohamed Cheriet,et al.  Large Margin Low Rank Tensor Analysis , 2013, Neural Computation.

[19]  Neil D. Lawrence,et al.  Gaussian Process Latent Variable Models for Visualisation of High Dimensional Data , 2003, NIPS.

[20]  Mike J. Chantler,et al.  Identifying perceptual features of procedural textures , 2013 .

[21]  L. J. P. van der Maaten,et al.  An Introduction to Dimensionality Reduction Using Matlab , 2007 .