A Perception-Inspired Deep Learning Framework for Predicting Perceptual Texture Similarity

Similarity learning plays a fundamental role in the fields of multimedia retrieval and pattern recognition. Prediction of perceptual similarity is a challenging task as in most cases we lack human labeled ground-truth data and robust models to mimic human visual perception. Although in the literature, some studies have been dedicated to similarity learning, they mainly focus on the evaluation of whether or not two images are similar, rather than prediction of perceptual similarity which is consistent with human perception. Inspired by the human visual perception mechanism, we here propose a novel framework in order to predict perceptual similarity between two texture images. Our proposed framework is built on the top of Convolutional Neural Networks (CNNs). The proposed framework considers both powerful features and perceptual characteristics of contours extracted from the images. The similarity value is computed by aggregating resemblances between the corresponding convolutional layer activations of the two texture maps. Experimental results show that the predicted similarity values are consistent with the human-perceived similarity data.

[1]  Shree K. Nayar,et al.  Reflectance and Texture of Real-World Surfaces Authors , 1997, CVPR 1997.

[2]  Stéphane Mallat,et al.  Rotation, Scaling and Deformation Invariant Scattering for Texture Discrimination , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  C. Heaps,et al.  Similarity and Features of Natural Textures , 1999 .

[4]  Ko Nishino,et al.  Automatically discovering local visual material attributes , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  B. S. Manjunath,et al.  Texture features and learning similarity , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[7]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[8]  Phil Brodatz,et al.  Textures: A Photographic Album for Artists and Designers , 1966 .

[9]  B. S. Manjunath,et al.  Texture Features for Browsing and Retrieval of Image Data , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  B Julesz,et al.  Experiments in the visual perception of texture. , 1975, Scientific American.

[11]  A. Ravishankar Rao,et al.  Identifying High Level Features of Texture Perception , 1993, CVGIP Graph. Model. Image Process..

[12]  Xinghui Dong,et al.  Perceptual texture similarity estimation , 2014 .

[13]  Lei Wang,et al.  Learning Texture Similarity with Perceptual Pairwise Distance , 2005 .

[14]  Noureddine Abbadeni,et al.  Computational Perceptual Features for Texture Representation and Retrieval , 2011, IEEE Transactions on Image Processing.

[15]  Matti Pietikäinen,et al.  Outex - new framework for empirical evaluation of texture analysis algorithms , 2002, Object recognition supported by user interaction for service robots.

[16]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Jorma Laaksonen,et al.  Measuring Concept Similarities in Multimedia Ontologies: Analysis and Evaluations , 2007, IEEE Transactions on Multimedia.

[18]  G. Gescheider Psychophysics: The Fundamentals , 1997 .

[19]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[20]  Mike J. Chantler,et al.  Perceptual Similarity: A Texture Challenge , 2011, BMVC.

[21]  Kristen Grauman,et al.  Relative attributes , 2011, 2011 International Conference on Computer Vision.

[22]  Edward H. Adelson,et al.  Recognizing Materials Using Perceptually Inspired Features , 2013, International Journal of Computer Vision.

[23]  Wojciech Czarnecki,et al.  On Loss Functions for Deep Neural Networks in Classification , 2017, ArXiv.

[24]  Béla Julesz,et al.  Visual Pattern Discrimination , 1962, IRE Trans. Inf. Theory.

[25]  David J. Field,et al.  Contour integration by the human visual system: Evidence for a local “association field” , 1993, Vision Research.

[26]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Alexei A. Efros,et al.  The Unreasonable Effectiveness of Deep Features as a Perceptual Metric , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[28]  Jana Reinhard,et al.  Textures A Photographic Album For Artists And Designers , 2016 .

[29]  Johan Wagemans,et al.  The Awakening of Attneave's Sleeping cat: Identification of Everyday Objects on the Basis of Straight-Line Versions of Outlines , 2008, Perception.

[30]  Fraser Halley,et al.  Perceptually relevant browsing environments for large texture databases , 2012 .

[31]  Eero P. Simoncelli,et al.  Texture modeling and synthesis using joint statistics of complex wavelet coefficients , 1999 .

[32]  Eero P. Simoncelli,et al.  Representation of Naturalistic Image Structure in the Primate Visual Cortex. , 2014, Cold Spring Harbor symposia on quantitative biology.

[33]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[34]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Nanning Zheng,et al.  Large Margin Learning in Set-to-Set Similarity Comparison for Person Reidentification , 2017, IEEE Transactions on Multimedia.

[36]  Kristin J. Dana,et al.  Deep TEN: Texture Encoding Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[37]  Meng Wang,et al.  Beyond Distance Measurement: Constructing Neighborhood Similarity for Video Annotation , 2009, IEEE Transactions on Multimedia.

[38]  Junyu Dong,et al.  Visual Perception of Procedural Textures: Identifying Perceptual Dimensions and Predicting Generation Models , 2015, PloS one.

[39]  A. Ravishankar Rao,et al.  Towards a texture naming system: Identifying relevant dimensions of texture , 1993, Vision Research.

[40]  Ying Gao,et al.  Perceptual texture similarity learning using deep neural networks , 2017, 2017 13th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD).

[41]  Yan Lu,et al.  Local Descriptors Optimized for Average Precision , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[42]  Mike J. Chantler,et al.  Perceptually Motivated Image Features Using Contours , 2016, IEEE Transactions on Image Processing.

[43]  Nuno Vasconcelos,et al.  On the Design of Loss Functions for Classification: theory, robustness to outliers, and SavageBoost , 2008, NIPS.

[44]  Xianghua Xie,et al.  Handbook of Texture Analysis , 2008 .

[45]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  Mike J. Chantler,et al.  The Importance of Long-Range Interactions to Texture Similarity , 2013, CAIP.

[47]  Junyu Dong,et al.  A Procedural Texture Generation Framework Based on Semantic Descriptions , 2019, Knowl. Based Syst..

[48]  B. Julesz Textons, the elements of texture perception, and their interactions , 1981, Nature.

[49]  Junyu Dong,et al.  The Visual Word Booster: A Spatial Layout of Words Descriptor Exploiting Contour Cues , 2018, IEEE Transactions on Image Processing.

[50]  Leon A. Gatys,et al.  Texture Synthesis Using Convolutional Neural Networks , 2015, NIPS.

[51]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[52]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[53]  Shengcai Liao,et al.  Deep Hybrid Similarity Learning for Person Re-Identification , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[54]  Wee Kheng Leow,et al.  A hybrid model for invariant and perceptual texture mapping , 2002, Object recognition supported by user interaction for service robots.

[55]  Nikos Komodakis,et al.  Learning to compare image patches via convolutional neural networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[56]  Lorenzo Rosasco,et al.  Are Loss Functions All the Same? , 2004, Neural Computation.

[57]  Pengfei Xu,et al.  Multiscale road centerlines extraction from high-resolution aerial imagery , 2019, Neurocomputing.

[58]  Robert King,et al.  Textural features corresponding to textural properties , 1989, IEEE Trans. Syst. Man Cybern..

[59]  Paul F. Whelan,et al.  Convolutional neural network on three orthogonal planes for dynamic texture classification , 2017, Pattern Recognit..

[60]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[61]  Junyu Dong,et al.  Learning perceptual texture similarity and relative attributes from computational features , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[62]  Eero P. Simoncelli,et al.  Texture characterization via joint statistics of wavelet coefficient magnitudes , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[63]  Alberto Del Bimbo,et al.  Retrieval by Shape Similarity with Perceptual Distance and Effective Indexing , 2000, IEEE Trans. Multim..

[64]  Rahul Sukthankar,et al.  MatchNet: Unifying feature and metric learning for patch-based matching , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[65]  Hideyuki Tamura,et al.  Textural Features Corresponding to Visual Perception , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[66]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.