A scale-invariant framework for image classification with deep learning

In this paper, we propose a scale-invariant framework based on Convolutional Neural Networks (CNNs). The network exhibits robustness to scale and resolution variations in data. Previous efforts in achieving scale invariance were made on either integrating several variant-specific CNNs or data augmentation. However, these methods did not solve the fundamental problem that CNNs develop different feature representations for the variants of the same image. The topology proposed by this paper develops a uniform representation for each of the variants of the same image. The uniformity is acquired by concatenating scale-variant and scale-invariant features to enlarge the feature space so that the case when input images are of diverse variations but from the same class can be distinguished from another case when images are of different classes. Higher-order decision boundaries lead to the success of the framework. Experimental results on a challenging dataset substantiates that our framework performs better than traditional frameworks with the same number of free parameters. Our proposed framework can also achieve a higher training efficiency.

[1]  Gunnar Farnebäck,et al.  Two-Frame Motion Estimation Based on Polynomial Expansion , 2003, SCIA.

[2]  Joshua Gluckman,et al.  Scale Variant Image Pyramids , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[3]  Don H. Johnson,et al.  Interpreting Canvas Weave Matches , 2010 .

[4]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Neural Networks , 2013 .

[5]  Lei Yao,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence 1 Rhythmic Brushstrokes Distinguish Van Gogh from His Contemporaries: Findings via Automated Brushstroke Extraction , 2022 .

[6]  Thomas Brox,et al.  Striving for Simplicity: The All Convolutional Net , 2014, ICLR.

[7]  Patrice Abry,et al.  When Van Gogh meets Mandelbrot: Multifractal classification of painting's texture , 2013, Signal Process..

[8]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[9]  Joshua Gluckman,et al.  Higher Order Image Pyramids , 2006, ECCV.

[10]  Thomas Mensink,et al.  The Rijksmuseum Challenge: Museum-Centered Visual Recognition , 2014, ICMR.

[11]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Eric O. Postma,et al.  Learning scale-variant and scale-invariant features for deep image classification , 2016, Pattern Recognit..

[13]  Allan Pinkus,et al.  Multilayer Feedforward Networks with a Non-Polynomial Activation Function Can Approximate Any Function , 1991, Neural Networks.

[14]  Jiaxing Zhang,et al.  Scale-Invariant Convolutional Neural Networks , 2014, ArXiv.

[15]  Quoc V. Le,et al.  Tiled convolutional neural networks , 2010, NIPS.

[16]  C. Richard Johnson,et al.  Weave analysis of paintings on canvas from radiographs , 2013, Signal Process..

[17]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Eric O. Postma,et al.  Toward Discovery of the Artist's Style: Learning to recognize artists by their artworks , 2015, IEEE Signal Processing Magazine.

[19]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Eric O. Postma,et al.  Computer analysis of Van Gogh's complementary colours , 2007, Pattern Recognit. Lett..

[21]  Eric O. Postma,et al.  Texton-based analysis of paintings , 2010, Optical Engineering + Applications.

[22]  Tsuhan Chen,et al.  A framework of extracting multi-scale features using multiple convolutional neural networks , 2015, 2015 IEEE International Conference on Multimedia and Expo (ICME).

[23]  Shiming Xiang,et al.  Vehicle Detection in Satellite Images by Hybrid Deep Convolutional Neural Networks , 2014, IEEE Geoscience and Remote Sensing Letters.

[24]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.