论文信息 - Scene Classification Using Transfer Learning

Scene Classification Using Transfer Learning

Categorization of scene images is considered as a challenging prospect due to the fact that different classes of scene images often share similar image statistics. This chapter presents a transfer learning based approach for scene classification. A pre-trained Convolutional Neural Network (CNN) is used as a feature extractor for the images. The pre-trained network along with classifiers such as Support Vector Machines (SVM) or Multi Layer Perceptron (MLP) are used to classify the images. Also, the effect of single plane images such as, RGB2Gray, SVD Decolorized and Modified SVD decolorized images are analysed based on classification accuracy, class-wise precision, recall, F1-score and equal error rate (EER). The classification experiment for SVM was also done using a dimensionality reduction technique known as principal component analysis (PCA) on the feature vector. By comparing the results of models trained on RGB images with those grayscale images, the difference in the results is very small. These grayscale images were capable of retaining the required shape and texture information from the original RGB images and were also sufficient to categorize the classes of the given scene images.

V. Sowmya | K. P. Soman | D. Govind | Nikhil Damodaran

[1] K. P. Soman,et al. Significance of perceptually relevant image decolorization for scene classification , 2017, J. Electronic Imaging.

[2] Ivan Laptev,et al. Learning and Transferring Mid-level Image Representations Using Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[3] Qi Tian,et al. Good Practice in CNN Feature Transfer , 2016, ArXiv.

[4] V. Sowmya,et al. Significance of incorporating chrominance information for effective color-to-grayscale image conversion , 2016, Signal, Image and Video Processing.

[5] Yoshua. Bengio,et al. Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[6] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[7] Kai Zeng,et al. Objective Quality Assessment for Color-to-Gray Image Conversion , 2015, IEEE Transactions on Image Processing.

[8] Victor S. Lempitsky,et al. Neural Codes for Image Retrieval , 2014, ECCV.

[9] Jorma Laaksonen,et al. Techniques for Still Image Scene Classification and Object Detection , 2006, ICANN.

[10] V. Sowmya,et al. Dependency of Various Color and Intensity Planes on CNN Based Image Classification , 2017, SIRS.

[11] Trevor Darrell,et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[12] Nuno Vasconcelos,et al. Scene classification with low-dimensional semantic spaces and weak supervision , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[13] Antonio Torralba,et al. Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[14] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[15] Jorge Cadima,et al. Principal component analysis: a review and recent developments , 2016, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[16] Stefan Carlsson,et al. CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[17] Jian Sun,et al. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2015, IEEE Trans. Pattern Anal. Mach. Intell..

[18] Lawrence D. Jackel,et al. Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[19] Bolei Zhou,et al. Places: An Image Database for Deep Scene Understanding , 2016, ArXiv.

[20] Atsuto Maki,et al. Visual Instance Retrieval with Deep Convolutional Networks , 2014, ICLR.

[21] Trevor Darrell,et al. DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition , 2013, ICML.