论文信息 - Image Classification Based on Mid-Level Feature Fusion

Image Classification Based on Mid-Level Feature Fusion

Image classification and analysis aim to classify the images according to the nature of visual contents. Due to recent increase in the number of multi-media contents, the classification of images is considered as a challenging and complex problem. A list of low-level and mid-level features is available in the literature that aims to represent the images in the form of feature vectors to be used as an input for classification-based problems. The main problem with these features is the domain and application specific nature and the use of features in one domain may not show the same result when applied in a different domain. The feature fusion-based approaches aim to enhance the performance of image classification models as single feature-based approach is not robust to handle image transformations such as translation, rotation, scaling, etc. In this research, we aim to investigate the image classification-based performance of mid-level features. We applied Bag of Visual Words (BoVW) model with image classification framework. This proposed late feature fusion result in a higher classification accuracy. The features are retrieved from the images by using two well-known feature extraction techniques that are Scale-Invariant Feature Transform (SIFT) and Histogram of Oriented Gradients (HOG). The experiments are performed while using two data sets that are Corel-1k and Corel-1.5k. The result shows that the classification accuracy increases when SIFT and HOG are used in fusion and the proposed results outperforms the standard BoVw model when SIFT and HOG is used separately.

[1] Andrew Zisserman,et al. Efficient additive kernels via explicit feature maps , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[2] Frédéric Jurie,et al. Sampling Strategies for Bag-of-Features Image Classification , 2006, ECCV.

[3] Zahid Mehmood,et al. Image retrieval by addition of spatial information based on histograms of triangular regions , 2016, Comput. Electr. Eng..

[4] James Ze Wang,et al. Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[5] Alessia Amelio,et al. A new axiomatic methodology for the image similarity , 2019, Appl. Soft Comput..

[6] James Ze Wang,et al. Real-Time Computerized Annotation of Pictures , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7] Bushra Zafar,et al. Data Augmentation-Assisted Makeup-Invariant Face Recognition , 2018, Mathematical Problems in Engineering.

[8] Rehan Ashraf,et al. A Novel Discriminating and Relative Global Spatial Image Representation with Applications in CBIR , 2018, Applied Sciences.

[9] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[10] Andrew Zisserman,et al. Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[11] Matthijs Douze,et al. Deep Clustering for Unsupervised Learning of Visual Features , 2018, ECCV.

[12] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.

[13] Md. Monirul Islam,et al. A review on automatic image annotation techniques , 2012, Pattern Recognit..

[14] Anzar Mahmood,et al. Deeply Learned Pose Invariant Image Analysis with Applications in 3D Face Recognition , 2019, Mathematical Problems in Engineering.

[15] Rehan Ashraf,et al. Content based image retrieval system by using HSV color histogram, discrete wavelet transform and edge histogram descriptor , 2018, 2018 International Conference on Computing, Mathematics and Engineering Technologies (iCoMET).

[16] Alessia Amelio,et al. Classification Methods in Image Analysis with a Special Focus on Medical Analytics , 2018, Machine Learning Paradigms.

[17] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[18] Muhammad Tariq Mahmood,et al. Modeling global geometric spatial information for rotation invariant classification of satellite images , 2019, PloS one.

[19] Rehan Ashraf,et al. Content-Based Image Retrieval Based on Late Fusion of Binary and Local Descriptors , 2017, ArXiv.

[20] Christopher Hunt,et al. Notes on the OpenSURF Library , 2009 .

[21] Pierre Vandergheynst,et al. FREAK: Fast Retina Keypoint , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[22] Li Fei-Fei,et al. Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[23] Jameel Ahmed,et al. Content-Based Image Retrieval and Feature Extraction: A Comprehensive Review , 2019, Mathematical Problems in Engineering.

[24] Savvas A. Chatzichristofis,et al. A Novel Image Retrieval Based on Visual Words Integration of SIFT and SURF , 2016, PloS one.