Scene classification using a new radial basis function classifier and integrated SIFT–LBP features

Scene classification is one of the most significant and challenging tasks in computer vision. This paper presents a new method for scene classification using bag of visual words and a particle swarm optimization (PSO)-based artificial neural network classifier. Contributions of this paper are introducing a novel feature integration method using scale invariant feature transform (SIFT) and local binary pattern (LBP) and a new framework for training radial basis function neural network, combining optimum steepest decent method with a specially designed PSO-based optimizer for center adjustment of radial basis function neural network. Our study shows that using LBP increases the performance of classification task compared to using SIFT only. In addition, our experiments on Proben1 dataset demonstrate improvements in classification performance (averagely about 6.04%) and convergence speed of the proposed radial basis function neural network. The proposed radial basis function neural network is then employed in scene classification task. Results are reported for classification of the Oliva and Torralba, Fei–Fei and Perona and Lazebnik et al. datasets. We compare the performance of the proposed classifier with a multi-way SVM classifier. Experimental results show the superiority of the proposed classifier over the state-of-the-art on the three datasets.

[1]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[2]  R. Cancelliere,et al.  A comparative analysis of neural network performances in astronomical imaging , 2003 .

[3]  Davar Giveki,et al.  Designing a New Radial Basis Function Neural Network by Harmony Search for Diabetes Diagnosis , 2019, Optical Memory and Neural Networks.

[4]  Gholam Ali Montazer,et al.  Scene Classification Using Multi-Resolution WAHOLB Features and Neural Network Classifier , 2017, Neural Processing Letters.

[5]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[6]  Jiwen Lu,et al.  PCANet: A Simple Deep Learning Baseline for Image Classification? , 2014, IEEE Transactions on Image Processing.

[7]  Reza Sabzevari,et al.  Three-phase strategy for the OSD learning method in RBF neural networks , 2009, Neurocomputing.

[8]  Haoqiang Fan,et al.  Approaching human level facial landmark localization by deep learning , 2016, Image Vis. Comput..

[9]  Song Bai,et al.  Deep learning representation using autoencoder for 3D shape retrieval , 2014, Proceedings 2014 IEEE International Conference on Security, Pattern Analysis, and Cybernetics (SPAC).

[10]  Xiaogang Wang,et al.  Deep Learning Strong Parts for Pedestrian Detection , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[11]  Shiliang Sun,et al.  A survey of multi-view machine learning , 2013, Neural Computing and Applications.

[12]  Hongliang Li,et al.  WaveLBP based hierarchical features for image classification , 2013, Pattern Recognit. Lett..

[13]  Dewen Hu,et al.  Scene classification using a multi-resolution bag-of-features model , 2013, Pattern Recognit..

[14]  Licheng Jiao,et al.  Feature integration of EODH and Color-SIFT: Application to image retrieval based on codebook , 2014, Signal Process. Image Commun..

[15]  Ioannis Pratikakis,et al.  Bag of spatio-visual words for context inference in scene classification , 2013, Pattern Recognit..

[16]  Dit-Yan Yeung,et al.  Learning a Deep Compact Image Representation for Visual Tracking , 2013, NIPS.

[17]  Dacheng Tao,et al.  Non-Local Auto-Encoder With Collaborative Stabilization for Image Restoration , 2016, IEEE Transactions on Image Processing.

[18]  Cor J. Veenman,et al.  Kernel Codebooks for Scene Categorization , 2008, ECCV.

[19]  Dacheng Tao,et al.  A Survey on Multi-view Learning , 2013, ArXiv.

[20]  Lutz Prechelt,et al.  A Set of Neural Network Benchmark Problems and Benchmarking Rules , 1994 .

[21]  Davar Giveki,et al.  A New Content Based Image Retrieval Model Based on Wavelet Transform , 2015 .

[22]  Cheng-Yuan Liou,et al.  Least-Mean-Square Training of Cluster-Weighted Modeling , 2007, ICANN.

[23]  Received January,et al.  A New SIFT-Based Image Descriptor Applicable for Content Based Image Retrieval , 2015 .

[24]  Pietro Perona,et al.  A Bayesian hierarchical model for learning natural scene categories , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[25]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  Reza Sabzevari,et al.  Improvement of learning algorithms for RBF neural networks in a helicopter sound identification system , 2007, Neurocomputing.

[27]  Gholam Ali Montazer,et al.  Content based image retrieval system using clustered scale invariant feature transforms , 2015 .

[28]  Yong Wang,et al.  Conditional Random Field for Natural Scene Categorization , 2007, BMVC.

[29]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[30]  Beng Chin Ooi,et al.  Effective deep learning-based multi-modal retrieval , 2015, The VLDB Journal.

[31]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[32]  Yuan Yan Tang,et al.  High-Order Distance-Based Multiview Stochastic Learning in Image Classification , 2014, IEEE Transactions on Cybernetics.

[33]  Xiyuan Chen Deformation Measurement of the Large Flexible Surface by Improved RBFNN Algorithm and BPNN Algorithm , 2007, ISNN.

[34]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[35]  ObjectPatchNet: Towards scalable and semantic image annotation and retrieval , 2014, Comput. Vis. Image Underst..

[36]  Shuyuan Yang,et al.  Sparse coding and classifier ensemble based multi-instance learning for image categorization , 2013, Signal Process..

[37]  R. Balasubramanian,et al.  Local maximum edge binary patterns: A new descriptor for image retrieval and object tracking , 2012, Signal Process..

[38]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[39]  N. H. C. Yung,et al.  Scene categorization via contextual visual words , 2010, Pattern Recognit..

[40]  Gholam Ali Montazer,et al.  A new image feature descriptor for content based image retrieval using scale invariant feature transform and local derivative pattern , 2017 .

[41]  Gholam Ali Montazer,et al.  Atanassov's intuitionistic fuzzy histon for robust moving object detection , 2017, Int. J. Approx. Reason..

[42]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[43]  Xiao Bai,et al.  Discriminative Features for Image Classification and Retrieval , 2011, 2011 Sixth International Conference on Image and Graphics.

[44]  Zhengzhi Wang,et al.  Building global image features for scene recognition , 2012, Pattern Recognit..

[45]  Xiao Bai,et al.  Discriminative Features for Image Classification and Retrieval , 2011, 2011 Sixth International Conference on Image and Graphics.

[46]  Song-Chun Zhu,et al.  Hierarchical Space Tiling for Scene Modeling , 2012, ACCV.

[47]  Silvio Savarese,et al.  Hierarchical classification of images by sparse approximation , 2013, Image Vis. Comput..

[48]  Hui Wei,et al.  Improvements in image categorization using codebook ensembles , 2011, Image Vis. Comput..

[49]  Gholam Ali Montazer,et al.  An improvement in RBF learning algorithm based on PSO for real time applications , 2013, Neurocomputing.

[50]  Qi Tian,et al.  Image classification using Harr-like transformation of local features with coding residuals , 2013, Signal Process..

[51]  Nelson H. C. Yung,et al.  Feature fusion within local region using localized maximum-margin learning for scene categorization , 2012, Pattern Recognit..

[52]  Davar Giveki,et al.  Erratum to: A New Neural Network Classifier Based on Atanassov’s Intuitionistic Fuzzy Set Theory , 2019, Optical Memory and Neural Networks.