A Parallel Adaboost-Backpropagation Neural Network for Massive Image Dataset Classification

Image classification uses computers to simulate human understanding and cognition of images by automatically categorizing images. This study proposes a faster image classification approach that parallelizes the traditional Adaboost-Backpropagation (BP) neural network using the MapReduce parallel programming model. First, we construct a strong classifier by assembling the outputs of 15 BP neural networks (which are individually regarded as weak classifiers) based on the Adaboost algorithm. Second, we design Map and Reduce tasks for both the parallel Adaboost-BP neural network and the feature extraction algorithm. Finally, we establish an automated classification model by building a Hadoop cluster. We use the Pascal VOC2007 and Caltech256 datasets to train and test the classification model. The results are superior to those obtained using traditional Adaboost-BP neural network or parallel BP neural network approaches. Our approach increased the average classification accuracy rate by approximately 14.5% and 26.0% compared to the traditional Adaboost-BP neural network and parallel BP neural network, respectively. Furthermore, the proposed approach requires less computation time and scales very well as evaluated by speedup, sizeup and scaleup. The proposed approach may provide a foundation for automated large-scale image classification and demonstrates practical value.

[1]  Min Han,et al.  Remote sensing image classification based on neural network ensemble algorithm , 2012, Neurocomputing.

[2]  Jean-Marc Odobez,et al.  A Thousand Words in a Scene , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  K. Gunavathi,et al.  Lung cancer classification using neural networks for CT images , 2014, Comput. Methods Programs Biomed..

[4]  Sungyoung Lee,et al.  MRPack: Multi-Algorithm Execution Using Compute-Intensive Approach in MapReduce , 2015, PloS one.

[5]  Guo Li,et al.  Retrieving Image Resource Technique Based on Bayes Semantic Classification and Visual Feature Extraction , 2013 .

[6]  R. Lienhart,et al.  Continuous visual vocabulary modelsfor pLSA-based scene recognition , 2008, CIVR '08.

[7]  Zhaohui Xue,et al.  Harmonic Analysis for Hyperspectral Image Classification Integrated With PSO Optimized SVM , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[8]  Junjie Chen,et al.  An Adaboost-Backpropagation Neural Network for Automated Image Sentiment Classification , 2014, TheScientificWorldJournal.

[9]  Robert E. Schapire,et al.  Theoretical Views of Boosting , 1999, EuroCOLT.

[10]  Christoforos E. Kozyrakis,et al.  Evaluating MapReduce for Multi-core and Multiprocessor Systems , 2007, 2007 IEEE 13th International Symposium on High Performance Computer Architecture.

[11]  Liang Liang,et al.  [Hyperspectral remote sensing image classification based on ICA and SVM algorithm]. , 2010, Guang pu xue yu guang pu fen xi = Guang pu.

[12]  William J. Emery,et al.  SVM Active Learning Approach for Image Classification Using Spatial Information , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[13]  Yan Li,et al.  Research on supervised classification of fully polarimetric SAR image using BP neural network trained by PSO , 2010, 2010 8th World Congress on Intelligent Control and Automation.

[14]  Kiminori Matsuzaki,et al.  Parallel Tree Accumulations on MapReduce , 2015, International Journal of Parallel Programming.

[15]  Dou Peng,et al.  Research and application of Object-oriented remote sensing image classification based on decision tree , 2013 .

[16]  Zhou Jie A Segmentation Method for Stratum Image Based on FCM and Random Walks , 2013 .

[17]  Hao Wu,et al.  An effective feature selection method for hyperspectral image classification based on genetic algorithm and support vector machine , 2011, Knowl. Based Syst..

[18]  Kęstutis Dučinskas,et al.  Application of Bayes linear discriminant functions in image classification , 2012, Pattern Recognit. Lett..

[19]  Senén Barro,et al.  Do we need hundreds of classifiers to solve real world classification problems? , 2014, J. Mach. Learn. Res..

[20]  Christos Doulkeridis,et al.  A survey of large-scale analytical query processing in MapReduce , 2013, The VLDB Journal.

[21]  De Xu,et al.  Transductive Multi-Instance Multi-Label learning algorithm with application to automatic image annotation , 2010, Expert Syst. Appl..

[22]  Konstantinos Topouzelis,et al.  Oil spill feature selection and classification using decision tree forest on SAR image data , 2012 .

[23]  Seyong Lee,et al.  MapReduce with communication overlap (MaRCO) , 2013, J. Parallel Distributed Comput..

[24]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[25]  Rajiv Ranjan,et al.  G-Hadoop: MapReduce across distributed data centers for data-intensive computing , 2013, Future Gener. Comput. Syst..

[26]  Xiaodong Yue,et al.  Parallel attribute reduction algorithms using MapReduce , 2014, Inf. Sci..

[27]  Bo Cheng,et al.  An Unsupervised Classification Method of Remote Sensing Images Based on Ant Colony Optimization Algorithm , 2010, ADMA.

[28]  Haisheng Song,et al.  Classification of ETM+ Remote Sensing Image Based on Hybrid Algorithm of Genetic Algorithm and Back Propagation Neural Network , 2013 .

[29]  Rui Zhang,et al.  The bank risk forewarning model of BP neural network based on the clound computing , 2012, 2012 8th International Conference on Computing and Networking Technology (INC, ICCIS and ICMIC).

[30]  Oleksandr Makeyev,et al.  Neural network with ensembles , 2010, The 2010 International Joint Conference on Neural Networks (IJCNN).

[31]  Sherif Sakr,et al.  The family of mapreduce and large-scale data processing systems , 2013, CSUR.

[32]  Anil K. Jain,et al.  Image classification for content-based indexing , 2001, IEEE Trans. Image Process..

[33]  Fuzhen Zhuang,et al.  Parallel extreme learning machine for regression based on MapReduce , 2013, Neurocomputing.

[34]  Dacheng Tao,et al.  A survey on scene image classi cation , 2015 .

[35]  Jesús M. González-Barahona,et al.  The evolution of the laws of software evolution , 2013, ACM Comput. Surv..