A framework for selection and fusion of pattern classifiers in multimedia recognition

The frequent growth of visual data, either by countless monitoring video cameras wherever we go or the popularization of mobile devices that allow each person to create and edit their own images and videos have contributed enormously to the so-called ''big-data revolution''. This shear amount of visual data gives rise to a Pandora box of new visual classification problems never imagined before. Image and video classification tasks have been inserted in different and complex applications and the use of machine learning-based solutions has become the most popular approach for several applications. Notwithstanding, there is no silver bullet that solves all the problems, i.e., it is not possible to characterize all images of different domains with the same description method nor is it possible to use the same learning method to achieve good results in any kind of application. In this work, we aim at proposing a framework for classifier selection and fusion. Our method seeks to combine image characterization and learning methods by means of a meta-learning approach responsible for assessing which methods contribute more towards the solution of a given problem. The framework uses a strategy of classifier selection which pinpoints the less correlated, yet effective, classifiers through a series of diversity measures analysis. The experiments show that the proposed approach achieves comparable results to well-known algorithms from the literature on four different applications but using less learning and description methods as well as not incurring in the curse of dimensionality and normalization problems common to some fusion techniques. Furthermore, our approach is able to achieve effective classification results using very reduced training sets. The proposed method is also amenable to continuous learning and flexible enough for implementation in highly-parallel architectures.

[1]  Ricardo da Silva Torres,et al.  RECOD at ImageCLEF 2011: Medical Modality Classification using Genetic Programming , 2011, CLEF.

[2]  Lior Rokach,et al.  Ensemble-based classifiers , 2010, Artificial Intelligence Review.

[3]  Ludmila I. Kuncheva,et al.  Measures of Diversity in Classifier Ensembles and Their Relationship with the Ensemble Accuracy , 2003, Machine Learning.

[4]  Sargur N. Srihari,et al.  Decision Combination in Multiple Classifier Systems , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Austin Troy,et al.  Object-based land cover classification of shaded areas in high spatial resolution imagery of urban areas: A comparison study , 2009 .

[6]  M. Kendall A NEW MEASURE OF RANK CORRELATION , 1938 .

[7]  Dale J. Prediger,et al.  Coefficient Kappa: Some Uses, Misuses, and Alternatives , 1981 .

[8]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[9]  Jing Huang,et al.  Image indexing using color correlograms , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10]  Lawrence O'Gorman,et al.  Innovations in fingerprint capture devices , 2003, Pattern Recognit..

[11]  Sylvie Philipp-Foliguet,et al.  Interactive Multiscale Classification of High-Resolution Remote Sensing Images , 2013, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[12]  João Paulo Papa,et al.  A novel algorithm for feature selection using Harmony Search and its application for non-technical losses detection , 2011, Comput. Electr. Eng..

[13]  Ming Yang,et al.  Large-scale image classification: Fast feature extraction and SVM training , 2011, CVPR 2011.

[14]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[15]  Johannes R. Sveinsson,et al.  Parallel consensual neural networks , 1997, IEEE Trans. Neural Networks.

[16]  João Paulo Papa,et al.  Automatic Segmentation and Classification of Human Intestinal Parasites From Microscopy Images , 2013, IEEE Transactions on Biomedical Engineering.

[17]  João Paulo Papa,et al.  How Far do We Get Using Machine Learning Black-Boxes? , 2012, Int. J. Pattern Recognit. Artif. Intell..

[18]  ByoungChul Ko,et al.  Cell image classification based on ensemble features and random forest , 2011 .

[19]  Krista A. Ehinger,et al.  SUN database: Large-scale scene recognition from abbey to zoo , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[20]  Kagan Tumer,et al.  Classifier ensembles: Select real-world applications , 2008, Inf. Fusion.

[21]  Paolo Frasconi,et al.  New results on error correcting output codes of kernel machines , 2004, IEEE Transactions on Neural Networks.

[22]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[23]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[24]  Anderson Rocha,et al.  Automatic fusion of region-based classifiers for coffee crop recognition , 2012, 2012 IEEE International Geoscience and Remote Sensing Symposium.

[25]  Quan Liu,et al.  An Orientation Independent Texture Descriptor for Image Retrieval , 2007, 2007 International Conference on Communications, Circuits and Systems.

[26]  Jian Li,et al.  Reducing the Overfitting of Adaboost by Controlling its Data Distribution Skewness , 2006, Int. J. Pattern Recognit. Artif. Intell..

[27]  Hervé Le Men,et al.  Scale-Sets Image Analysis , 2005, International Journal of Computer Vision.

[28]  Mario A. Nascimento,et al.  A compact and efficient image retrieval approach based on border/interior pixel classification , 2002, CIKM '02.

[29]  Arun Ross,et al.  Handbook of Multibiometrics , 2006, The Kluwer international series on biometrics.

[30]  Neucimar J. Leite,et al.  Wavelet-based fingerprint image retrieval , 2009 .

[31]  Jefersson Alex dos Santos,et al.  Evaluating the Potential of Texture and Color Descriptors for Remote Sensing Image Retrieval and Classification , 2010, VISAPP.

[32]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[33]  Eduardo F. Nakamura,et al.  Information fusion for wireless sensor networks: Methods, models, and classifications , 2007, CSUR.

[34]  Nan Yu,et al.  Introduce randomness into AdaBoost for robust performance on noisy data , 2010, 2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery.

[35]  Zhenkui Ma,et al.  Tau coefficients for accuracy assessment of classification of remote sensing data , 1995 .

[36]  L. Cooper,et al.  When Networks Disagree: Ensemble Methods for Hybrid Neural Networks , 1992 .

[37]  Neucimar J. Leite,et al.  Wavelet-based Feature Extraction for Fingerprint Image Retrieval , 2007 .

[38]  Robert P. W. Duin,et al.  Open Issues in Pattern Recognition , 2005, CORES.

[39]  Ramin Zabih,et al.  Comparing images using color coherence vectors , 1997, MULTIMEDIA '96.

[40]  Michael J. Swain,et al.  Color indexing , 1991, International Journal of Computer Vision.

[41]  Hans-Jörg Schek,et al.  A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces , 1998, VLDB.

[42]  Fei-Fei Li,et al.  What Does Classifying More Than 10, 000 Image Categories Tell Us? , 2010, ECCV.

[43]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[44]  Whoi-Yul Kim,et al.  Reducing overfitting of AdaBoost by clustering-based pruning of hard examples , 2013, ICUIMC '13.

[45]  Ricardo da Silva Torres,et al.  Exploiting contextual spaces for image re-ranking and rank aggregation , 2011, ICMR.

[46]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[47]  Pietro Perona,et al.  Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[48]  Enrico Blanzieri,et al.  Fast and Scalable Local Kernel Machines , 2010, J. Mach. Learn. Res..

[49]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[50]  Shengli Wu,et al.  Classification by Cluster Analysis: A New Meta-Learning Based Approach , 2011, MCS.

[51]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[52]  Osmar R. Zaïane,et al.  Application of Data Mining Techniques for Medical Image Classification , 2001, MDM/KDD.

[53]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[54]  Anderson Rocha,et al.  Automatic Tracking of Indoor Soccer Players Using Videos from Multiple Cameras , 2012, 2012 25th SIBGRAPI Conference on Graphics, Patterns and Images.

[55]  ChoSung-Bae,et al.  Fingerprint classification using one-vs-all support vector machines dynamically ordered with naïve Bayes classifiers , 2008 .

[56]  Jacques Wainer,et al.  Automatic fruit and vegetable classification from images , 2010 .

[57]  Ricardo da Silva Torres,et al.  Image Re-ranking and Rank Aggregation Based on Similarity of Ranked Lists , 2011, CAIP.

[58]  Michael Unser,et al.  Sum and Difference Histograms for Texture Classification , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[59]  Bo Tao,et al.  Texture Recognition and Image Retrieval Using Gradient Indexing , 2000, J. Vis. Commun. Image Represent..

[60]  João Paulo Papa,et al.  Improving Image Classification through Descriptor Combination , 2012, 2012 25th SIBGRAPI Conference on Graphics, Patterns and Images.

[61]  Robert E. Schapire,et al.  A Brief Introduction to Boosting , 1999, IJCAI.

[62]  Pier Luca Lanzi,et al.  Classifier Systems , 2010, Encyclopedia of Machine Learning.

[63]  Alexander H. Waibel,et al.  The Meta-Pi Network: Building Distributed Knowledge Representations for Robust Multisource Pattern Recognition , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[64]  Sylvie Philipp-Foliguet,et al.  Descriptor correlation analysis for remote sensing image multi-scale classification , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[65]  Yong Yang,et al.  Evaluating Feature Combination in Object Classification , 2011, ISVC.

[66]  Anderson Rocha,et al.  Automatic Classifier Fusion for Produce Recognition , 2012, 2012 25th SIBGRAPI Conference on Graphics, Patterns and Images.

[67]  Sabina Leonelli,et al.  The Philosophy of Data , 2016 .

[68]  Sung-Bae Cho,et al.  Fingerprint classification using one-vs-all support vector machines dynamically ordered with naive Bayes classifiers , 2008, Pattern Recognit..

[69]  Bernard Zenko,et al.  Is Combining Classifiers with Stacking Better than Selecting the Best One? , 2004, Machine Learning.

[70]  Sebastian Nowozin,et al.  On feature combination for multiclass object classification , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[71]  Marc Sebban,et al.  Discriminative feature fusion for image classification , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[72]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[73]  Jamshid Shanbehzadeh,et al.  Image retrieval based on shape similarity by edge orientation autocorrelogram , 2003, Pattern Recognit..

[74]  Matthijs Douze,et al.  Large-scale image classification with trace-norm regularization , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[75]  Ricardo da Silva Torres,et al.  Comparative study of global color and texture descriptors for web image retrieval , 2012, J. Vis. Commun. Image Represent..