Other Ensemble Approaches

This chapter describes several new fields, beside the feature selection preprocessing step (the theme of this book), in which ensembles have been successfully used. First, in Sect. 7.1, we introduce a very brief review of the different application fields in which ensembles have been applied, together with basic levels that are used to produce different ensemble designs, and a sample taxonomy. Then, in Sect. 7.2 basic ideas in ensemble classification design, one of the very first machine learning areas in which the idea of ensembles was applied, are stated. As there are many interesting and reference books in ensemble classification, we focus on describing the latest ideas in classification ensembles that address problems such as classification, stream data, missing data and imbalance data. Afterwards, the first attempts for applying the ensemble paradigm to the relatively new field of quantification are described in Sect. 7.3. In Sect. 7.4 we move to describing ensembles for clustering, another area in which ensembles have been increasingly popular. In Sect. 7.5 an attempt on ensembles for discretization is described and, finally, Sect. 7.6 summarizes and discusses the contents of this chapter.

[1]  Yoav Freund,et al.  Boosting a weak learning algorithm by majority , 1995, COLT '90.

[2]  Ricardo Vilalta,et al.  A Perspective View and Survey of Meta-Learning , 2002, Artificial Intelligence Review.

[3]  Mohamed Limam,et al.  CONSENSUS FUNCTIONS FOR CLUSTER ENSEMBLES , 2012, Appl. Artif. Intell..

[4]  Bernhard Schölkopf,et al.  Support Vector Method for Novelty Detection , 1999, NIPS.

[5]  Giuliano Armano,et al.  Building forests of local trees , 2018, Pattern Recognit..

[6]  Yang Wang,et al.  Cost-sensitive boosting for classification of imbalanced data , 2007, Pattern Recognit..

[7]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[8]  Sandrine Dudoit,et al.  Bagging to Improve the Accuracy of A Clustering Procedure , 2003, Bioinform..

[9]  Bartosz Krawczyk,et al.  Selecting locally specialised classifiers for one-class classification ensembles , 2017, Pattern Analysis and Applications.

[10]  Chris H. Q. Ding,et al.  Solving Consensus and Semi-supervised Clustering Problems Using Nonnegative Matrix Factorization , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[11]  Geoff Holmes,et al.  Having a Blast: Meta-Learning and Heterogeneous Ensembles for Data Streams , 2015, 2015 IEEE International Conference on Data Mining.

[12]  Francisco Herrera,et al.  EUSBoost: Enhancing ensembles for highly imbalanced data-sets by evolutionary undersampling , 2013, Pattern Recognit..

[13]  Enrique F. Castillo,et al.  Distributed One-Class Support Vector Machine , 2015, Int. J. Neural Syst..

[14]  Xin Yao,et al.  Diversity analysis on imbalanced data sets by using ensemble models , 2009, 2009 IEEE Symposium on Computational Intelligence and Data Mining.

[15]  Jerzy Stefanowski,et al.  Combining block-based and online methods in learning ensembles from concept drifting data streams , 2014, Inf. Sci..

[16]  Joydeep Ghosh,et al.  Cluster ensembles , 2011, Data Clustering: Algorithms and Applications.

[17]  Jean Paul Barddal,et al.  A survey on feature drift adaptation: Definition, benchmark, challenges and future directions , 2017, J. Syst. Softw..

[18]  Verónica Bolón-Canedo,et al.  Preprocessing in High Dimensional Datasets , 2018 .

[19]  Hamido Fujita,et al.  Incremental fuzzy cluster ensemble learning based on rough set theory , 2017, Knowl. Based Syst..

[20]  João Gama,et al.  Ensemble learning for data stream analysis: A survey , 2017, Inf. Fusion.

[21]  Krzysztof J. Cios,et al.  Review of ensembles of multi-label classifiers: Models, experimental study and prospects , 2018, Inf. Fusion.

[22]  Petia Radeva,et al.  Approximate polytope ensemble for one-class classification , 2014, Pattern Recognit..

[23]  Zhou Zimu,et al.  RSSIからCSIへ:チャネルレスポンスによるインドア・ローカリゼーション , 2013 .

[24]  Geoff Holmes,et al.  Leveraging Bagging for Evolving Data Streams , 2010, ECML/PKDD.

[25]  Jugurta R. Montalvão Filho,et al.  Clustering ensembles and space discretization - A new regard toward diversity and consensus , 2010, Pattern Recognit. Lett..

[26]  Thierry Bouwmans,et al.  Superpixel-based online wagging one-class ensemble for feature selection in foreground/background separation , 2017, Pattern Recognit. Lett..

[27]  Szymon Wilk,et al.  Integrating Selective Pre-processing of Imbalanced Data with Ivotes Ensemble , 2010, RSCTC.

[28]  Gerhard Tröster,et al.  Using ensemble classifier systems for handling missing data in emotion recognition from physiology: One step towards a practical system , 2009, 2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops.

[29]  Jill P. Mesirov,et al.  Consensus Clustering: A Resampling-Based Method for Class Discovery and Visualization of Gene Expression Microarray Data , 2003, Machine Learning.

[30]  Piotr Duda,et al.  How to adjust an ensemble size in stream data mining? , 2017, Inf. Sci..

[31]  Jean Paul Barddal,et al.  A Survey on Ensemble Learning for Data Stream Classification , 2017, ACM Comput. Surv..

[32]  Latifur Khan,et al.  Incremental Ensemble Classifier Addressing Non-stationary Fast Data Streams , 2014, 2014 IEEE International Conference on Data Mining Workshop.

[33]  Hui-lan Luo,et al.  Combining Multiple Clusterings using Information Theory based Genetic Algorithm , 2006, 2006 International Conference on Computational Intelligence and Security.

[34]  Joung Woo Ryu,et al.  Efficiently Maintaining the Performance of an Ensemble Classifier in Streaming Data , 2012, ICHIT.

[35]  Bhekisipho Twala,et al.  Ensemble missing data techniques for software effort prediction , 2010, Intell. Data Anal..

[36]  A. Mukhopadhyay,et al.  Clustering Ensemble: A Multiobjective Genetic Algorithm based Approach , 2013 .

[37]  Emilio Corchado,et al.  A survey of multiple classifier systems as hybrid systems , 2014, Inf. Fusion.

[38]  Yuchou Chang,et al.  Unsupervised feature selection using clustering ensembles and population based incremental learning algorithm , 2008, Pattern Recognit..

[39]  Yuxing Peng,et al.  A subspace ensemble framework for classification with high dimensional missing data , 2016, Multidimensional Systems and Signal Processing.

[40]  Joan Claudi Socoró,et al.  Feature diversity in cluster ensembles for robust document clustering , 2006, SIGIR '06.

[41]  Jerzy Stefanowski,et al.  Ensemble Diversity in Evolving Data Streams , 2016, DS.

[42]  Francisco Herrera,et al.  A Review on Ensembles for the Class Imbalance Problem: Bagging-, Boosting-, and Hybrid-Based Approaches , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[43]  Thomas G. Dietterich An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization , 2000, Machine Learning.

[44]  Verónica Bolón-Canedo,et al.  Data discretization: taxonomy and big data challenge , 2016, WIREs Data Mining Knowl. Discov..

[45]  Xin Yao,et al.  Diversity creation methods: a survey and categorisation , 2004, Inf. Fusion.

[46]  Mohamed Medhat Gaber,et al.  Knowledge discovery from data streams , 2009, IDA 2009.

[47]  Tossapon Boongoen,et al.  An Enhanced Univariate Discretization Based on Cluster Ensembles , 2016 .

[48]  Hamed R. Bonab,et al.  Less Is More: A Comprehensive Framework for the Number of Components of Ensemble Classifiers , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[49]  Ana L. N. Fred,et al.  Data clustering using evidence accumulation , 2002, Object recognition supported by user interaction for service robots.

[50]  George Forman,et al.  Quantifying counts and costs via classification , 2008, Data Mining and Knowledge Discovery.

[51]  Isabelle Herlin,et al.  Quantification of uncertainties from ensembles of simulations , 2016 .

[52]  Nikunj C. Oza,et al.  Online Ensemble Learning , 2000, AAAI/IAAI.

[53]  João Gama,et al.  A survey on concept drift adaptation , 2014, ACM Comput. Surv..

[54]  Shehroz S. Khan,et al.  One-class classification: taxonomy of study and review of techniques , 2013, The Knowledge Engineering Review.

[55]  Suresh N. Mali,et al.  Classifier Ensemble Design for Imbalanced Data Classification: A Hybrid Approach☆ , 2016 .

[56]  Lior Rokach,et al.  Ensemble-based classifiers , 2010, Artificial Intelligence Review.

[57]  Zhong Liu,et al.  A Novel Ensemble Method for Imbalanced Data Learning: Bagging of Extrapolation-SMOTE SVM , 2017, Comput. Intell. Neurosci..

[58]  Foster J. Provost,et al.  Handling Missing Values when Applying Classification Models , 2007, J. Mach. Learn. Res..

[59]  Manuel Graña,et al.  Guest Editorial: Hybrid intelligent fusion systems , 2014, Inf. Fusion.

[60]  T. Moon The expectation-maximization algorithm , 1996, IEEE Signal Process. Mag..

[61]  Daoqiang Zhang,et al.  Weighted Spectral Cluster Ensemble , 2015, 2015 IEEE International Conference on Data Mining.

[62]  B.V. Dasarathy,et al.  A composite classifier system design: Concepts and methodology , 1979, Proceedings of the IEEE.

[63]  J. Schafer,et al.  Missing data: our view of the state of the art. , 2002, Psychological methods.

[64]  Yi-Ning Quan,et al.  Modular ensembles for one-class classification based on density analysis , 2016, Neurocomputing.

[65]  Loris Nanni,et al.  A classifier ensemble approach for the missing feature problem , 2012, Artif. Intell. Medicine.

[66]  Gavin Brown,et al.  Learn++.MF: A random subspace approach for the missing feature problem , 2010, Pattern Recognit..

[67]  Francky Catthoor,et al.  Classification of Resilience Techniques Against Functional Errors at Higher Abstraction Layers of Digital Systems , 2017, ACM Comput. Surv..

[68]  María José del Jesús,et al.  KEEL 3.0: An Open Source Software for Multi-Stage Analysis in Data Mining , 2017, Int. J. Comput. Intell. Syst..

[69]  Yiwen Zhang,et al.  A selective neural network ensemble classification for incomplete data , 2016, International Journal of Machine Learning and Cybernetics.

[70]  George D. C. Cavalcanti,et al.  Dynamic classifier selection: Recent advances and perspectives , 2018, Inf. Fusion.

[71]  Robi Polikar,et al.  Incremental Learning of Concept Drift in Nonstationary Environments , 2011, IEEE Transactions on Neural Networks.

[72]  Hareton K. N. Leung,et al.  Incremental Semi-Supervised Clustering Ensemble for High Dimensional Data Clustering , 2016, IEEE Trans. Knowl. Data Eng..

[73]  Daniel Hernández-Lobato,et al.  How large should ensembles of classifiers be? , 2013, Pattern Recognit..

[74]  Monidipa Das,et al.  A Deep-Learning-Based Forecasting Ensemble to Predict Missing Data for Remote Sensing Analysis , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[75]  José Ramón Quevedo,et al.  Using ensembles for problems with characterizable changes in data distribution: A case study on quantification , 2017, Inf. Fusion.

[76]  Zhong Yin,et al.  Recognition of emotions using multimodal physiological signals and an ensemble deep learning model , 2017, Comput. Methods Programs Biomed..

[77]  Joshua Zhexue Huang,et al.  Incremental density-based ensemble clustering over evolving data streams , 2016, Neurocomputing.

[78]  Usama M. Fayyad,et al.  Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.

[79]  Chih-Fong Tsai,et al.  Clustering-based undersampling in class-imbalanced data , 2017, Inf. Sci..

[80]  Monireh Abdoos,et al.  A New Efficient Approach in Clustering Ensembles , 2007, IDEAL.

[81]  Yong Zhong,et al.  Anomaly Detection from Distributed Flight Record Data for Aircraft Health Management , 2010, 2010 International Conference on Computational and Information Sciences.

[82]  Jun Wu,et al.  A deep learning-based multi-model ensemble method for cancer prediction , 2018, Comput. Methods Programs Biomed..

[83]  Cesare Furlanello,et al.  Deep learning for automatic stereotypical motor movement detection using wearable sensors in autism spectrum disorders , 2017, Signal Process..

[84]  Arindam Banerjee,et al.  Bayesian cluster ensembles , 2011, Stat. Anal. Data Min..

[85]  Anil K. Jain,et al.  Clustering ensembles: models of consensus and weak partitions , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[86]  Xin Yao,et al.  Resampling-Based Ensemble Methods for Online Class Imbalance Learning , 2015, IEEE Transactions on Knowledge and Data Engineering.

[87]  Juan José del Coz,et al.  Quantification-oriented learning based on reliable classifiers , 2015, Pattern Recognit..

[88]  John W. Tukey,et al.  Exploratory data analysis , 1977, Addison-Wesley series in behavioral science : quantitative methods.

[89]  Wenhu Tang,et al.  Consensus clustering algorithms for Asset Management in power systems , 2015, 2015 5th International Conference on Electric Utility Deregulation and Restructuring and Power Technologies (DRPT).

[90]  Zhe Li,et al.  Adaptive Ensemble Undersampling-Boost: A novel learning framework for imbalanced data , 2017, J. Syst. Softw..

[91]  Ludmila I. Kuncheva,et al.  Using diversity in cluster ensembles , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[92]  Joydeep Ghosh,et al.  Cluster Ensembles --- A Knowledge Reuse Framework for Combining Multiple Partitions , 2002, J. Mach. Learn. Res..

[93]  Ge Yu,et al.  Parallel ensemble of online sequential extreme learning machine based on MapReduce , 2016, Neurocomputing.

[94]  Bhekisipho Twala,et al.  Ensemble imputation methods for missing software engineering data , 2005, 11th IEEE International Software Metrics Symposium (METRICS'05).

[95]  Feng Duan,et al.  Recognizing the Gradual Changes in sEMG Characteristics Based on Incremental Learning of Wavelet Neural Network Ensemble , 2017, IEEE Transactions on Industrial Electronics.

[96]  H. Kuhn The Hungarian method for the assignment problem , 1955 .

[97]  Miin-Shen Yang,et al.  Evaluation measures for cluster ensembles based on a fuzzy generalized Rand index , 2017, Appl. Soft Comput..

[98]  Lalita Udpa,et al.  Ensembles of novelty detection classifiers for structural health monitoring using guided waves , 2018 .

[99]  Carlotta Domeniconi,et al.  Weighted cluster ensembles: Methods and analysis , 2009, TKDD.

[100]  Huan Liu,et al.  Discretization: An Enabling Technique , 2002, Data Mining and Knowledge Discovery.

[101]  Hamidah Ibrahim,et al.  A review: accuracy optimization in clustering ensembles using genetic algorithms , 2011, Artificial Intelligence Review.

[102]  Ludmila I. Kuncheva,et al.  Evaluation of Stability of k-Means Cluster Ensembles with Respect to Random Initialization , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[103]  Edwin Lughofer,et al.  Editorial on the special issue “Hybrid and ensemble techniques in soft computing: recent advances and emerging trends” , 2015, Soft Comput..

[104]  Giorgio Valentini,et al.  Applications of Supervised and Unsupervised Ensemble Methods , 2009, Applications of Supervised and Unsupervised Ensemble Methods.

[105]  Luiz Eduardo Soares de Oliveira,et al.  A framework for dynamic classifier selection oriented by the classification problem difficulty , 2018, Pattern Recognit..

[106]  Giorgio Valentini,et al.  Ensemble methods : a review , 2012 .

[107]  Amanda J. C. Sharkey,et al.  Types of Multinet System , 2002, Multiple Classifier Systems.

[108]  Heitor Murilo Gomes,et al.  SAE: Social Adaptive Ensemble classifier for data streams , 2013, 2013 IEEE Symposium on Computational Intelligence and Data Mining (CIDM).

[109]  Raafat S. Elfouly,et al.  NOVEL ENSEMBLE TECHNIQUES FOR REGRESSION WITH MISSING DATA , 2009 .

[110]  Jesús S. Aguilar-Ruiz,et al.  Knowledge discovery from data streams , 2009, Intell. Data Anal..

[111]  Amir F. Atiya,et al.  Forward and Backward Forecasting Ensembles for the Estimation of Time Series Missing Data , 2014, ANNPR.

[112]  Dongqing Xie,et al.  Unsupervised evaluation of cluster ensemble solutions , 2015, 2015 Seventh International Conference on Advanced Computational Intelligence (ICACI).

[113]  Amparo Alonso-Betanzos,et al.  One-Class Convex Hull-Based Algorithm for Classification in Distributed Environments , 2020, IEEE Transactions on Systems, Man, and Cybernetics: Systems.