rEDA: reverse estimation of distribution algorithm for classification

Estimation of distribution algorithm EDA is a branch of evolutionary algorithms. EDA replaces recombination and mutation operators with the estimation of probabilistic distribution of selected individuals. However, these selected individuals only cover part of the problem to be optimised, which causes that the algorithm may easily fall into a local optimum. In this paper, we propose a variation of EDA, reverse estimation of distribution algorithm rEDA, from the perspective of reverse process. Different from the EDA process that individuals are firstly given and then the estimation of models starts, rEDA is to firstly give initial models and then regulate these models relying on sampling from the models and optimisation objective. We employ rEDA to classification in data mining area and propose a novel classification algorithm based on rEDA. The proposed rEDA algorithm and rEDA-based classification algorithm are analysed theoretically. The empirical results show our proposed algorithm outperforms some classical classification algorithms in accuracy.

[1]  David G. Kleinbaum,et al.  Logistic Regression. A Self- Learning Text , 1994 .

[2]  Shengyao Wang,et al.  An effective estimation of distribution algorithm for the flexible job-shop scheduling problem with fuzzy processing time , 2013 .

[3]  Xin-She Yang,et al.  Metaheuristic algorithms for inverse problems , 2013 .

[4]  Ling Wang,et al.  A Pareto-based estimation of distribution algorithm for the multi-objective flexible job-shop scheduling problem , 2013 .

[5]  Bernhard Sendhoff,et al.  Voronoi-based estimation of distribution algorithm for multi-objective optimization , 2004, Proceedings of the 2004 Congress on Evolutionary Computation (IEEE Cat. No.04TH8753).

[6]  Alexander Mendiburu,et al.  A Distance-Based Ranking Model Estimation of Distribution Algorithm for the Flowshop Scheduling Problem , 2014, IEEE Transactions on Evolutionary Computation.

[7]  Michael Andrew Christie,et al.  Estimation of Distribution Algorithms Applied to History Matching , 2013 .

[8]  Irina Rish,et al.  An empirical study of the naive Bayes classifier , 2001 .

[9]  Shingo Mabu,et al.  A Novel Graph-Based Estimation of the Distribution Algorithm and its Extension Using Reinforcement Learning , 2014, IEEE Transactions on Evolutionary Computation.

[10]  Uwe Aickelin,et al.  An estimation of distribution algorithm with intelligent local search for rule-based nurse rostering , 2007, J. Oper. Res. Soc..

[11]  Zhihua Cui,et al.  EDA-USL: unsupervised clustering algorithm based on estimation of distribution algorithm , 2011, Int. J. Wirel. Mob. Comput..

[12]  Mo-Yuen Chow,et al.  Performance Evaluation of an EDA-Based Large-Scale Plug-In Hybrid Electric Vehicle Charging Algorithm , 2012, IEEE Transactions on Smart Grid.

[13]  Jian-cong Fan,et al.  SSL-EDA: semi-supervised learning algorithm based on estimation of distribution algorithm , 2012 .

[14]  Qiang Xu,et al.  A novel classification learning framework based on estimation of distribution algorithms , 2012, Int. J. Comput. Sci. Math..

[15]  Hisashi Handa EDA-RL: estimation of distribution algorithms for reinforcement learning problems , 2009, GECCO '09.

[16]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[17]  Pedro Larrañaga,et al.  Research topics in discrete estimation of distribution algorithms based on factorizations , 2009, Memetic Comput..

[18]  Fan Jiancong,et al.  A Hybrid Recommendation Model Based on Estimation of Distribution Algorithms , 2014 .

[19]  Sayan Mukherjee,et al.  Choosing Multiple Parameters for Support Vector Machines , 2002, Machine Learning.

[20]  Qingfu Zhang,et al.  This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION 1 RM-MEDA: A Regularity Model-Based Multiobjective Estimation of , 2022 .

[21]  Yvan Saeys,et al.  Fast feature selection using a simple estimation of distribution algorithm: a case study on splice site prediction , 2003, ECCB.

[22]  I. J. Myung,et al.  The Importance of Complexity in Model Selection. , 2000, Journal of mathematical psychology.

[23]  Jiancong Fan,et al.  Predicting Yeast Protein Localization Sites by a New Clustering Algorithm Based on Weighted Feature Ensemble , 2014 .

[24]  Shih-Hsin Chen,et al.  Addressing the advantages of using ensemble probabilistic models in Estimation of Distribution Algorithms for scheduling problems , 2013 .

[25]  Chen Fang,et al.  An effective estimation of distribution algorithm for the multi-mode resource-constrained project scheduling problem , 2012, Comput. Oper. Res..

[26]  David E. Goldberg,et al.  A Survey of Optimization by Building and Using Probabilistic Models , 2002, Comput. Optim. Appl..

[27]  Cong Jin,et al.  Software reliability prediction model based on support vector regression with improved estimation of distribution algorithms , 2014, Appl. Soft Comput..

[28]  Yaxin Bi,et al.  KNN Model-Based Approach in Classification , 2003, OTM.

[29]  Peter A. N. Bosman,et al.  Matching inductive search bias and problem structure in continuous Estimation-of-Distribution Algorithms , 2008, Eur. J. Oper. Res..

[30]  Feng Xu,et al.  Decomposition of Multi-Objective Evolutionary Algorithm based on Estimation of Distribution , 2014 .

[31]  Ken Aho,et al.  Model selection for ecologists: the worldviews of AIC and BIC. , 2014, Ecology.