A Local Search with a Surrogate Assisted Option for Instance Reduction

In data mining, instance reduction is a key data pre-processing step that simplifies and cleans raw data, by either selecting or creating new samples, before applying a learning algorithm. This usually yields to a complex large scale and computationally expensive optimisation problem which has been typically tackled by sophisticated population-based metaheuristics. Unlike the recent literature, in order to accomplish this target, this article proposes the use of a simple local search algorithm and its integration with an optional surrogate assisted model. This local search, in accordance with variable decomposition techniques for large scale problems, perturbs an n-dimensional vector along the directions identified by its design variables one by one.

[1]  Francisco Herrera,et al.  Instance reduction for one-class classification , 2018, Knowledge and Information Systems.

[2]  Bernhard Sendhoff,et al.  Generalizing Surrogate-Assisted Evolutionary Computation , 2010, IEEE Transactions on Evolutionary Computation.

[3]  Yaochu Jin,et al.  Surrogate-assisted evolutionary computation: Recent advances and future challenges , 2011, Swarm Evol. Comput..

[4]  Natalio Krasnogor,et al.  Towards Robust Memetic Algorithms , 2005 .

[5]  Fabio Caraffini,et al.  An analysis on separability for Memetic Computing automatic design , 2014, Inf. Sci..

[6]  Xiaodong Li,et al.  Cooperatively Coevolving Particle Swarms for Large Scale Optimization , 2012, IEEE Transactions on Evolutionary Computation.

[7]  Vasant Dhar,et al.  Data science and prediction , 2012, CACM.

[8]  Chun Chen,et al.  Multiple trajectory search for Large Scale Global Optimization , 2008, 2008 IEEE Congress on Evolutionary Computation (IEEE World Congress on Computational Intelligence).

[9]  Francisco Herrera,et al.  A Taxonomy and Experimental Study on Prototype Generation for Nearest Neighbor Classification , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[10]  J. D. Jobson,et al.  Multiple linear regression. , 1991 .

[11]  María José del Jesús,et al.  KEEL 3.0: An Open Source Software for Multi-Stage Analysis in Data Mining , 2017, Int. J. Comput. Intell. Syst..

[12]  Francisco Herrera,et al.  A combined MapReduce-windowing two-level parallel scheme for evolutionary prototype generation , 2014, 2014 IEEE Congress on Evolutionary Computation (CEC).

[13]  Bu-Sung Lee,et al.  Memetic algorithm using multi-surrogates for computationally expensive optimization problems , 2007, Soft Comput..

[14]  Yong Wang,et al.  Global and Local Surrogate-Assisted Differential Evolution for Expensive Constrained Optimization Problems With Inequality Constraints , 2019, IEEE Transactions on Cybernetics.

[15]  Sheng-Fuu Lin,et al.  A separability detection approach to cooperative particle swarm optimization , 2011, 2011 Seventh International Conference on Natural Computation.

[16]  Francisco Herrera,et al.  MRPR: A MapReduce solution for prototype reduction in big data classification , 2015, Neurocomputing.

[17]  Bertha A Hidalgo,et al.  Multivariate or multivariable regression? , 2013, American journal of public health.

[18]  Francisco Herrera,et al.  IPADE: Iterative Prototype Adjustment for Nearest Neighbor Classification , 2010, IEEE Transactions on Neural Networks.

[19]  Virginia Torczon,et al.  On the Convergence of Pattern Search Algorithms , 1997, SIAM J. Optim..

[20]  Daniele Loiacono,et al.  Speeding-Up Expensive Evaluations in High-Level Synthesis Using Solution Modeling and Fitness Inheritance , 2010 .

[21]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[22]  Raymond Ros,et al.  A Simple Modification in CMA-ES Achieving Linear Time and Space Complexity , 2008, PPSN.

[23]  Rommel G. Regis Surrogate-Assisted Particle Swarm with Local Search for Expensive Constrained Optimization , 2018, BIOMA.

[24]  Javier Pérez-Rodríguez,et al.  A Scalable Memetic Algorithm for Simultaneous Instance and Feature Selection , 2014, Evolutionary Computation.

[25]  Loris Nanni,et al.  Particle swarm optimization for prototype reduction , 2009, Neurocomputing.

[26]  Francisco Herrera,et al.  A memetic algorithm for evolutionary prototype selection: A scaling up approach , 2008, Pattern Recognit..

[27]  N. Salvatore,et al.  Surrogate assisted local search in PMSM drive design , 2008 .

[28]  Xin Yao,et al.  Voronoi-based Efficient Surrogate-assisted Evolutionary Algorithm for Very Expensive Problems , 2019, 2019 IEEE Congress on Evolutionary Computation (CEC).

[29]  Francisco Herrera,et al.  Using evolutionary algorithms as instance selection for data reduction in KDD: an experimental study , 2003, IEEE Trans. Evol. Comput..

[30]  Francisco Herrera,et al.  Differential evolution for optimizing the positioning of prototypes in nearest neighbor classification , 2011, Pattern Recognit..

[31]  Francisco Herrera,et al.  Prototype Selection for Nearest Neighbor Classification: Taxonomy and Empirical Study , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Kai-Yew Lum,et al.  Max-min surrogate-assisted evolutionary algorithm for robust design , 2006, IEEE Transactions on Evolutionary Computation.

[33]  Giovanni Iacca,et al.  Large Scale Problems in Practice: The Effect of Dimensionality on the Interaction Among Variables , 2017, EvoApplications.

[34]  Giovanni Iacca,et al.  Parallel memetic structures , 2013, Inf. Sci..

[35]  Dirk Sudholt,et al.  Memetic algorithms beat evolutionary algorithms on the class of hurdle problems , 2018, GECCO.

[36]  Ponnuthurai N. Suganthan,et al.  Self-adaptive differential evolution with multi-trajectory search for large-scale optimization , 2011, Soft Comput..