Towards exploratory landscape analysis for large-scale optimization: a dimensionality reduction framework

Although exploratory landscape analysis (ELA) has shown its effectiveness in various applications, most previous studies focused only on low- and moderate-dimensional problems. Thus, little is known about the scalability of the ELA approach for large-scale optimization. In this context, first, this paper analyzes the computational cost of features in the flacco package. Our results reveal that two important feature classes (ela_level and ela_meta) cannot be applied to large-scale optimization due to their high computational cost. To improve the scalability of the ELA approach, this paper proposes a dimensionality reduction framework that computes features in a reduced lower-dimensional space than the original solution space. We demonstrate that the proposed framework can drastically reduce the computation time of ela_level and ela_meta for large dimensions. In addition, the proposed framework can make the cell-mapping feature classes scalable for large-scale optimization. Our results also show that features computed by the proposed framework are beneficial for predicting the high-level properties of the 24 large-scale BBOB functions.

[1]  Tome Eftimov,et al.  Linear Matrix Factorization Embeddings for Single-objective Optimization Landscapes , 2020, 2020 IEEE Symposium Series on Computational Intelligence (SSCI).

[2]  Eric O. Postma,et al.  Dimensionality Reduction: A Comparative Review , 2008 .

[3]  Anne Auger,et al.  Benchmarking large-scale continuous optimizers: The bbob-largescale testbed, a COCO software guide and beyond , 2020, Appl. Soft Comput..

[4]  Saman K. Halgamuge,et al.  Exploratory Landscape Analysis of Continuous Space Optimization Problems Using Information Content , 2015, IEEE Transactions on Evolutionary Computation.

[5]  Anne Auger,et al.  COCO: a platform for comparing continuous optimizers in a black-box setting , 2016, Optim. Methods Softw..

[6]  Phil Husbands,et al.  Fitness Landscapes and Evolvability , 2002, Evolutionary Computation.

[7]  Carola Doerr,et al.  Landscape-aware fixed-budget performance regression and algorithm selection for modular CMA-ES variants , 2020, GECCO.

[8]  Mike Preuss,et al.  Improved Topological Niching for Real-Valued Global Optimization , 2012, EvoApplications.

[9]  Tomoharu Nagao,et al.  Bag of local landscape features for fitness landscape analysis , 2016, Soft Comput..

[10]  Benjamin Doerr,et al.  Exploratory Landscape Analysis is Strongly Sensitive to the Sampling Strategy , 2020, PPSN.

[11]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[12]  Qingfu Zhang,et al.  A Gaussian Process Surrogate Model Assisted Evolutionary Algorithm for Medium Scale Expensive Optimization Problems , 2014, IEEE Transactions on Evolutionary Computation.

[13]  Sébastien Vérel,et al.  New features for continuous exploratory landscape analysis based on the SOO tree , 2019, FOGA '19.

[14]  Heike Trautmann,et al.  Benchmarking Evolutionary Algorithms: Towards Exploratory Landscape Analysis , 2010, PPSN.

[15]  Bernd Bischl,et al.  Cell Mapping Techniques for Exploratory Landscape Analysis , 2014 .

[16]  Vincenzo Cutello,et al.  Parallel Problem Solving from Nature - PPSN XII , 2012, Lecture Notes in Computer Science.

[17]  Abdullah Al Mamun,et al.  Evolutionary big optimization (BigOpt) of signals , 2015, 2015 IEEE Congress on Evolutionary Computation (CEC).

[18]  Anne Auger,et al.  Real-Parameter Black-Box Optimization Benchmarking 2009: Noiseless Functions Definitions , 2009 .

[19]  Nando de Freitas,et al.  Taking the Human Out of the Loop: A Review of Bayesian Optimization , 2016, Proceedings of the IEEE.

[20]  Michael Affenzeller,et al.  A Comprehensive Survey on Fitness Landscape Analysis , 2012, Recent Advances in Intelligent Engineering Systems.

[21]  Tome Eftimov,et al.  Towards Feature-Based Performance Regression Using Trajectory Data , 2021, EvoApplications.

[22]  Marc Schoenauer,et al.  Feature Based Algorithm Configuration: A Case Study with Differential Evolution , 2016, PPSN.

[23]  Bernd Bischl,et al.  Exploratory landscape analysis , 2011, GECCO '11.

[24]  Andries Petrus Engelbrecht,et al.  A survey of techniques for characterising fitness landscapes and some possible ways forward , 2013, Inf. Sci..

[25]  Heike Trautmann,et al.  Detecting Funnel Structures by Means of Exploratory Landscape Analysis , 2015, GECCO.

[26]  Pascal Kerschke,et al.  Comprehensive Feature-Based Landscape Analysis of Continuous and Constrained Optimization Problems Using the R-Package Flacco , 2017, Studies in Classification, Data Analysis, and Knowledge Organization.

[27]  L. Darrell Whitley,et al.  The dispersion metric and the CMA evolution strategy , 2006, GECCO.

[28]  Gabriela Ochoa,et al.  Visualising the global structure of search landscapes: genetic improvement as a case study , 2018, Genetic Programming and Evolvable Machines.

[29]  Marc Schoenauer,et al.  Per instance algorithm configuration of CMA-ES with limited budget , 2017, GECCO.

[30]  Marcus Gallagher,et al.  Sampling Techniques and Distance Metrics in High Dimensional Continuous Landscape Analysis: Limitations and Improvements , 2014, IEEE Transactions on Evolutionary Computation.

[31]  Marcus Gallagher,et al.  Length Scale for Characterising Continuous Optimization Problems , 2012, PPSN.

[32]  Heike Trautmann,et al.  Automated Algorithm Selection on Continuous Black-Box Problems by Combining Exploratory Landscape Analysis and Machine Learning , 2017, Evolutionary Computation.

[33]  Andrew M. Sutton,et al.  The Impact of Global Structure on Search , 2008, PPSN.

[34]  Ramana V. Grandhi,et al.  Improved Distributed Hypercube Sampling , 2002 .

[35]  Nando de Freitas,et al.  Bayesian Optimization in a Billion Dimensions via Random Embeddings , 2013, J. Artif. Intell. Res..

[36]  Heike Trautmann,et al.  Low-Budget Exploratory Landscape Analysis on Multiple Peaks Models , 2016, GECCO.

[37]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[38]  Tome Eftimov,et al.  Understanding the problem space in single-objective numerical optimization using exploratory landscape analysis , 2020, Appl. Soft Comput..

[39]  Christian L. Müller,et al.  Global Characterization of the CEC 2005 Fitness Landscapes Using Fitness-Distance Analysis , 2011, EvoApplications.

[40]  Bernd Bischl,et al.  Algorithm selection based on exploratory landscape analysis and cost-sensitive learning , 2012, GECCO '12.

[41]  Adam Prügel-Bennett,et al.  An Analysis of the Fitness Landscape of Travelling Salesman Problem , 2016, Evolutionary Computation.

[42]  Hao Wang,et al.  High Dimensional Bayesian Optimization Assisted by Principal Component Analysis , 2020, PPSN.

[43]  Kate Smith-Miles,et al.  Effects of function translation and dimensionality reduction on landscape analysis , 2015, 2015 IEEE Congress on Evolutionary Computation (CEC).

[44]  Jonathon Shlens,et al.  A Tutorial on Principal Component Analysis , 2014, ArXiv.

[45]  Deli Zhao,et al.  Scalable Gaussian Process Regression Using Deep Neural Networks , 2015, IJCAI.

[46]  Tobias Glasmachers,et al.  Challenges in High-dimensional Reinforcement Learning with Evolution Strategies , 2018, PPSN.

[47]  Bernd Bischl,et al.  Analyzing the BBOB Results by Means of Benchmarking Concepts , 2015, Evolutionary Computation.

[48]  Bernhard Sendhoff,et al.  Exploring Dimensionality Reduction Techniques for Efficient Surrogate-Assisted optimization , 2020, 2020 IEEE Symposium Series on Computational Intelligence (SSCI).

[49]  Marc Schoenauer,et al.  Surrogate Assisted Feature Computation for Continuous Problems , 2016, LION.

[50]  Terry Jones,et al.  Fitness Distance Correlation as a Measure of Problem Difficulty for Genetic Algorithms , 1995, ICGA.

[51]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..