Sampling Techniques and Distance Metrics in High Dimensional Continuous Landscape Analysis: Limitations and Improvements

In metaheuristic optimization, understanding the relationship between problems and algorithms is important but nontrivial. There has been a growing interest in the literature on techniques for analyzing problems and algorithm performance; however, the validity of the assumptions and implementation choices behind many techniques is often not closely examined. In this paper, we review some interesting theoretical properties regarding sampling techniques and distance metrics in continuous spaces. In particular, we examine the effect of using Euclidean distance in conjunction with uniform random sampling on the behavior of the Dispersion metric. We show that the current methodology employed for the estimation of dispersion has important flaws, and we propose and evaluate modifications to improve the methodology. The modifications are simple and do not add significant complexity or computational effort to the methodology.

[1]  Marcus Gallagher,et al.  Multi-layer Perceptron Error Surfaces: Visualization, Structure and Modelling , 2000 .

[2]  Christian L. Müller,et al.  Particle Swarm CMA Evolution Strategy for the optimization of multi-funnel landscapes , 2009, 2009 IEEE Congress on Evolutionary Computation.

[3]  L. Darrell Whitley,et al.  Comparing the Niches of CMA-ES, CHC and Pattern Search Using Diverse Benchmarks , 2006, PPSN.

[4]  Christian L. Müller,et al.  Global Characterization of the CEC 2005 Fitness Landscapes Using Fitness-Distance Analysis , 2011, EvoApplications.

[5]  Julian Francis Miller,et al.  Information Characteristics and the Structure of Landscapes , 2000, Evolutionary Computation.

[6]  Mario A. Muñoz,et al.  Landscape characterization of numerical optimization problems using biased scattered data , 2012, 2012 IEEE Congress on Evolutionary Computation.

[7]  Anne Auger,et al.  Real-Parameter Black-Box Optimization Benchmarking 2009: Noiseless Functions Definitions , 2009 .

[8]  C. Reeves,et al.  Properties of fitness functions and search landscapes , 2001 .

[9]  Marcus Gallagher,et al.  Length Scale for Characterising Continuous Optimization Problems , 2012, PPSN.

[10]  Mario A. Muñoz,et al.  A Meta-learning Prediction Model of Algorithm Performance for Continuous Optimization Problems , 2012, PPSN.

[11]  Charu C. Aggarwal,et al.  On the Surprising Behavior of Distance Metrics in High Dimensional Spaces , 2001, ICDT.

[12]  D. Steinberg,et al.  Computer experiments: a review , 2010 .

[13]  Heike Trautmann,et al.  Benchmarking Evolutionary Algorithms: Towards Exploratory Landscape Analysis , 2010, PPSN.

[14]  Jonathan Goldstein,et al.  When Is ''Nearest Neighbor'' Meaningful? , 1999, ICDT.

[15]  Michael Affenzeller,et al.  A Comprehensive Survey on Fitness Landscape Analysis , 2012, Recent Advances in Intelligent Engineering Systems.

[16]  L. Darrell Whitley,et al.  The dispersion metric and the CMA evolution strategy , 2006, GECCO.