Evolutionary algorithms for hyperparameter optimization in machine learning for application in high energy physics

The analysis of vast amounts of data constitutes a major challenge in modern high energy physics experiments. Machine learning (ML) methods, typically trained on simulated data, are often employed to facilitate this task. Several choices need to be made by the user when training the ML algorithm. In addition to deciding which ML algorithm to use and choosing suitable observables as inputs, users typically need to choose among a plethora of algorithm-specific parameters. We refer to parameters that need to be chosen by the user as hyperparameters. These are to be distinguished from parameters that the ML algorithm learns autonomously during the training, without intervention by the user. The choice of hyperparameters is conventionally done manually by the user and often has a significant impact on the performance of the ML algorithm. In this paper, we explore two evolutionary algorithms: particle swarm optimization (PSO) and genetic algorithm (GA), for the purposes of performing the choice of optimal hyperparameter values in an autonomous manner. Both of these algorithms will be tested on different datasets and compared to alternative methods.

[1]  John H. Holland,et al.  Distributed genetic algorithms for function optimization , 1989 .

[2]  Kenneth A. De Jong,et al.  An Analysis of Multi-Point Crossover , 1990, FOGA.

[3]  Dan Boneh,et al.  On genetic algorithms , 1995, COLT '95.

[4]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[5]  C. A. Murthy,et al.  Genetic Algorithm with Elitist Model and Its Convergence , 1996, Int. J. Pattern Recognit. Artif. Intell..

[6]  Anil K. Jain,et al.  Artificial Neural Networks: A Tutorial , 1996, Computer.

[7]  Jiaping Yang,et al.  Structural Optimization by Genetic Algorithms with Tournament Selection , 1997 .

[8]  Russell C. Eberhart,et al.  Comparison between Genetic Algorithms and Particle Swarm Optimization , 1998, Evolutionary Programming.

[9]  Russell C. Eberhart,et al.  Parameter Selection in Particle Swarm Optimization , 1998, Evolutionary Programming.

[10]  Lothar Thiele,et al.  Comparison of Multiobjective Evolutionary Algorithms: Empirical Results , 2000, Evolutionary Computation.

[11]  Dan Boneh,et al.  Where Genetic Algorithms Excel , 2001, Evolutionary Computation.

[12]  Goldberg,et al.  Genetic algorithms , 1993, Robust Control Systems with Genetic Algorithms.

[13]  Kenneth A. De Jong,et al.  A formal analysis of the role of multi-point crossover in genetic algorithms , 1992, Annals of Mathematics and Artificial Intelligence.

[14]  B. Roe,et al.  Boosted decision trees as an alternative to artificial neural networks for particle identification , 2004, physics/0408124.

[15]  Andries Petrus Engelbrecht,et al.  A study of particle swarm optimization particle trajectories , 2006, Inf. Sci..

[16]  Yun-Wei Shang,et al.  A Note on the Extended Rosenbrock Function , 2006, Evolutionary Computation.

[17]  Yun Shang,et al.  A Note on the Extended Rosenbrock Function , 2006 .

[18]  Shimon Whiteson,et al.  Machine learning for event selection in high energy physics , 2009, Eng. Appl. Artif. Intell..

[19]  Schalk Kok,et al.  Locating and Characterizing the Stationary Points of the Extended Rosenbrock Function , 2009, Evolutionary Computation.

[20]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[21]  John Geraghty,et al.  Genetic Algorithm Performance with Different Selection Strategies in Solving TSP , 2011 .

[22]  Balázs Kégl,et al.  The Higgs boson machine learning challenge , 2014, HEPML@NIPS.

[23]  Bart De Moor,et al.  Hyperparameter Search in Machine Learning , 2015, ArXiv.

[24]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[25]  Marcin Andrychowicz,et al.  Learning to learn by gradient descent by gradient descent , 2016, NIPS.

[26]  Sebastian Ruder,et al.  An overview of gradient descent optimization algorithms , 2016, Vestnik komp'iuternykh i informatsionnykh tekhnologii.

[27]  Gradient descent , 2018, Radiopaedia.org.

[28]  Eli Upfal,et al.  Machine Learning in High Energy Physics Community White Paper , 2018, Journal of Physics: Conference Series.