Deep Learning in High-Energy Physics: Improving the Search for Exotic Particles

Collisions at high-energy particle colliders are a traditionally fruitful source of exotic particle discoveries. Finding these rare exotic particles requires solving difficult signal-versus-background classification problems, hence machine learning approaches are often used for this task. Standard approaches in the past have relied on ‘shallow’ machine learning models that have a limited capacity to learn complex non-linear functions of the inputs, and rely on a pain-staking search through manually constructed non-linear inputs. Progress on this problem has slowed, as a variety of techniques (neural networks, boosted decision trees, support vector machines) have shown equivalent performance. Recent advances in the field of deep learning, particularly with artificial neural networks, make it possible to learn more complex functions and better discriminate between signal and background classes. Using benchmark datasets, we show that deep learning methods need no manually constructed inputs and yet improve the AUC (Area Under the ROC Curve) classification metric by as much as 8% over the best current approaches. This is a large relative improvement and demonstrates that deep learning approaches can improve the power of collider searches for exotic particles.

[1]  A. Asuncion,et al.  UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[2]  S Amerio,et al.  Search for a two-Higgs-boson doublet using a simplified model in pp collisions at sqrt[s] = 1.96 TeV. , 2013, Physical review letters.

[3]  Nitish Srivastava,et al.  Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.

[4]  V. Lemaître,et al.  DELPHES, a framework for fast simulation of a generic collider experiment , 2009, 0903.2225.

[5]  Pierre Baldi,et al.  The dropout learning algorithm , 2014, Artif. Intell..

[6]  F. Maltoni,et al.  MadGraph 5: going beyond , 2011, 1106.0522.

[7]  M. Buckley,et al.  Super-Razor and Searches for Sleptons and Charginos at the LHC , 2013, 1310.4827.

[8]  F. Tegenfeldt,et al.  TMVA - Toolkit for multivariate data analysis , 2012 .

[9]  Hsin-Chia Cheng,et al.  Minimal kinematic constraints and m T2 , 2008, 0810.5178.

[10]  S. Mrenna,et al.  Pythia 6.3 physics and manual , 2003, hep-ph/0308153.

[11]  A. Barr,et al.  m(T2): The Truth behind the glamour , 2003, hep-ph/0304226.

[12]  S. Hochreiter Recurrent Neural Net Learning and Vanishing , 1998 .

[13]  C. Rogan Kinematical variables towards new dynamics at the LHC , 2010, 1006.2727.

[14]  J. T. Childers,et al.  Search for a multi-Higgs-boson cascade in W+W -bb̄ events with the ATLAS detector in pp collisions at √s = 8 TeV , 2014 .

[15]  Yoshua Bengio,et al.  Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.

[16]  The Cms Collaboration Observation of a new boson at a mass of 125 GeV with the CMS experiment at the LHC , 2012, 1207.7235.