Machine learning for event selection in high energy physics

The field of high energy physics aims to discover the underlying structure of matter by searching for and studying exotic particles, such as the top quark and Higgs boson, produced in collisions at modern accelerators. Since such accelerators are extraordinarily expensive, extracting maximal information from the resulting data is essential. However, most accelerator events do not produce particles of interest, so making effective measurements requires event selection, in which events producing particles of interest (signal) are separated from events producing other particles (background). This article studies the use of machine learning to aid event selection. First, we apply supervised learning methods, which have succeeded previously in similar tasks. However, they are suboptimal in this case because they assume that the selector with the highest classification accuracy will yield the best final analysis; this is not true in practice, as such analyses are more sensitive to some backgrounds than others. Second, we present a new approach that uses stochastic optimization techniques to directly search for selectors that maximize either the precision of top quark mass measurements or the sensitivity to the presence of the Higgs boson. Empirical results confirm that stochastically optimized selectors result in substantially better analyses. We also describe a case study in which the best selector is applied to real data from the Fermilab Tevatron accelerator, resulting in the most precise top quark mass measurement of this type to date. Hence, this new approach to event selection has already contributed to our knowledge of the top quark's mass and our understanding of the larger questions upon which it sheds light.

[1]  D. O. Whiteson,et al.  Support vector regression as a signal discriminator in high energy physics , 2003, Neurocomputing.

[2]  F. Gianfelici,et al.  Nearest-Neighbor Methods in Learning and Vision (Shakhnarovich, G. et al., Eds.; 2006) [Book review] , 2008 .

[3]  Alan D. Martin,et al.  Review of Particle Physics , 2010 .

[4]  Ben Taskar,et al.  Learning structured prediction models: a large margin approach , 2005, ICML.

[5]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[6]  Dap Hartmann,et al.  MACHINES THAT LEARN TO PLAY GAMES , 2002 .

[7]  E. S. Pearson,et al.  On the Problem of the Most Efficient Tests of Statistical Hypotheses , 1933 .

[8]  Risto Miikkulainen,et al.  Evolving Neural Networks through Augmenting Topologies , 2002, Evolutionary Computation.

[9]  Brian Fields,et al.  Big-Bang nucleosynthesis (2006 Particle Data Group mini-review) , 2006, astro-ph/0601514.

[10]  Delphi collaboration,et al.  Search for the Standard Model Higgs Boson at LEP , 2001, hep-ex/0107029.

[11]  G De Lorenzo,et al.  Cross-section-constrained top-quark mass measurement from dilepton events at the Tevatron. , 2007, Physical review letters.

[12]  Thomas Hofmann,et al.  Large Margin Methods for Structured and Interdependent Output Variables , 2005, J. Mach. Learn. Res..

[13]  K. Bloom,et al.  Measurement of the tt production cross section in pp collisions at √s = 1.96 TeV , 2008 .

[14]  Risto Miikkulainen,et al.  Real-time neuroevolution in the NERO video game , 2005, IEEE Transactions on Evolutionary Computation.

[15]  S. Glashow Partial Symmetries of Weak Interactions , 1961 .

[16]  M. Spiropulu,et al.  Measurement of the tt production cross section in pp collisions at sqrt[s]=1.96 TeV using dilepton events. , 2004, Physical review letters.

[17]  E. al.,et al.  Measurement of the cross section for tt̄ production in pp̄ collisions using the kinematics of lepton+jets events , 2005, hep-ex/0504053.

[18]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[19]  Risto Miikkulainen,et al.  Automatic feature selection in neuroevolution , 2005, GECCO '05.

[20]  V. Ramakrishnan,et al.  Measurement of the top-quark mass with dilepton events selected using neuroevolution at CDF. , 2008, Physical review letters.

[21]  A. Ribon,et al.  Observation of top quark production in $\bar{p}p$ collisions , 1995 .

[22]  M. Hashimoto,et al.  Top Mode Standard Model with Extra Dimensions , 2000, hep-ph/0010260.

[23]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[24]  J. A. Lozano,et al.  Estimation of Distribution Algorithms: A New Tool for Evolutionary Computation , 2001 .

[25]  P. W. Higgs Spontaneous Symmetry Breakdown without Massless Bosons , 1966 .

[26]  Risto Miikkulainen,et al.  Efficient Non-linear Control Through Neuroevolution , 2006, ECML.

[27]  R. Illingworth,et al.  Search for single top quark production at DØ using neural networks , 2001 .

[28]  M. Feindt,et al.  Top-quark mass measurement from dilepton events at CDF II. , 2005, Physical review letters.

[29]  E. al.,et al.  Measurement of the J/ψ meson and b-hadron production cross sections in pp̄ collisions at √s = 1960 GeV , 2004, hep-ex/0412071.

[30]  Pedro M. Domingos MetaCost: a general method for making classifiers cost-sensitive , 1999, KDD '99.

[31]  Pat Langley,et al.  Selection of Relevant Features and Examples in Machine Learning , 1997, Artif. Intell..

[32]  Charles Elkan,et al.  The Foundations of Cost-Sensitive Learning , 2001, IJCAI.

[33]  W. Porod,et al.  Impact of a precise top mass measurement , 2004, hep-ph/0409063.

[34]  K. Cranmer,et al.  PhysicsGP: A Genetic Programming approach to event selection , 2004, Comput. Phys. Commun..

[35]  Paul E. Utgoff,et al.  Feature construction for game playing , 2001 .

[36]  M. Tanabashi,et al.  Is the t Quark Responsible for the Mass of W and Z Bosons , 1989 .

[37]  A. Cho Aging Atom Smasher Runs All Out in Race for Most Coveted Particle , 2006, Science.

[38]  Trevor Darrell,et al.  Nearest-Neighbor Methods in Learning and Vision , 2008, IEEE Trans. Neural Networks.

[39]  E. S. Pearson,et al.  On the Problem of the Most Efficient Tests of Statistical Hypotheses , 1933 .

[40]  Massless Bosons,et al.  Spontaneous Symmetry Breakdown without Massless Bosons , 2011 .

[41]  Steven Weinberg,et al.  A Model of Leptons , 1967 .

[42]  The Aleph Collaboration,et al.  A Combination of Preliminary Electroweak Measurements and Constraints on the Standard Model , 2001, hep-ex/0612034.

[43]  X. Yao Evolving Artificial Neural Networks , 1999 .

[44]  Shimon Whiteson,et al.  Evolutionary Function Approximation for Reinforcement Learning , 2006, J. Mach. Learn. Res..

[45]  S. Heinemeyer,et al.  Physics Impact of a Precise Determination of the Top Quark Mass at an $e^{+}e^{-}$ Linear Collider , 2003 .

[46]  S. Mrenna,et al.  High-energy physics event generation with PYTHIA 6.1 , 2000, hep-ph/0010017.

[47]  Tom Elliott Fawcett Feature discovery for problem solving systems , 1993 .

[48]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[49]  Bauer,et al.  Observation of Top Quark Production in p-barp Collisions with the Collider Detector at Fermilab. , 1995, Physical review letters.

[50]  Butler,et al.  Observation of the Top Quark. , 1995, Physical review letters.

[51]  Pedro M. Domingos,et al.  On the Optimality of the Simple Bayesian Classifier under Zero-One Loss , 1997, Machine Learning.

[52]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[53]  Shimon Whiteson,et al.  Stochastic Optimization for Collision Selection in High Energy Physics , 2006, AAAI.