Enhanced Higgs boson to τ(+)τ(-) search with deep learning.

The Higgs boson is thought to provide the interaction that imparts mass to the fundamental fermions, but while measurements at the Large Hadron Collider (LHC) are consistent with this hypothesis, current analysis techniques lack the statistical power to cross the traditional 5σ significance barrier without more data. Deep learning techniques have the potential to increase the statistical power of this analysis by automatically learning complex, high-level data representations. In this work, deep neural networks are used to detect the decay of the Higgs boson to a pair of tau leptons. A Bayesian optimization algorithm is used to tune the network architecture and training algorithm hyperparameters, resulting in a deep network of eight nonlinear processing layers that improves upon the performance of shallow classifiers even without the use of features specifically engineered by physicists for this application. The improvement in discovery significance is equivalent to an increase in the accumulated data set of 25%.

[1]  Razvan Pascanu,et al.  Pylearn2: a machine learning research library , 2013, ArXiv.

[2]  I. Ial,et al.  Nature Communications , 2010, Nature Cell Biology.

[3]  R. Stephenson A and V , 1962, The British journal of ophthalmology.

[4]  V. Lemaître,et al.  DELPHES, a framework for fast simulation of a generic collider experiment , 2009, 0903.2225.

[5]  Nitish Srivastava,et al.  Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.

[6]  Razvan Pascanu,et al.  On the Number of Linear Regions of Deep Neural Networks , 2014, NIPS.

[7]  Shay B. Cohen,et al.  Advances in Neural Information Processing Systems 25 , 2012, NIPS 2012.

[8]  D. Signorini,et al.  Neural networks , 1995, The Lancet.

[9]  E. LESTER SMITH,et al.  AND OTHERS , 2005 .

[10]  F. Maltoni,et al.  MadGraph 5: going beyond , 2011, 1106.0522.

[11]  Tod S. Levitt,et al.  Uncertainty in artificial intelligence , 1988 .

[12]  Michael I. Jordan,et al.  Advances in Neural Information Processing Systems 30 , 1995 .

[13]  D. Geiger,et al.  Uncertainty in artificial intelligence : proceedings of the Thirteenth Conference (1997) : August 1-3, 1997, Brown University, Providence, Rhode Island, USA , 1997 .

[14]  K. Cranmer,et al.  Asymptotic formulae for likelihood-based tests of new physics , 2010, 1007.1727.

[15]  A. Elagin,et al.  A new mass reconstruction technique for resonances decaying to ττ , 2011 .

[16]  Nils J. Nilsson,et al.  Artificial Intelligence , 1974, IFIP Congress.

[17]  S. Hochreiter Recurrent Neural Net Learning and Vanishing , 1998 .