DeepJet : Generic physics object based jet multiclass classification for LHC experiments

The detectors at the Large Hadron Collider at CERN reconstruct, among other objects, collimated sprays of particles, which are referred to as “jets”. An important task is to identify the type of the elementary particle that initiated the jet, i.e. whether it is a light quark, a heavy quark or a gluon, leading to a multiclass classification problem. We present results from a realistic simulation of one of the two multi-purpose detectors at the LHC, the Compact Muon Solenoid. The basic network architecture relies heavily on using convolutional layers on low-level physics objects, like individual particle objects, and uses much more information than previous algorithms in the literature. It stands out as the first proposal that can be applied to multiclass classification for all types of jet initiators as well as for jets of different widths. We demonstrate significant improvements by the new approach on the classification capabilities for several of the tested particle classes. In one specific case, and at high momentum, a decrease of nearly 90% in the rate of false positives is achieved for a constant true positive rate of 40%.

[1]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[2]  Kyunghyun Cho,et al.  QCD-aware recursive neural networks for jet physics , 2017, Journal of High Energy Physics.

[3]  Jürgen Schmidhuber,et al.  Flat Minima , 1997, Neural Computation.

[4]  C. Collaboration,et al.  Particle-flow reconstruction and global event description with the CMS detector , 2017, 1706.04965.

[5]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  J. Cogan,et al.  Jet-images: computer vision inspired techniques for jet tagging , 2014, 1407.5675.

[7]  Jian Sun,et al.  Identity Mappings in Deep Residual Networks , 2016, ECCV.

[8]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[9]  T. Tuuva,et al.  Search for direct production of supersymmetric partners of the top quark in the all-jets final state in proton-proton collisions at s=13$$ \sqrt{s}=13 $$ TeV , 2017, 1707.03316.

[10]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[11]  João Paulo Teixeira,et al.  The CMS experiment at the CERN LHC , 2008 .

[12]  Patrick T. Komiske,et al.  Deep learning in color: towards automated quark/gluon jet discrimination , 2016, Journal of High Energy Physics.

[13]  Zihao Jiang,et al.  Identification of Jets Containing b-Hadrons with Recurrent Neural Networks at the ATLAS Experiment , 2017 .

[14]  P. Baldi,et al.  Jet Substructure Classification in High-Energy Physics with Deep Neural Networks , 2016, 1603.09349.

[15]  Zheng Zhang,et al.  MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems , 2015, ArXiv.

[16]  G. Kasieczka,et al.  Deep-learning top taggers or the end of QCD? , 2017, 1701.08784.

[17]  P. Baldi,et al.  Jet flavor classification in high-energy physics with deep neural networks , 2016, 1607.08633.

[18]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[19]  Martín Abadi,et al.  TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.