SPANet: Generalized Permutationless Set Assignment for Particle Physics using Symmetry Preserving Attention

The creation of unstable heavy particles at the Large Hadron Collider is the most direct way to address some of the deepest open questions in physics. Collisions typically produce variable-size sets of observed particles, and inherent ambiguities complicate the assignment of these observed particles to the decay products of the heavy particles. Current strategies for tackling these challenges in the physics community ignore the physical symmetries of the decay products and consider all possible assignment permutations, so they do not scale to complex configurations. Attention-based deep learning methods for sequence modelling have achieved state-of-the-art performance in natural language processing, but they lack built-in mechanisms to deal with the unique symmetries found in physical set-assignment problems. We introduce a novel method for constructing symmetry-preserving attention networks which reflect the problem's natural invariances to efficiently find assignments without evaluating all permutations. This general approach is applicable to arbitrarily complex configurations and significantly outperforms current methods, improving reconstruction efficiency by 19% to 35% on typical benchmark problems while decreasing inference time by two to five orders of magnitude on the most complex events, making many important and previously intractable cases tractable. A full code repository containing a general library, the specific configuration used, and a complete dataset release is available at https://github.com/Alexanders101/SPANet
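
To make the scaling problem concrete, the sketch below counts how many assignments a brute-force permutation approach would enumerate, and how many of those are physically distinct once symmetries are taken into account. It is an illustration only and is not the SPANet method. The decay topology is an assumption chosen for the example (two identical parent particles, each decaying to one distinguished product and two interchangeable products), and the names `canonical` and `count_assignments` are hypothetical.

```python
from itertools import permutations

def canonical(assignment):
    """Map an ordered assignment to a symmetry-invariant representative.

    Assumed (hypothetical) topology: two identical parents, each with one
    distinguished product `b` and two interchangeable products `q`.
    `assignment` holds six distinct jet indices: (b1, q1a, q1b, b2, q2a, q2b).
    """
    b1, q1a, q1b, b2, q2a, q2b = assignment
    # Swapping the two interchangeable products within a parent changes nothing.
    parent1 = (b1, tuple(sorted((q1a, q1b))))
    parent2 = (b2, tuple(sorted((q2a, q2b))))
    # Swapping the two identical parents changes nothing either.
    return tuple(sorted((parent1, parent2)))

def count_assignments(n_jets):
    """Return (ordered assignments enumerated, symmetry-distinct assignments)."""
    ordered = 0
    distinct = set()
    for assignment in permutations(range(n_jets), 6):
        ordered += 1
        distinct.add(canonical(assignment))
    return ordered, len(distinct)

if __name__ == "__main__":
    for n in (6, 7, 8):
        ordered, distinct = count_assignments(n)
        print(f"{n} jets: {ordered} ordered assignments, {distinct} distinct")
```

Under these assumptions, every distinct assignment is visited eight times by the brute-force loop (a factor of two for each pair of interchangeable products and a factor of two for the identical parents), and the number of ordered assignments grows factorially with the number of observed jets.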
