Reporting results in High Energy Physics publications: A manifesto

The complexity of collider data analyses has dramatically increased from early colliders to the CERN LHC. Reconstruction of the collision products in the particle detectors has reached a point that requires dedicated publications documenting the techniques, and periodic retuning of the algorithms themselves. Analysis methods evolved to account for the increased complexity of the combination of particles required in each collision event (final states) and for the need of squeezing every last bit of sensitivity from the data; physicists often seek to fully reconstruct the final state, a process that is mostly relatively easy at lepton colliders but sometimes exceedingly difficult at hadron colliders to the point of requiring sometimes using advanced statistical techniques such as machine learning. The need for keeping the publications documenting results to a reasonable size implies a greater level of compression or even omission of information with respect to publications from twenty years ago. The need for compression should however not prevent sharing a reasonable amount of information that is essential to understanding a given analysis. Infrastructures like Rivet or HepData have been developed to host additional material, but physicists in the experimental Collaborations often still send an insufficient amount of material to these databases. In this manuscript I advocate for an increase in the information shared by the Collaborations, and try to define a minimum standard for acceptable level of information when reporting the results of statistical procedures in High Energy Physics publications.

[1]  Lukas Heinrich,et al.  HEPData: a repository for high energy physics data , 2017, ArXiv.

[2]  P. Clifford,et al.  How to combine correlated estimates of a single physical quantity , 1988 .

[3]  J. Winter,et al.  Event generation with , 2009 .

[4]  Michael J. Baker,et al.  Reinterpretation of LHC Results for New Physics: Status and Recommendations after Run 2 , 2020 .

[5]  J. C. Ward,et al.  Weak and electromagnetic interactions , 1959 .

[6]  V. M. Ghete,et al.  Identification of heavy-flavour jets with the CMS detector in pp collisions at 13 TeV , 2017 .

[7]  John Skilling,et al.  Data analysis : a Bayesian tutorial , 1996 .

[8]  Steven Weinberg,et al.  A Model of Leptons , 1967 .

[9]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[10]  Jeff Gill,et al.  What are Bayesian Methods , 2008 .

[11]  R. Cousins,et al.  Lectures on Statistics in Theory: Prelude to Statistics in Practice. , 2018, 1807.05996.

[12]  A. Read Modified frequentist analysis of search results (The CL(s) method) , 2000 .

[13]  P. Gras,et al.  Les Houches 2015: Physics at TeV colliders - new physics working group report , 2016, 1605.02684.

[14]  J. Caudron,et al.  Search for Higgs boson pair production in the γγbb¯$$ \upgamma \upgamma b\overline{b} $$ final state with 13 TeV pp collision data collected by the ATLAS experiment , 2018, Journal of High Energy Physics.

[15]  V. M. Ghete,et al.  Measurement of the $${\mathrm {t}\overline{\mathrm {t}}}$$tt¯ production cross section, the top quark mass, and the strong coupling constant using dilepton events in pp collisions at $$\sqrt{s}=13\,\text {Te}\text {V} $$s=13Te , 2018, The European physical journal. C, Particles and fields.

[16]  V. M. Ghete,et al.  Search for heavy Majorana neutrinos in same-sign dilepton channels in proton-proton collisions at s=13$$ \sqrt{s}=13 $$ TeV , 2019 .

[17]  Mossolov,et al.  Search for a charged Higgs boson decaying into top and bottom quarks in events with electrons or muons in proton-proton collisions at √s = 13 TeV , 2019, 1908.09206.

[18]  S. Glashow Partial Symmetries of Weak Interactions , 1961 .

[19]  E. Conte,et al.  Toward a public analysis database for LHC new physics searches using MADANALYSIS 5 , 2014, 1407.3278.

[20]  G. Aad,et al.  Evidence for the production of three massive vector bosons with the ATLAS detector , 2019 .

[21]  B. Nachman,et al.  Extending the search for new resonances with machine learning , 2019, Physical Review D.

[22]  A. E. Hoerl,et al.  Ridge regression: biased estimation for nonorthogonal problems , 2000 .

[23]  D. S. Sivia,et al.  Data Analysis , 1996, Encyclopedia of Evolutionary Psychological Science.

[24]  Louis Lyons,et al.  Statistics for Nuclear and Particle Physicists , 1989 .

[25]  Robert D. Cousins,et al.  The Jeffreys–Lindley paradox and discovery criteria in high energy physics , 2013, Synthese.

[26]  Jared A. Evans,et al.  Simplified Models for LHC New Physics Searches , 2011, 1105.2838.

[27]  Rory A. Fisher,et al.  Studies in crop variation. I. An examination of the yield of dressed grain from Broadbalk , 1921, The Journal of Agricultural Science.

[28]  D. Rathlev,et al.  W±Z production at the LHC: fiducial cross sections and distributions in NNLO QCD , 2016, 1703.09065.

[29]  A. A. Ocampo Rios,et al.  Measurement of the top-quark mass in $ \mathrm{t}\overline{\mathrm{t}} $ events with lepton+jets final states in pp collisions at $ \sqrt{s}=7 $ TeV , 2012 .

[30]  F. Siegert,et al.  Event generation with SHERPA 1.1 , 2008, 0811.4622.

[31]  M. P. Casado,et al.  Measurement of $$W^{\pm }Z$$W±Z production cross sections and gauge boson polarisation in pp collisions at $$\sqrt{s} = 13~\text {TeV}$$s=13TeV with the ATLAS detector , 2019 .

[32]  D. Soper Parton distribution functions , 1996, hep-lat/9609018.

[33]  J. Neyman Outline of a Theory of Statistical Estimation Based on the Classical Theory of Probability , 1937 .

[34]  R. Fisher XV.—The Correlation between Relatives on the Supposition of Mendelian Inheritance. , 1919, Transactions of the Royal Society of Edinburgh.

[35]  P. Schuster,et al.  Simplified models for a first characterization of new physics at the LHC , 2008, 0810.3921.

[36]  G. R. Lee,et al.  $ZZ \to \ell^{+}\ell^{-}\ell^{\prime +}\ell^{\prime -}$ cross-section measurements and search for anomalous triple gauge couplings in 13 TeV $pp$ collisions with the ATLAS detector , 2017 .

[37]  R. Cousins,et al.  A Unified Approach to the Classical Statistical Analysis of Small Signals , 1997, physics/9711021.

[38]  Frank Krauss,et al.  Soft photon radiation in particle decays in SHERPA , 2008, 0810.5071.

[39]  T. Tuuva,et al.  Measurement and QCD analysis of double-differential inclusive jet cross-sections in pp collisions at sqrt(s) = 8 TeV and ratios to 2.76 and 7 TeV , 2016, 1609.05331.

[40]  V. M. Ghete,et al.  Measurement of the tt¯$$ \mathrm{t}\overline{\mathrm{t}} $$ production cross section using events with one lepton and at least one jet in pp collisions at s=13$$ \sqrt{s}=13 $$ TeV , 2017 .

[41]  Gustavo K. Rohde,et al.  Optimal Mass Transport: Signal processing and machine-learning applications , 2017, IEEE Signal Processing Magazine.

[42]  Bruce Yabsley,et al.  Search for the standard model Higgs boson produced in association with top quarks and decaying into a bb¯ pair in pp collisions at √s=13 TeV with the ATLAS detector , 2018 .

[43]  F. James Statistical Methods in Experimental Physics , 1973 .

[44]  B. Póczos,et al.  End-to-end particle and event identification at the Large Hadron Collider with CMS Open Data , 2019, 1910.07029.

[45]  A. C. Aitken IV.—On Least Squares and Linear Combination of Observations , 1936 .

[46]  G. D'Agostini,et al.  A Multidimensional unfolding method based on Bayes' theorem , 1995 .

[47]  G. Zanderighi,et al.  $$W^+W^-$$W+W-, $$WZ$$WZ and $$ZZ$$ZZ production in the POWHEG-BOX-V2 , 2013, 1311.1365.

[48]  Hayes,et al.  Review of Particle Physics. , 1996, Physical review. D, Particles and fields.

[49]  Tom Melia,et al.  W+W−, WZ and ZZ production in the POWHEG BOX , 2011, 1107.5051.

[50]  K. Cranmer,et al.  Asymptotic formulae for likelihood-based tests of new physics , 2010, 1007.1727.

[51]  Tilman Plehn,et al.  The gauge-Higgs legacy of the LHC Run I , 2016, SciPost Physics.

[52]  E. Re,et al.  A general framework for implementing NLO calculations in shower Monte Carlo programs: the POWHEG BOX , 2010, 1002.2581.

[53]  Tilman Plehn,et al.  Decoupling Theoretical Uncertainties from Measurements of the Higgs Boson , 2013, 1401.0080.

[54]  M. Gell-Mann,et al.  Advantages of the color octet gluon picture , 1973 .

[55]  C. Degrande A basis of dimension-eight operators for anomalous neutral triple gauge boson interactions , 2013, 1308.6323.

[56]  D. Zerwas,et al.  The Global Higgs Picture at 27 TeV , 2018, SciPost Physics.

[57]  P. Nason A new method for combining NLO QCD with shower Monte Carlo algorithms , 2004, hep-ph/0409146.

[58]  V. M. Ghete,et al.  Combined search for electroweak production of charginos and neutralinos in proton-proton collisions at s=13$$ \sqrt{s}=13 $$ TeV , 2018, 1801.03957.

[59]  Luca Lista Combination of measurements and the BLUE method , 2016 .

[60]  B. Betchart,et al.  Analytic solutions for neutrino momenta in decay of top quarks , 2013, 1305.1878.

[61]  Andy Buckley,et al.  The simplified likelihood framework , 2018, Journal of High Energy Physics.

[62]  Andrei,et al.  Measurements of the Higgs boson production and decay rates and constraints on its couplings from a combined ATLAS and CMS analysis of the LHC pp collision data at √s = 7 and 8 TeV , 2016, 1606.02266.

[63]  M. Gell-Mann A Schematic Model of Baryons and Mesons , 1964, Resonance.

[64]  Cms Collaborations Measurements of the Higgs boson production and decay rates and constraints on its couplings from a combined ATLAS and CMS analysis of the LHC $pp$ collision data at $\sqrt{s}=$ 7 and 8 TeV , 2016 .

[65]  Andy Buckley,et al.  Rivet user manual , 2010, Comput. Phys. Commun..

[66]  P. Nason,et al.  Matching NLO QCD computations with Parton Shower simulations: the POWHEG method , 2007, 0709.2092.

[67]  Kyle Cranmer,et al.  Practical Statistics for the LHC , 2014, 1503.07622.

[68]  M. Kadastik,et al.  Searches for new physics: Les Houches recommendations for the presentation of LHC results , 2012, 1203.2489.

[69]  G. Zweig An SU(3) model for strong interaction symmetry and its breaking. Version 1 , 1964 .

[70]  Stefan Schmitt,et al.  Data Unfolding Methods in High Energy Physics , 2016, 1611.01927.

[71]  A. Read Presentation of search results: the CLs technique , 2002 .

[72]  A Supersymmetry Primer , 1997, hep-ph/9709356.

[73]  A. Tikhonov On the stability of inverse problems , 1943 .

[74]  R. Trotta Bayesian Methods in Cosmology , 2017, 1701.01467.

[75]  Pablo de Castro,et al.  INFERNO: Inference-Aware Neural Optimisation , 2018, Comput. Phys. Commun..

[76]  F. Garwood,et al.  i) Fiducial Limits for the Poisson Distribution , 1936 .

[77]  R. Frederix,et al.  The automated computation of tree-level and next-to-leading order differential cross sections, and their matching to parton shower simulations , 2014, 1405.0301.

[78]  G. D'Agostini,et al.  Improved iterative Bayesian unfolding , 2010, 1010.0632.

[79]  Alan D. Martin,et al.  Review of Particle Physics , 2018, Physical Review D.

[80]  L. Lyons Bayes and Frequentism: a particle physicist’s perspective , 2013, 1301.1273.

[81]  R. Brenner,et al.  Measurement of the W⁺W⁻ production cross section in pp collisions at a centre-of-mass energy of √s = 13 TeV with the ATLAS experiment , 2017 .

[82]  Measurements of the pp → WZ inclusive and differential production cross sections and constraints on charged anomalous triple gauge couplings at s$$ \sqrt{s} $$ = 13 TeV , 2019, Journal of High Energy Physics.

[83]  J. Latorre,et al.  Parton distributions from high-precision collider data , 2017, The European Physical Journal C.

[84]  S. M. Etesami,et al.  Evidence for $$\text {W}\text {W}$$ production from double-parton interactions in proton–proton collisions at $$\sqrt{s} = 13 \,\text {TeV} $$ , 2019, The European Physical Journal C.