Data Preservation in High Energy Physics -- DPHEP Global Report 2022

This document summarizes the status of data preservation in high energy physics. The paradigms and the methodological advances are discussed from a perspective of more than ten years of experience with a structured effort at international level. The status and the scientific return related to the preservation of data accumulated at large collider experiments are presented, together with an account of ongoing efforts to ensure long-term analysis capabilities for ongoing and future experiments. Transverse projects aimed at generic solutions, most of which are specifically inspired by open science and FAIR principles, are presented as well. A prospective and an action plan are also indicated.

[1]  Armenia,et al.  Measurement of Lepton-Jet Correlation in Deep-Inelastic Scattering with the H1 Detector Using Machine Learning for Unfolding. , 2021, Physical review letters.

[2]  M. Arratia,et al.  Reconstructing the kinematics of deep inelastic scattering with deep learning , 2021, Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment.

[3]  S. Schmitt,et al.  Preservation through modernisation: The software of the H1 experiment at HERA , 2021, EPJ Web of Conferences.

[4]  Tibor Simko,et al.  Scalable Declarative HEP Analysis Workflows for Containerised Compute Clouds , 2021, Frontiers in Big Data.

[5]  F. Zomer,et al.  Measurement of jet production cross sections in deep-inelastic ep scattering at HERA , 2017, The European Physical Journal C.

[6]  Implementation of simplified likelihoods in HistFactory for searches for supersymmetry , 2021 .

[7]  A. B. Kaliyar,et al.  Search for Axionlike Particles Produced in e^{+}e^{-} Collisions at Belle II. , 2020, Physical review letters.

[8]  Abdul Quamar,et al.  ATHENA++ , 2020 .

[9]  A. B. Kaliyar,et al.  Search for an Invisibly Decaying Z^{'} Boson at Belle II in e^{+}e^{-}→μ^{+}μ^{-}(e^{±}μ^{∓}) Plus Missing Energy Final States. , 2019, Physical review letters.

[10]  Li-Yu Daisy Liu,et al.  Future Physics Programme of BESIII , 2019, Chinese Physics C.

[11]  T. Simko,et al.  Dataset of tau neutrino interactions recorded by the OPERA experiment , 2020, EPJ Web of Conferences.

[12]  Tibor Simko,et al.  Open data provenance and reproducibility: a case study from publishing CMS open data , 2020, EPJ Web of Conferences.

[13]  Harri Hirvonsalo,et al.  REANA: A System for Reusable Research Data Analyses , 2019, EPJ Web of Conferences.

[14]  Jose Benito Gonzalez Lopez,et al.  A Roadmap for HEP Software and Computing R&D for the 2020s , 2017, Computing and Software for Big Science.

[15]  K. Cranmer,et al.  Open is not enough , 2018, Nature Physics.

[16]  Kyle Cranmer,et al.  Search for Computational Workflow Synergies in Reproducible Research Data Analyses in Particle Physics and Life Sciences , 2018, 2018 IEEE 14th International Conference on e-Science (e-Science).

[17]  Heather Gray,et al.  The TrackML Particle Tracking Challenge , 2018 .

[18]  P. Stark Before reproducibility must come preproducibility , 2018, Nature.

[19]  Lukas Heinrich,et al.  HEPData: a repository for high energy physics data , 2017, ArXiv.

[20]  A. Verbytskyi The ZEUS ong term data preservation project , 2016, 1607.01898.

[21]  Tibor Simko,et al.  CERN Services for Long Term Data Preservation , 2016, iPRES.

[22]  Erik Schultes,et al.  The FAIR Guiding Principles for scientific data management and stewardship , 2016, Scientific Data.

[23]  S. Schmitt,et al.  Summary of workshop on Future Physics with HERA Data , 2016, 1601.01499.

[24]  Patrick Fuhrmann,et al.  Data preservation for the HERA experiments at DESY using dCache technology , 2015 .

[25]  R. Illingworth,et al.  Data preservation at the Fermilab Tevatron , 2015, 1701.07773.

[26]  A. Geiser,et al.  Possible future HERA analyses , 2015, 1512.03624.

[27]  Roger Jones,et al.  Status Report of the DPHEP Collaboration: A Global Effort for Sustainable Data Preservation in High Energy Physics , 2015, ArXiv.

[28]  A. Bodek,et al.  MINERvA neutrino detector response measured with test beam data , 2015, 1501.06431.

[29]  B. Kégl,et al.  The ATLAS Higgs Boson Machine Learning Challenge , 2014 .

[30]  H. Collaboration Measurement of Multijet Production in ep Collisions at High Q^2 and Determination of the Strong Coupling alpha_s , 2014, 1406.4709.

[31]  Marco Cattaneo,et al.  Update of the Computing Models of the WLCG and the LHC Experiments , 2014 .

[32]  Dmitri Ozerov,et al.  A Validation Framework for the Long Term Preservation of High Energy Physics Data , 2013, 2014 IEEE 30th International Conference on Data Engineering Workshops.

[33]  S. Bethke,et al.  Measurement of the strong coupling αS from the three-jet rate in e+e−-annihilation using JADE data , 2013, The European Physical Journal C.

[34]  J. Malka,et al.  The ZEUS data preservation project , 2012, 2012 IEEE Nuclear Science Symposium and Medical Imaging Conference Record (NSS/MIC).

[35]  V. Cerný,et al.  Inclusive deep inelastic scattering at high Q2 with longitudinally polarised lepton beams at HERA , 2012, 1310.0968.

[36]  Predrag Buncic,et al.  Distributing LHC application software and conditions databases using the CernVM file system , 2011 .

[37]  K. Cranmer,et al.  RECAST — extending the impact of existing analyses , 2010, 1010.2506.

[38]  Magenta Book,et al.  AUDIT AND CERTIFICATION OF TRUSTWORTHY DIGITAL REPOSITORIES , 2011 .

[39]  Karol Kruzelecki,et al.  Servicing HEP experiments with a complete set of ready integreated and configured common software components , 2010 .

[40]  Lorenzo Moneta,et al.  ROOT - A C++ framework for petabyte data storage, statistical analysis and visualization , 2009, Comput. Phys. Commun..

[41]  S. Bethke,et al.  Study of moments of event shapes and a determination of αS using e+e− annihilation data from JADE , 2008, 0810.2933.

[42]  Gerd Behrmann,et al.  A distributed storage system with dCache , 2008 .

[43]  A. Pfeiffer,et al.  Overview of the LCG applications area software projects , 2004, IEEE Symposium Conference Record Nuclear Science 2004..

[44]  S. Mrenna,et al.  Pythia 6.3 physics and manual , 2003, hep-ph/0308153.

[45]  S. Moretti,et al.  HERWIG 6: an event generator for hadron emission reactions with interfering gluons (including supersymmetric processes) , 2001 .

[46]  Raymond A. Lorie,et al.  Long term preservation of digital information , 2001, JCDL '01.

[47]  A. Bondar,et al.  The Belle detector , 1998 .

[48]  E. Barrelet,et al.  The tracking calorimeter and muon detectors of the H1 experiment at Hera , 1996 .

[49]  Paul DuBois,et al.  Software portability with imake - practical software engineering (2. ed.) , 1996 .

[50]  T. Sjöstrand High-energy-physics event generation with PYTHIA 5.7 and JETSET 7.4 , 1994 .

[51]  F. W. Brasse,et al.  The H1 detector of HERA , 1992 .

[52]  H. Johnstad,et al.  PAW (Physics Analysis Workstation) at Fermilab: CORE based graphics implementation of HIGZ (High Level Interface to Graphics and Zebra) , 1989 .

[53]  B. Naroska E+ e- Physics with the JADE Detector at PETRA , 1987 .