Autoencoder-Based Anomaly Detection System for Online Data Quality Monitoring of the CMS Electromagnetic Calorimeter

The CMS detector is a general-purpose apparatus that detects high-energy collisions produced at the LHC. Online Data Quality Monitoring of the CMS electromagnetic calorimeter is a vital operational tool that allows detector experts to quickly identify, localize, and diagnose a broad range of detector issues that could affect the quality of physics data. A real-time autoencoder-based anomaly detection system using semi-supervised machine learning is presented enabling the detection of anomalies in the CMS electromagnetic calorimeter data. A novel method is introduced which maximizes the anomaly detection performance by exploiting the time-dependent evolution of anomalies as well as spatial variations in the detector response. The autoencoder-based system is able to efficiently detect anomalies, while maintaining a very low false discovery rate. The performance of the system is validated with anomalies found in 2018 and 2022 LHC collision data. Additionally, the first results from deploying the autoencoder-based system in the CMS online Data Quality Monitoring workflow during the beginning of Run 3 of the LHC are presented, showing its ability to detect issues missed by the existing system.

[1]  C. Collaboration,et al.  Electron and photon reconstruction and identification with the CMS experiment at the CERN LHC , 2020, Journal of Instrumentation.

[2]  B. Nachman Anomaly Detection for Physics Analysis and Less Than Supervised Learning , 2020, Artificial Intelligence for High Energy Physics.

[3]  C. Collaboration,et al.  Reconstruction of signal amplitudes in the CMS electromagnetic calorimeter in the presence of overlapping proton-proton interactions , 2020, Journal of Instrumentation.

[4]  C. Collaboration,et al.  Performance of the CMS Level-1 trigger in proton-proton collisions at √s = 13 TeV , 2020, Journal of Instrumentation.

[5]  S. M. Etesami,et al.  A measurement of the Higgs boson mass in the diphoton decay channel , 2020, Physics Letters B.

[6]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[7]  Gianluca Cerminara,et al.  Detector Monitoring with Artificial Neural Networks at the CMS Experiment at the CERN Large Hadron Collider , 2018, Computing and Software for Big Science.

[8]  Eli Upfal,et al.  Machine Learning in High Energy Physics Community White Paper , 2018, Journal of Physics: Conference Series.

[9]  P. Siddireddy The CMS ECAL Trigger and DAQ system: electronics auto-recovery and monitoring , 2018, 1806.09136.

[10]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  L. Borrello,et al.  The Data Quality Monitoring Software for the CMS experiment at the LHC , 2014, 2014 IEEE Nuclear Science Symposium and Medical Imaging Conference (NSS/MIC).

[12]  A. Trzupek,et al.  Observation of a new particle in the search for the Standard Model Higgs boson with the ATLAS detector at the LHC , 2012, 1207.7214.

[13]  The Cms Collaboration Observation of a new boson at a mass of 125 GeV with the CMS experiment at the LHC , 2012, 1207.7235.

[14]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[15]  L Tuura,et al.  CMS data quality monitoring: Systems and experiences , 2010 .

[16]  João Paulo Teixeira,et al.  The CMS experiment at the CERN LHC , 2008 .

[17]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[18]  I. Mandjavidze,et al.  The selective read-out processor for the CMS electromagnetic calorimeter , 2004, IEEE Transactions on Nuclear Science.

[19]  J. Varela,et al.  The CMS trigger system , 2004, 1609.02366.

[20]  Scott Rutherford,et al.  Study of the Effects of Data Reduction Algorithms on Physics Reconsruction in the CMS ECAL , 2003 .

[21]  M. Pierini,et al.  Improving data quality monitoring via a partnership of technologies and resources between the CMS experiment at CERN and industry , 2019, EPJ Web of Conferences.

[22]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.