FPGA-Accelerated Machine Learning Inference as a Service for Particle Physics Computing

Large-scale particle physics experiments face challenging demands for high-throughput computing resources both now and in the future. New heterogeneous computing paradigms on dedicated hardware with increased parallelization, such as Field Programmable Gate Arrays (FPGAs), offer exciting solutions with large potential gains. The growing applications of machine learning algorithms in particle physics for simulation, reconstruction, and analysis are naturally deployed on such platforms. We demonstrate that the acceleration of machine learning inference as a web service represents a heterogeneous computing solution for particle physics experiments that potentially requires minimal modification to the current computing model. As examples, we retrain the ResNet-50 convolutional neural network to demonstrate state-of-the-art performance for top quark jet tagging at the LHC and apply a ResNet-50 model with transfer learning for neutrino event classification. Using Project Brainwave by Microsoft to accelerate the ResNet-50 image classification model, we achieve average inference times of 60 (10) ms with our experimental physics software framework using Brainwave as a cloud (edge or on-premises) service, representing an improvement by a factor of approximately 30 (175) in model inference latency over traditional CPU inference in current experimental hardware. A single FPGA service accessed by many CPUs achieves a throughput of 600–700 inferences per second using an image batch of one, comparable to large batch-size GPU throughput and significantly better than small batch-size GPU throughput. Deployed as an edge or cloud service for the particle physics computing model, coprocessor accelerators can have a higher duty cycle and are potentially much more cost-effective.

[1]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  M Gantsweg,et al.  Preliminary Design Report , 1966 .

[3]  P. Vahle,et al.  A convolutional neural network neutrino event classifier , 2016, ArXiv.

[4]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Huilin Qu,et al.  ParticleNet: Jet Tagging via Particle Clouds , 2019, Physical Review D.

[6]  P. Skands,et al.  Tuning PYTHIA 8.1: the Monash 2013 tune , 2014, 1404.5630.

[7]  Heather Gray,et al.  The TrackML Particle Tracking Challenge , 2018 .

[8]  Hari Angepat,et al.  A cloud-scale acceleration architecture , 2016, 2016 49th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).

[9]  Manfred Lindner,et al.  The NOvA Technical Design Report , 2007 .

[10]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[11]  Ryszard S. Romaniuk,et al.  Observation of a new boson at a mass of 125 GeV with the CMS experiment at the LHC , 2012 .

[12]  Prabhat,et al.  The HEP.TrkX Project: deep neural networks for HL-LHC online and offline tracking , 2017 .

[13]  Song Han,et al.  Fast inference of deep neural networks in FPGAs for particle physics , 2018, Journal of Instrumentation.

[14]  Wei Shi,et al.  Boosted decision trees in the CMS Level-1 endcap muon trigger , 2018 .

[15]  J. Favereau,et al.  DELPHES 3: a modular framework for fast simulation of a generic collider experiment , 2013, Journal of High Energy Physics.

[16]  Peter Skands,et al.  An introduction to PYTHIA 8.2 , 2014, Comput. Phys. Commun..

[17]  Michela Paganini,et al.  CaloGAN: Simulating 3D High Energy Particle Showers in Multi-Layer Electromagnetic Calorimeters with Generative Adversarial Networks , 2017, ArXiv.

[18]  Eli Upfal,et al.  Machine Learning in High Energy Physics Community White Paper , 2018, Journal of Physics: Conference Series.

[19]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[20]  L. Gouskos,et al.  The Machine Learning landscape of top taggers , 2019, SciPost Physics.

[21]  J. T. Childers,et al.  Observation of a new particle in the search for the Standard Model Higgs boson with the ATLAS detector at the LHC , 2012 .

[22]  M. Cacciari,et al.  FastJet user manual , 2011, 1111.6097.

[23]  M. Cacciari,et al.  The anti-$k_t$ jet clustering algorithm , 2008, 0802.1189.

[24]  Kazuhiro Terao,et al.  Machine learning at the energy and intensity frontiers of particle physics , 2018, Nature.

[25]  Lucio Rossi,et al.  High-Luminosity Large Hadron Collider (HL-LHC) : Preliminary Design Report , 2015 .

[26]  D. P. Méndez,et al.  Constraints on Oscillation Parameters from ν_{e} Appearance and ν_{μ} Disappearance in NOvA. , 2017, Physical review letters.

[27]  M. Cacciari,et al.  Dispelling the N3 myth for the kt jet-finder , 2005, hep-ph/0512210.

[28]  Wei Shi,et al.  Boosted Decision Trees in the Level-1 Muon Endcap Trigger at CMS , 2018, Journal of Physics: Conference Series.

[29]  Maria Spiropulu,et al.  Topology Classification with Deep Learning to Improve Real-Time Event Selection at the LHC , 2018, Computing and Software for Big Science.

[30]  Steven R. Simon,et al.  Energy calibration and resolution of the CMS electromagnetic calorimeter in pp collisions at √s = 7 TeV , 2013 .

[31]  D. A. Wickremasinghe,et al.  Convolutional neural networks applied to neutrino events in a liquid argon time projection chamber , 2016, 1611.05531.

[32]  Hannes Jensen,et al.  Reionization and the Cosmic Dawn with the Square Kilometre Array , 2012, 1210.0197.

[33]  Samuel H. Fuller,et al.  The Future of Computing Performance: Game Over or Next Level? , 2014 .

[34]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[35]  Gregor Kasieczka,et al.  Deep-learned Top Tagging with a Lorentz Layer , 2017, SciPost Physics.

[36]  Walter Lampl,et al.  A Roadmap for HEP Software and Computing R&D for the 2020s , 2019 .

[37]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[38]  R.Gill,et al.  Long-Baseline Neutrino Facility (LBNF) and Deep Underground Neutrino Experiment (DUNE) Conceptual Design Report Volume 1: The LBNF and DUNE Projects , 2015 .