HPC resource integration into CMS Computing via HEPCloud

The higher energy and luminosity from the LHC in Run 2 have put increased pressure on CMS computing resources. Extrapolating to even higher luminosities (and thus higher event complexities and trigger rates) beyond Run 3, it becomes clear that simply scaling up the the current model of CMS computing alone will become economically unfeasible. High Performance Computing (HPC) facilities, widely used in scientific computing outside of HEP, have the potential to help fill the gap. Here we describe the U.S.CMS efforts to integrate US HPC resources into CMS Computing via the HEPCloud project at Fermilab. We present advancements in our ability to use NERSC resources at scale and efforts to integrate other HPC sites as well. We present experience in the elastic use of HPC resources, quickly scaling up use when so required by CMS workflows. We also present performance studies of the CMS multi-threaded framework on both Haswell and KNL HPC resources.

[1]  Miron Livny,et al.  Condor-a hunter of idle workstations , 1988, [1988] Proceedings. The 8th International Conference on Distributed.

[2]  Nancy Wilkins-Diehr,et al.  XSEDE: Accelerating Scientific Discovery , 2014, Computing in Science & Engineering.

[3]  Dirk Hufnagel CMS use of allocation based HPC resources , 2017 .

[4]  Igor Sfiligoi,et al.  The Pilot Way to Grid Resources Using glideinWMS , 2009, 2009 WRI World Congress on Computer Science and Information Engineering.

[5]  Tina Declerck,et al.  Enabling a SuperFacility with Software Defined Networking , 2017 .

[6]  David R. Swanson,et al.  Accessing opportunistic resources with Bosco , 2014 .

[7]  A Pérez-Calero Yzquierdo,et al.  Stability and scalability of the CMS Global Pool: Pushing HTCondor and glideinWMS to new limits , 2017 .

[8]  David Skinner,et al.  Enhancing supercomputing with software defined networking , 2018, 2018 International Conference on Information Networking (ICOIN).

[9]  Brian Bockelman,et al.  Any Data, Any Time, Anywhere: Global Data Access for Science , 2015, 2015 IEEE/ACM 2nd International Symposium on Big Data Computing (BDC).