EXSCALATE: An extreme-scale in-silico virtual screening platform to evaluate 1 trillion compounds in 60 hours on 81 PFLOPS supercomputers

The social and economic impact of the COVID-19 pandemic demands the reduction of the time required to find a therapeutic cure. In the contest of urgent computing, we re-designed the Exscalate molecular docking platform to benefit from heterogeneous computation nodes and to avoid scaling issues. We deployed the Exscalate platform on two top European supercomputers (CINECA-Marconi100 and ENI-HPC5), with a combined computational power of 81 PFLOPS, to evaluate the interaction between 70 billions of small molecules and 15 binding-sites of 12 viral proteins of Sars-Cov2. The experiment lasted 60 hours and overall it performed a trillion of evaluations.

[1]  Franck Cappello,et al.  Addressing failures in exascale computing , 2014, Int. J. High Perform. Comput. Appl..

[2]  P. Alam ‘A’ , 2021, Composites Engineering: An A–Z Guide.

[3]  Graph Topology MPI at Exascale , 2010 .

[4]  Gianluca Palermo,et al.  Legio: fault resiliency for embarrassingly parallel MPI applications , 2021, The Journal of Supercomputing.

[5]  Ruben Abagyan,et al.  Docking and scoring with ICM: the benchmarking results and strategies for improvement , 2012, Journal of Computer-Aided Molecular Design.

[6]  Carlo Cavazzoni,et al.  Exploiting OpenMP and OpenACC to accelerate a geometric approach to molecular docking in heterogeneous HPC nodes , 2019, The Journal of Supercomputing.

[7]  Margaret H. Wright,et al.  The opportunities and challenges of exascale computing , 2010 .

[8]  Simon Fong,et al.  PSOVina: The hybrid particle swarm optimization algorithm for protein-ligand docking , 2015, J. Bioinform. Comput. Biol..

[9]  P. Alam ‘L’ , 2021, Composites Engineering: An A–Z Guide.

[10]  Jessica Holien,et al.  Improvements, trends, and new ideas in molecular docking: 2012–2013 in review , 2015, Journal of molecular recognition : JMR.

[11]  Yanli Wang,et al.  Structure-Based Virtual Screening for Drug Discovery: a Problem-Centric Review , 2012, The AAPS Journal.

[12]  Torsten Schwede,et al.  A Comprehensive Mapping of the Druggable Cavities within the SARS-CoV-2 Therapeutically Relevant Proteins by Combining Pocket and Docking Searches as Implemented in Pockets 2.0 , 2020, International journal of molecular sciences.

[13]  Pedro J. Ballester,et al.  Performance of machine-learning scoring functions in structure-based virtual screening , 2017, Scientific Reports.

[14]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[15]  Stephen Dunn Smiles , 1932 .

[16]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[17]  Carlo Cavazzoni,et al.  Use of Experimental Design To Optimize Docking Performance: The Case of LiGenDock, the Docking Module of Ligen, a New De Novo Design Program , 2013, J. Chem. Inf. Model..

[18]  Thomas Hérault,et al.  Post-failure recovery of MPI communication capability , 2013, Int. J. High Perform. Comput. Appl..

[19]  David S. Goodsell,et al.  AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility , 2009, J. Comput. Chem..

[20]  Andy B. Yoo,et al.  Approved for Public Release; Further Dissemination Unlimited X-ray Pulse Compression Using Strained Crystals X-ray Pulse Compression Using Strained Crystals , 2002 .

[21]  Matthew B. Baker,et al.  High-throughput virtual laboratory for drug discovery using massive datasets , 2021, Int. J. High Perform. Comput. Appl..

[22]  Aleksey A. Porollo,et al.  Survey of public domain software for docking simulations and virtual screening , 2011, Human Genomics.

[23]  M. Jette,et al.  Simple Linux Utility for Resource Management , 2009 .

[24]  J. Tuszynski,et al.  Software for molecular docking: a review , 2017, Biophysical Reviews.

[25]  Yuan-Ping Pang,et al.  EUDOC on the IBM Blue Gene/L system: Accelerating the transfer of drug discoveries from laboratory to patient , 2008, IBM J. Res. Dev..

[26]  Message Passing Interface Forum MPI: A message - passing interface standard , 1994 .

[27]  Miss A.O. Penney (b) , 1974, The New Yale Book of Quotations.

[28]  Stephen R. Johnson,et al.  Molecular properties that influence the oral bioavailability of drug candidates. , 2002, Journal of medicinal chemistry.

[29]  Wei Chen,et al.  FireWorks: a dynamic workflow system designed for high‐throughput applications , 2015, Concurr. Comput. Pract. Exp..

[30]  Message P Forum,et al.  MPI: A Message-Passing Interface Standard , 1994 .

[31]  Gianluca Palermo,et al.  Accelerating a Geometric Approach to Molecular Docking with OpenACC , 2018, PBio 2018.

[32]  René Thomsen,et al.  MolDock: a new technique for high-accuracy molecular docking. , 2006, Journal of medicinal chemistry.

[33]  Gorjan Alagic,et al.  #p , 2019, Quantum information & computation.

[34]  P. Alam ‘G’ , 2021, Composites Engineering: An A–Z Guide.

[35]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..