MEGADOCK 3.0: a high-performance protein-protein interaction prediction software using hybrid parallel computing for petascale supercomputing environments

Background: Protein-protein interactions (PPIs) play a core role in cellular functions. Massively parallel supercomputing systems have been actively developed over the past few years, enabling large-scale biological problems to be solved, such as PPI network prediction based on tertiary structures.

Results: We have developed “MEGADOCK”, a high-throughput and ultra-fast PPI prediction system based on rigid docking, which employs a hybrid parallelization (MPI/OpenMP) technique designed for use on massively parallel supercomputing systems. MEGADOCK displays significantly faster processing speed in the rigid-body docking process, which enables full utilization of protein tertiary structural data for large-scale and network-level problems in systems biology. Moreover, the system was scalable, as shown by measurements carried out on two supercomputing environments. We then conducted prediction of biological PPI networks using the post-docking analysis.

Conclusions: We present a new protein-protein docking engine aimed at exhaustive docking of mega-order numbers of protein pairs. The system was shown to be scalable by running on thousands of nodes. The software package is available at: http://www.bi.cs.titech.ac.jp/megadock/k/.
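To make the idea of exhaustive (all-to-all) docking across many nodes concrete, the following minimal sketch enumerates every receptor-ligand pair and divides the resulting jobs among a number of processes. It is an illustration only: the function names `all_to_all_pairs` and `jobs_for_rank` are hypothetical, and the static round-robin split is an assumption, not a description of how MEGADOCK actually schedules its work. In a hybrid MPI/OpenMP setting, each process would typically run the docking of its assigned pairs with multiple threads.

```python
from itertools import product


def all_to_all_pairs(receptors, ligands):
    """Enumerate every receptor-ligand docking job (all-to-all)."""
    return list(product(receptors, ligands))


def jobs_for_rank(jobs, rank, n_ranks):
    """Static round-robin split (an assumption for illustration):
    rank r takes jobs r, r + n_ranks, r + 2 * n_ranks, ..."""
    if not 0 <= rank < n_ranks:
        raise ValueError("rank must satisfy 0 <= rank < n_ranks")
    return jobs[rank::n_ranks]


if __name__ == "__main__":
    # Hypothetical protein identifiers, docked against each other.
    proteins = ["P1", "P2", "P3"]
    jobs = all_to_all_pairs(proteins, proteins)
    for rank in range(4):
        print(rank, jobs_for_rank(jobs, rank, 4))
```

Because the number of pairs grows as the product of the two protein sets, even a few thousand structures yield millions of independent docking jobs, which is why this kind of workload lends itself to distribution over thousands of nodes.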
