Cloud Computing for Protein-Ligand Binding Site Comparison

The proteome-wide analysis of protein-ligand binding sites and their interactions with ligands is important in structure-based drug design and in understanding ligand cross reactivity and toxicity. The well-known and commonly used software, SMAP, has been designed for 3D ligand binding site comparison and similarity searching of a structural proteome. SMAP can also predict drug side effects and reassign existing drugs to new indications. However, the computing scale of SMAP is limited. We have developed a high availability, high performance system that expands the comparison scale of SMAP. This cloud computing service, called Cloud-PLBS, combines the SMAP and Hadoop frameworks and is deployed on a virtual cloud computing platform. To handle the vast amount of experimental data on protein-ligand binding site pairs, Cloud-PLBS exploits the MapReduce paradigm as a management and parallelizing tool. Cloud-PLBS provides a web portal and scalability through which biologists can address a wide range of computer-intensive questions in biology and drug discovery.

[1]  Lei Xie,et al.  Detecting evolutionary relationships across existing fold space, using sequence order-independent profile–profile alignments , 2008, Proceedings of the National Academy of Sciences.

[2]  Ronald C. Taylor An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics , 2010, BMC Bioinformatics.

[3]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[4]  Sriram Krishnan,et al.  Design and Evaluation of Opal2: A Toolkit for Scientific Software as a Service , 2009, 2009 Congress on Services - I.

[5]  M. Schatz,et al.  Searching for SNPs with cloud computing , 2009, Genome Biology.

[6]  Philip E. Bourne,et al.  Drug Discovery Using Chemical Systems Biology: Repositioning the Safe Medicine Comtan to Treat Multi-Drug and Extensively Drug Resistant Tuberculosis , 2009, PLoS Comput. Biol..

[7]  Alla Karnovsky,et al.  A class of selective antibacterials derived from a protein kinase inhibitor pharmacophore , 2009, Proceedings of the National Academy of Sciences.

[8]  Michael C. Schatz,et al.  CloudBurst: highly sensitive read mapping with MapReduce , 2009, Bioinform..

[9]  Philip E. Bourne,et al.  SMAP-WS: a parallel web service for structural proteome-wide ligand-binding site comparison , 2010, Nucleic Acids Res..

[10]  Jian Wang,et al.  In Silico Elucidation of the Molecular Mechanism Defining the Adverse Effect of Selective Estrogen Receptor Modulators , 2007, PLoS Comput. Biol..

[11]  Philip E. Bourne,et al.  Drug Discovery Using Chemical Systems Biology: Identification of the Protein-Ligand Binding Network To Explain the Side Effects of CETP Inhibitors , 2009, PLoS Comput. Biol..

[12]  Philip E. Bourne,et al.  A Multidimensional Strategy to Detect Polypharmacological Targets in the Absence of Structural and Sequence Homology , 2010, PLoS Comput. Biol..

[13]  Philip E. Bourne,et al.  A robust and efficient algorithm for the shape description of protein structures and its application in predicting ligand binding sites , 2007, BMC Bioinformatics.

[14]  Che-Lun Hung,et al.  Open Reading Frame Phylogenetic Analysis on the Cloud , 2013, International journal of genomics.

[15]  Yaw-Ling Lin,et al.  Implementation of a Parallel Protein Structure Alignment Service on Cloud , 2013, International journal of genomics.

[16]  Philip E. Bourne,et al.  A unified statistical model to support local sequence order independent similarity searching for ligand-binding sites and its application to genome-based drug discovery , 2009, Bioinform..