metaPIS: A Sequence-based Meta-server for Protein Interaction Site Prediction

The identification of interfaces in protein complexes is effective for the elucidation of protein function and helps us to understand their roles in biological processes. With the exponentially growing amount of protein sequence data, an exploration of new methods that predict protein interaction sites based solely on sequence information is becoming increasingly urgent. Because a combination of different methods could produce better results than a single method, interaction site prediction can be improved through the utilization of different methods. This paper describes a new method that predicts interaction sites based on protein sequences by integrating five different algorithms employing meta-method, Majority Vote and SVMhmm Regression techniques. The 'metaPIS' web-server was implemented for meta-prediction. An evaluation of the meta-methods using independent datasets revealed that Majority Vote achieved the highest average Matthews correlation coefficient (0.181) among all the methods assessed. SVMhmm Regression achieved a lower score but provided a more stable result. The metaPIS server allows experimental biologists to speculate regarding protein function by identifying potential interaction sites based on protein sequence. As a web server, metaPIS is freely accessible to the public at http://202.116.74.5:84/metapis.

[1]  Sitao Wu,et al.  LOMETS: A local meta-threading-server for protein structure prediction , 2007, Nucleic acids research.

[2]  Charles-Edmond Bichot,et al.  A new meta-method for graph partitioning , 2008, 2008 IEEE Congress on Evolutionary Computation (IEEE World Congress on Computational Intelligence).

[3]  Burkhard Rost,et al.  Protein–Protein Interaction Hotspots Carved into Sequences , 2007, PLoS Comput. Biol..

[4]  Marc F Lensink,et al.  Blind predictions of protein interfaces by docking calculations in CAPRI , 2010, Proteins.

[5]  Harpreet Kaur Saini,et al.  BIOINFORMATICS APPLICATIONS NOTE Structural bioinformatics Meta-DP: domain prediction meta-server , 2022 .

[6]  Joshua S Yuan,et al.  Plant Protein-Protein Interaction Network and Interactome , 2010, Current genomics.

[7]  Tanja Kortemme,et al.  Computer-aided design of functional protein interactions. , 2009, Nature chemical biology.

[8]  F. Balkwill Cancer and the chemokine network , 2004, Nature Reviews Cancer.

[9]  Kristian Vlahovicek,et al.  Prediction of Protein–Protein Interaction Sites in Sequences and 3D Structures by Random Forests , 2009, PLoS Comput. Biol..

[10]  Jinyan Li,et al.  Sequence-based identification of interface residues by an integrative profile combining hydrophobic and evolutionary information , 2010, BMC Bioinformatics.

[11]  Conrad C. Huang,et al.  UCSF Chimera—A visualization system for exploratory research and analysis , 2004, J. Comput. Chem..

[12]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[13]  Xiaolong Wang,et al.  Protein-protein interaction site prediction based on conditional random fields , 2007, Bioinform..

[14]  C. Sander,et al.  The PDBFINDER database: a summary of PDB, DSSP and HSSP information with added value , 1996, Comput. Appl. Biosci..

[15]  L. Miller,et al.  Importance of each residue within secretin for receptor binding and biological activity. , 2011, Biochemistry.

[16]  A. Bulpitt,et al.  Insights into protein-protein interfaces using a Bayesian network prediction method. , 2006, Journal of molecular biology.

[17]  N. Hacohen,et al.  A Physical and Regulatory Map of Host-Influenza Interactions Reveals Pathways in H1N1 Infection , 2009, Cell.

[18]  S. L. Wong,et al.  Towards a proteome-scale map of the human protein–protein interaction network , 2005, Nature.

[19]  Kurt S. Thorn,et al.  ASEdb: a database of alanine mutations and their effects on the free energy of binding in protein interactions , 2001, Bioinform..

[20]  J. Thornton,et al.  PQS: a protein quaternary structure file server. , 1998, Trends in biochemical sciences.

[21]  D. Kihara,et al.  A novel method for protein–protein interaction site prediction using phylogenetic substitution models , 2012, Proteins.

[22]  Burkhard Rost,et al.  ISIS: interaction sites identified from sequence , 2007, Bioinform..

[23]  Xiaolong Wang,et al.  Prediction of protein binding sites in protein structures using hidden Markov support vector machine , 2009, BMC Bioinformatics.

[24]  James R. Knight,et al.  A Protein Interaction Map of Drosophila melanogaster , 2003, Science.

[25]  Wenchao Jiang,et al.  Identifying protein–protein interaction sites in transient complexes with temperature factor, sequence profile and accessible surface area , 2009, Amino Acids.

[26]  Wen-Lian Hsu,et al.  Protein-Protein Interaction Site Predictions with Three-Dimensional Probability Distributions of Interacting Atoms on Protein Surfaces , 2012, PloS one.

[27]  Sean R. Collins,et al.  Global landscape of protein complexes in the yeast Saccharomyces cerevisiae , 2006, Nature.

[28]  P. Bork,et al.  Evolution of biomolecular networks — lessons from metabolic and protein interactions , 2009, Nature Reviews Molecular Cell Biology.

[29]  Thorsten Joachims,et al.  Cutting-plane training of structural SVMs , 2009, Machine Learning.

[30]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[31]  Yaoqi Zhou,et al.  Real‐SPINE: An integrated system of neural networks for real‐value prediction of protein structural properties , 2007, Proteins.

[32]  Lei Shi,et al.  A probabilistic framework to predict protein function from interaction data integrated with semantic knowledge , 2008, BMC Bioinformatics.

[33]  Zhiping Weng,et al.  Protein–protein docking benchmark version 4.0 , 2010, Proteins.

[34]  David R. Westhead,et al.  Improved prediction of protein-protein binding sites using a support vector machines approach. , 2005, Bioinformatics.

[35]  Kai Wang,et al.  Meta-analysis of Inter-species Liver Co-expression Networks Elucidates Traits Associated with Common Human Diseases , 2009, PLoS Comput. Biol..

[36]  Yong-Jun Jiang,et al.  Structural Features Underlying Selective Inhibition of GSK3β by Dibromocantharelline: Implications for Rational Drug Design , 2011, Chemical biology & drug design.

[37]  Bartek Wilczynski,et al.  Biopython: freely available Python tools for computational molecular biology and bioinformatics , 2009, Bioinform..

[38]  Vasant Honavar,et al.  HomPPI: a class of sequence homology based protein-protein interface prediction methods , 2011, BMC Bioinformatics.

[39]  Xiang Zhang,et al.  Radial basis function neural network ensemble for predicting protein-protein interaction sites in heterocomplexes. , 2010, Protein and peptide letters.

[40]  Martin Zacharias,et al.  Prediction of protein-protein interaction sites using electrostatic desolvation profiles. , 2010, Biophysical journal.

[41]  Steven M. Lewis,et al.  Anchored Design of Protein-Protein Interfaces , 2011, PloS one.

[42]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[43]  Janusz M. Bujnicki,et al.  MetaMQAP: A meta-server for the quality assessment of protein models , 2008, BMC Bioinformatics.

[44]  Kenji Mizuguchi,et al.  Applying the Naïve Bayes classifier with kernel density estimation to the prediction of protein-protein interaction sites , 2010, Bioinform..