A New Strategy for Protein Interface Identification Using Manifold Learning Method

Protein interactions play vital roles in biological processes. The study for protein interface will allow people to elucidate the mechanism of protein interaction. However, a large portion of protein interface data is incorrectly collected in current studies. In this paper, a novel strategy of dataset reconstruction using manifold learning method has been proposed for dealing with the noises in the interaction interface data whose definition is based on the residue distances among the different chains within protein complexes. Three support vector machine-based predictors are constructed using different protein features to identify the functional sites involved in the formation of protein interface. The experimental results achieved in this work demonstrate that our strategy can remove noises, and therefore improve the ability for identification of protein interfaces with 77.8% accuracy.

[1]  Xiang Zhang,et al.  Radial basis function neural network ensemble for predicting protein-protein interaction sites in heterocomplexes. , 2010, Protein and peptide letters.

[2]  Mainak Guharoy,et al.  Conserved residue clusters at protein-protein interfaces and their use in binding site identification , 2010, BMC Bioinformatics.

[3]  F. Cohen,et al.  An evolutionary trace method defines binding surfaces common to protein families. , 1996, Journal of molecular biology.

[4]  Martin Zacharias,et al.  Prediction of protein-protein interaction sites using electrostatic desolvation profiles. , 2010, Biophysical journal.

[5]  Lydia E Kavraki,et al.  Low-dimensional, free-energy landscapes of protein-folding reactions by nonlinear dimensionality reduction , 2006, Proc. Natl. Acad. Sci. USA.

[6]  Sarah A. Teichmann,et al.  3D Complex: A Structural Classification of Protein Complexes , 2006, PLoS Comput. Biol..

[7]  Xiuquan Du,et al.  Improved Prediction of Protein Binding Sites from Sequences Using Genetic Algorithm , 2009, The protein journal.

[8]  Vasant Honavar,et al.  A two-stage classifier for identification of protein-protein interface residues , 2004, ISMB/ECCB.

[9]  Shuigeng Zhou,et al.  Prediction of protein-protein interaction sites using an ensemble method , 2009, BMC Bioinformatics.

[10]  O. Lichtarge,et al.  Evolutionary predictions of binding surfaces and interactions. , 2002, Current opinion in structural biology.

[11]  Peng Chen,et al.  Predicting protein interaction sites from residue spatial sequence profile and evolution rate , 2006, FEBS Letters.

[12]  Jinyan Li,et al.  Detection of Outlier Residues for Improving Interface Prediction in Protein Hetero-complexes , 2022 .

[13]  Burkhard Rost,et al.  ISIS: interaction sites identified from sequence , 2007, Bioinform..

[14]  Xue-wen Chen,et al.  Sequence-based prediction of protein interaction sites with an integrative method , 2009, Bioinform..

[15]  Tal Pupko,et al.  Structural Genomics , 2005 .

[16]  De-Shuang Huang,et al.  Identifying protein-protein interfacial residues in heterocomplexes using residue conservation scores. , 2006, International journal of biological macromolecules.

[17]  B. Wang,et al.  Inferring protein-protein interacting sites using residue conservation and evolutionary information. , 2006, Protein and peptide letters.

[18]  Kristian Vlahovicek,et al.  Prediction of Protein–Protein Interaction Sites in Sequences and 3D Structures by Random Forests , 2009, PLoS Comput. Biol..

[19]  B. Rost,et al.  Analysing six types of protein-protein interfaces. , 2003, Journal of molecular biology.

[20]  S. Jones,et al.  Analysis of protein-protein interaction sites using surface patches. , 1997, Journal of molecular biology.

[21]  Xiaolong Wang,et al.  Exploiting three kinds of interface propensities to identify protein binding sites , 2009, Comput. Biol. Chem..

[22]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[23]  O. Lichtarge,et al.  Structural clusters of evolutionary trace residues are statistically significant and common in proteins. , 2002, Journal of molecular biology.

[24]  R. Raz,et al.  ProMate: a structure based prediction program to identify the location of protein-protein binding sites. , 2004, Journal of molecular biology.

[25]  S. Jones,et al.  Prediction of protein-protein interaction sites using patch analysis. , 1997, Journal of molecular biology.

[26]  C. Sander,et al.  The HSSP database of protein structure-sequence alignments. , 1994, Nucleic acids research.

[27]  R. Kini,et al.  Prediction of potential protein‐protein interaction sites from amino acid sequence , 1996, FEBS letters.

[28]  Huan-Xiang Zhou,et al.  Prediction of interface residues in protein–protein complexes by a consensus neural network method: Test against NMR data , 2005, Proteins.

[29]  Aleksey A. Porollo,et al.  Prediction‐based fingerprints of protein–protein interactions , 2006, Proteins.