Effective Identification of Hot Spots in PPIs Based on Ensemble Learning

The experiment of alanine scanning has shown that most of the binding energies in protein-protein interactions are contributed by a few significant residues at the protein-protein interfaces, and those important residues are called hot spot residues. On the basis of protein-protein interaction, hot spot residues tend to get together to form modules, and those modules are defined as hot regions. So, hot spot residues play an important role in revealing the life activities of organisms. Therefore, how to predict hot spot residues and non-spot residues effectively and accurately is a vital research direction. A new method is proposed combining protein amino acid physicochemical features and structural features to predict the hot spot residues based on the ensemble learning. The experimental results demonstrate that this method of prediction hot spot residues has a good effect.

[1]  Zhiwen Yu,et al.  Protein Function Prediction Using Multilabel Ensemble Classification , 2013, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[2]  Ben Lehner,et al.  A simple principle concerning the robustness of protein complex activity to changes in gene expression , 2008 .

[3]  Xiaolong Zhang,et al.  Prediction of hot regions in protein-protein interactions based on complex network and community detection , 2013, 2013 IEEE International Conference on Bioinformatics and Biomedicine.

[4]  Kurt S. Thorn,et al.  ASEdb: a database of alanine mutations and their effects on the free energy of binding in protein interactions , 2001, Bioinform..

[5]  Doheon Lee,et al.  A feature-based approach to modeling protein–protein interaction hot spots , 2009, Nucleic acids research.

[6]  M. Šikić,et al.  PSAIA – Protein Structure and Interaction Analyzer , 2008, BMC Structural Biology.

[7]  D. Scott,et al.  Small molecules, big targets: drug discovery faces the protein–protein interaction challenge , 2016, Nature Reviews Drug Discovery.

[8]  Ozlem Keskin,et al.  Analysis of Hot Region Organization in Hub Proteins , 2010, Annals of Biomedical Engineering.

[9]  Xiaolong Zhang,et al.  Identification of Hot Regions in Protein-Protein Interactions Based on Detecting Local Community Structure , 2016, ICIC.

[10]  M Michael Gromiha,et al.  Feature selection and classification of protein–protein complexes based on their binding affinities using machine learning approaches , 2014, Proteins.

[11]  Holger Gohlke,et al.  Targeting protein-protein interactions with small molecules: challenges and perspectives for computational binding epitope detection and ligand finding. , 2006, Current medicinal chemistry.

[12]  Baw-Jhiune Liu,et al.  MAGIIC-PRO: detecting functional signatures by efficient discovery of long patterns in protein sequences , 2006, Nucleic Acids Res..

[13]  Ganapati Panda,et al.  Efficient Localization of Hot Spots in Proteins Using a Novel S-Transform Based Filtering Approach , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[14]  Edmond Godfroid,et al.  Distantly related lipocalins share two conserved clusters of hydrophobic residues: use in homology modeling. , 2008, BMC structural biology.

[15]  Ozlem Keskin,et al.  Identification of computational hot spots in protein interfaces: combining solvent accessibility and inter-residue potentials improves the accuracy , 2009, Bioinform..

[16]  G. Weiss,et al.  Combinatorial alanine-scanning. , 2001, Current opinion in chemical biology.

[17]  Ozlem Keskin,et al.  Protein–DNA interactions: structural, thermodynamic and clustering patterns of conserved residues in DNA-binding proteins , 2008, Nucleic acids research.

[18]  Zhiwen Yu,et al.  Erratum to "Protein function prediction using multilabel ensemble classification" , 2014, TCBB.

[19]  N. Ben-Tal,et al.  ConSurf: an algorithmic tool for the identification of functional regions in proteins by surface mapping of phylogenetic information. , 2001, Journal of molecular biology.

[20]  O. Dym,et al.  The modular architecture of protein-protein binding interfaces. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[21]  Baw-Jhiune Liu,et al.  Identification of hot regions in protein-protein interactions by sequential pattern mining , 2007, BMC Bioinformatics.

[22]  O. Keskin,et al.  Predicting Protein-Protein Interactions from the Molecular to the Proteome Level. , 2016, Chemical reviews.

[23]  Massimiliano Pontil,et al.  Prediction of hot spot residues at protein-protein interfaces by combining machine learning and energy-based methods , 2009, BMC Bioinformatics.

[24]  Yu-Dong Cai,et al.  Prediction of Protein-Protein Interaction Sites by Random Forest Algorithm with mRMR and IFS , 2012, PloS one.

[25]  Juan Fernández-Recio,et al.  Prediction of protein-binding areas by small-world residue networks and application to docking , 2011, BMC Bioinformatics.

[26]  Bin Xu,et al.  A semi-supervised boosting SVM for predicting hot spots at protein-protein Interfaces , 2012, BMC Systems Biology.

[27]  Ozlem Keskin,et al.  HotPoint: hot spot prediction server for protein interfaces , 2010, Nucleic Acids Res..

[28]  Xiaolong Zhang,et al.  Prediction and analysis of hot region in protein-protein interactions , 2016, 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[29]  Z. R. Li,et al.  Update of PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence , 2006, Nucleic Acids Res..

[30]  Xiaoming Liu,et al.  Prediction of hot regions in protein-protein interaction by combining density-based incremental clustering with feature-based classification , 2015, Comput. Biol. Medicine.

[31]  R. Nussinov,et al.  Hot regions in protein--protein interactions: the organization and contribution of structurally conserved hot spot residues. , 2005, Journal of molecular biology.

[32]  Alfonso Valencia,et al.  Progress and challenges in predicting protein-protein interaction sites , 2008, Briefings Bioinform..

[33]  Richard M. Jackson,et al.  Predicting protein interaction sites: binding hot-spots in protein-protein and protein-ligand interfaces , 2006, Bioinform..