Hub chacterization from sequence information using statistical methods

Maps of Protein Interaction Networks (PIN) are important as they provide clues about the functions of individual proteins as well as enable system level analyses of cellular processes. Predicting hub proteins, the highly connected proteins in PIN, is a challenging computational problem. This paper proposes two statistical methods for predicting hub proteins which applied on two different data bases - APID, HPRD - has revealed good results. In these methods, Shannon Index (a biodiversity measure) is used along with amino acid attributes Transfer Free Energy to Surface (TFES) and Hydrophobicity to distinguish hub proteins from non-hub proteins.

[1]  Forest Rohwer,et al.  FastGroupII: A web-based bioinformatics platform for analyses of large 16S rDNA libraries , 2006, BMC Bioinformatics.

[2]  Philip M. Kim,et al.  Relating Three-Dimensional Structures to Protein Networks Provides Evolutionary Insights , 2006, Science.

[3]  Artem Cherkasov,et al.  The use of Gene Ontology terms for predicting highly-connected 'hub' nodes in protein-protein interaction networks , 2008, BMC Systems Biology.

[4]  R. Sharan,et al.  Protein networks in disease. , 2008, Genome research.

[5]  Marc A. Martí-Renom,et al.  Characterization of Protein Hubs by Inferring Interacting Motifs from Protein Interactions , 2007, PLoS Comput. Biol..

[6]  R. Albert Scale-free networks in cell biology , 2005, Journal of Cell Science.

[7]  B. Achiriloaie,et al.  VI REFERENCES , 1961 .

[8]  Carl Kingsford,et al.  The power of protein interaction networks for associating genes with diseases , 2010, Bioinform..

[9]  A. Nair,et al.  A coding measure scheme employing electron-ion interaction pseudopotential (EIIP) , 2006, Bioinformation.

[10]  T. Mahalakshmi,et al.  Are Categorical Periodograms and Indicator Sequences of Genomes Spectrally Equivalent? , 2006, Silico Biol..

[11]  C. Keylock Simpson diversity and the Shannon–Wiener index as special cases of a generalized entropy , 2005 .

[12]  Yuko Makita,et al.  Amino-Acid Residue Association Models for Large Scale Protein-Protein Interaction Prediction , 2009, Silico Biol..

[13]  Reza Salavati,et al.  Sequence-based prediction of protein-protein interactions by means of codon usage , 2008, Genome Biology.

[14]  P. Uetz,et al.  From protein networks to biological systems , 2005, FEBS letters.

[15]  Joel Dudley,et al.  Network-Based Elucidation of Human Disease Similarities Reveals Common Functional Modules Enriched for Pluripotent Drug Targets , 2010, PLoS Comput. Biol..

[16]  Yoshihiro Yamanishi,et al.  Drug-target interaction prediction from chemical, genomic and pharmacological data in an integrated framework , 2010, Bioinform..

[17]  Mike Tyers,et al.  Evolutionary and Physiological Importance of Hub Proteins , 2006, PLoS Comput. Biol..

[18]  Peter Salamon,et al.  PHACCS, an online tool for estimating the structure and diversity of uncultured viral communities using metagenomic information , 2005, BMC Bioinformatics.

[19]  Carlos Prieto,et al.  APID: Agile Protein Interaction DataAnalyzer , 2006, Nucleic Acids Res..

[20]  Ziv Bar-Joseph,et al.  Evaluation of different biological data and computational classification methods for use in protein interaction prediction , 2006, Proteins.

[21]  Juwen Shen,et al.  Predicting protein–protein interactions based only on sequences information , 2007, Proceedings of the National Academy of Sciences.

[22]  Hany Taher Alashwal Predicting protein-protein interaction from primary structure support vector machines , 2004 .

[23]  Jingkai Yu,et al.  Combining multiple positive training sets to generate confidence scores for protein–protein interactions , 2008, Bioinform..

[24]  Kengo Kinoshita,et al.  Ligand-binding site prediction of proteins based on known fragment–fragment interactions , 2010, Bioinform..