Essential Proteins Discovery from Weighted Protein Interaction Networks

Identifying essential proteins is important for understanding the minimal requirements for cellular survival and development. Fast growth in the amount of available protein-protein interactions has produced unprecedented opportunities for detecting protein essentiality on network level. A series of centrality measures have been proposed to discover essential proteins based on network topology. However, most of them treat all interactions equally and are sensitive to false positives. In this paper, six standard centrality measures are redefined to be used in weighted network. A new method for weighing protein-protein interactions is proposed based on the combination of logistic regression-based model and function similarity. The experimental results on yeast network show that the weighting method can improve the performance of centrality measures considerably. More essential proteins are discovered by the weighted centrality measures than by the original centrality measures used in unweighted network. Even about 20% improvements are obtained from closeness centrality and subgraph centrality.

[1]  Ronald W. Davis,et al.  Functional characterization of the S. cerevisiae genome by gene deletion and parallel analysis. , 1999, Science.

[2]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[3]  Shivaram Narayanan,et al.  The Betweenness Centrality Of Biological Networks , 2005 .

[4]  Dmitrij Frishman,et al.  MIPS: analysis and annotation of proteins from whole genomes in 2005 , 2005, Nucleic Acids Res..

[5]  Matthew W. Hahn,et al.  Comparative genomics of centrality and essentiality in three eukaryotic protein-interaction networks. , 2005, Molecular biology and evolution.

[6]  Dominique Martinez,et al.  A Model of Stimulus-Specific Neural Assemblies in the Insect Antennal Lobe , 2008, PLoS Comput. Biol..

[7]  Yang Dai,et al.  Assessing protein similarity with Gene Ontology and its use in subnuclear localization prediction , 2006, BMC Bioinformatics.

[8]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[9]  M. Gerstein,et al.  Genomic analysis of essentiality within protein networks. , 2004, Trends in genetics : TIG.

[10]  A. Barabasi,et al.  Lethality and centrality in protein networks , 2001, Nature.

[11]  Roded Sharan,et al.  QPath: a method for querying pathways in a protein-protein interaction network , 2006, BMC Bioinformatics.

[12]  Karl W. Broman,et al.  A postgenomic method for predicting essential genes at subsaturation levels of mutagenesis: Application to Mycobacterium tuberculosis , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[14]  J. A. Rodríguez-Velázquez,et al.  Subgraph centrality in complex networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[15]  Rick Stevens,et al.  Essential genes on metabolic maps. , 2006, Current opinion in biotechnology.

[16]  B. Snel,et al.  Comparative assessment of large-scale data sets of protein–protein interactions , 2002, Nature.

[17]  A. Barabasi,et al.  Functional and topological characterization of protein interaction networks , 2004, Proteomics.

[18]  Dianne P. O'Leary,et al.  Why Do Hubs in the Yeast Protein Interaction Network Tend To Be Essential: Reexamining the Connection between the Network Topology and Essentiality , 2008, PLoS Comput. Biol..

[19]  Leon Peeters,et al.  Algorithms for Centrality Indices , 2004, Network Analysis.

[20]  Alexander Rives,et al.  Modular organization of cellular networks , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[21]  D. Ingber,et al.  High-Betweenness Proteins in the Yeast Protein Interaction Network , 2005, Journal of biomedicine & biotechnology.

[22]  R. Karp,et al.  From the Cover : Conserved patterns of protein interaction in multiple species , 2005 .

[23]  Mike Tyers,et al.  Evolutionary and Physiological Importance of Hub Proteins , 2006, PLoS Comput. Biol..

[24]  B. Palsson,et al.  Genome-scale reconstruction of the metabolic network in Staphylococcus aureus N315: an initial draft to the two-dimensional annotation , 2005, BMC Microbiology.

[25]  Y. Dong,et al.  Systematic functional analysis of the Caenorhabditis elegans genome using RNAi , 2003, Nature.

[26]  James R. Knight,et al.  A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae , 2000, Nature.

[27]  C. Ball,et al.  Saccharomyces Genome Database. , 2002, Methods in enzymology.

[28]  Jianzhi Zhang,et al.  Why Do Hubs Tend to Be Essential in Protein Networks? , 2006, PLoS genetics.

[29]  S. Coulomb,et al.  Gene essentiality and the topology of protein interaction networks , 2005, Proceedings of the Royal Society B: Biological Sciences.

[30]  P. Bork,et al.  Proteome survey reveals modularity of the yeast cell machinery , 2006, Nature.

[31]  Alpan Raval,et al.  Identifying Hubs in Protein Interaction Networks , 2009, PloS one.

[32]  K. Sneppen,et al.  Specificity and Stability in Topology of Protein Networks , 2002, Science.

[33]  V. A. Stewart,et al.  Blood Stage Malaria Vaccine Eliciting High Antigen-Specific Antibody Concentrations Confers No Protection to Young Children in Western Kenya , 2009, PloS one.

[34]  Yang Zhang,et al.  Structure Modeling of All Identified G Protein–Coupled Receptors in the Human Genome , 2006, PLoS Comput. Biol..

[35]  P. Bonacich Power and Centrality: A Family of Measures , 1987, American Journal of Sociology.

[36]  M. Zelen,et al.  Rethinking centrality: Methods and examples☆ , 1989 .

[37]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[38]  Sanjay Kumar,et al.  Computational prediction of essential genes in an unculturable endosymbiotic bacterium, Wolbachia of Brugia malayi , 2009, BMC Microbiology.

[39]  Stefan Wuchty,et al.  Interaction and domain networks of yeast , 2002, Proteomics.

[40]  P. Stadler,et al.  Centers of complex networks. , 2003, Journal of theoretical biology.

[41]  O Mason,et al.  Graph theory and networks in Biology. , 2006, IET systems biology.

[42]  Jacques van Helden,et al.  Evaluation of clustering algorithms for protein-protein interaction networks , 2006, BMC Bioinformatics.

[43]  M. V. Valkenburg Network Analysis , 1964 .

[44]  Ernesto Estrada Virtual identification of essential proteins within the protein interaction network of yeast , 2005, Proteomics.