Identification of Essential Proteins Based on Ranking Edge-Weights in Protein-Protein Interaction Networks

Essential proteins are those that are indispensable to cellular survival and development. Existing methods for essential protein identification generally rely on knock-out experiments and/or the relative density of their interactions (edges) with other proteins in a Protein-Protein Interaction (PPI) network. Here, we present a computational method, called EW, to first rank protein-protein interactions in terms of their Edge Weights, and then identify sub-PPI-networks consisting of only the highly-ranked edges and predict their proteins as essential proteins. We have applied this method to publicly-available PPI data on Saccharomyces cerevisiae (Yeast) and Escherichia coli (E. coli) for essential protein identification, and demonstrated that EW achieves better performance than the state-of-the-art methods in terms of the precision-recall and Jackknife measures. The highly-ranked protein-protein interactions by our prediction tend to be biologically significant in both the Yeast and E. coli PPI networks. Further analyses on systematically perturbed Yeast and E. coli PPI networks through randomly deleting edges demonstrate that the proposed method is robust and the top-ranked edges tend to be more associated with known essential proteins than the lowly-ranked edges.

[1]  Dmitrij Frishman,et al.  MIPS: analysis and annotation of proteins from whole genomes in 2005 , 2005, Nucleic Acids Res..

[2]  Kara Dolinski,et al.  The BioGRID Interaction Database: 2011 update , 2010, Nucleic Acids Res..

[3]  Michael G Kearse,et al.  Expression of ribosomal protein L22e family members in Drosophila melanogaster: rpL22-like is differentially expressed and alternatively spliced , 2010, Nucleic acids research.

[4]  Trey Ideker,et al.  Cytoscape 2.8: new features for data integration and network visualization , 2010, Bioinform..

[5]  M. Gerstein,et al.  Genomic analysis of essentiality within protein networks. , 2004, Trends in genetics : TIG.

[6]  J. Woolford,et al.  Ytm1, Nop7, and Erb1 Form a Complex Necessary for Maturation of Yeast 66S Preribosomes , 2005, Molecular and Cellular Biology.

[7]  Yi Pan,et al.  A new essential protein discovery method based on the integration of protein-protein interaction and gene expression data , 2012, BMC Systems Biology.

[8]  A. Kudlicki,et al.  Logic of the Yeast Metabolic Cycle: Temporal Compartmentalization of Cellular Processes , 2005, Science.

[9]  Yi Pan,et al.  Iteration method for predicting essential proteins based on orthology and protein-protein interaction networks , 2012, BMC Systems Biology.

[10]  James R. Knight,et al.  A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae , 2000, Nature.

[11]  J. Shabanowitz,et al.  Composition and functional characterization of yeast 66S ribosome assembly intermediates. , 2001, Molecular cell.

[12]  Chung-Yen Lin,et al.  Hubba: hub objects analyzer—a framework of interactome hubs identification for network biology , 2008, Nucleic Acids Res..

[13]  Shifeng Xue,et al.  Specialized ribosomes: a new frontier in gene regulation and organismal biology , 2012, Nature Reviews Molecular Cell Biology.

[14]  P. Bonacich Power and Centrality: A Family of Measures , 1987, American Journal of Sociology.

[15]  Daniel N. Wilson,et al.  Structures of the human and Drosophila 80S ribosome , 2013, Nature.

[16]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[17]  Nazar Zaki,et al.  Prediction of Protein-Protein Interactions Using Pairwise Alignment and Inter-Domain Linker Region , 2008, Eng. Lett..

[18]  Yi Pan,et al.  Essential Proteins Discovery from Weighted Protein Interaction Networks , 2010, ISBRA.

[19]  Bruce Stillman,et al.  Yph1p, an ORC-Interacting Protein Potential Links between Cell Proliferation Control, DNA Replication, and Ribosome Biogenesis , 2002, Cell.

[20]  A. Barabasi,et al.  Lethality and centrality in protein networks , 2001, Nature.

[21]  Scott J. Hultgren,et al.  Functional Genomic Studies of Uropathogenic Escherichia coli and Host Urothelial Cells when Intracellular Bacterial Communities Are Assembled* , 2007, Journal of Biological Chemistry.

[22]  Jianzhi Zhang,et al.  Why Do Hubs Tend to Be Essential in Protein Networks? , 2006, PLoS genetics.

[23]  D. Ingber,et al.  High-Betweenness Proteins in the Yeast Protein Interaction Network , 2005, Journal of biomedicine & biotechnology.

[24]  Peer Bork,et al.  OGEE: an online gene essentiality database , 2011, Nucleic Acids Res..

[25]  Ioannis Xenarios,et al.  DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions , 2002, Nucleic Acids Res..

[26]  Benjamin Audit,et al.  An exponential core in the heart of the yeast protein interaction network. , 2005, Molecular biology and evolution.

[27]  S. L. Wong,et al.  A Map of the Interactome Network of the Metazoan C. elegans , 2004, Science.

[28]  Matthew W. Hahn,et al.  Comparative genomics of centrality and essentiality in three eukaryotic protein-interaction networks. , 2005, Molecular biology and evolution.

[29]  J. Woolford,et al.  Role of the yeast Rrp1 protein in the dynamics of pre-ribosome maturation. , 2004, RNA.

[30]  P. Gleizes,et al.  Sequential Protein Association with Nascent 60S Ribosomal Particles , 2003, Molecular and Cellular Biology.

[31]  T. Jukes CHAPTER 24 – Evolution of Protein Molecules , 1969 .

[32]  Igor Jurisica,et al.  Functional topology in a network of protein interactions , 2004, Bioinform..

[33]  S. Jeffery Evolution of Protein Molecules , 1979 .

[34]  J. A. Rodríguez-Velázquez,et al.  Subgraph centrality in complex networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[35]  John R. Yates,et al.  Tra1p Is a Component of the Yeast Ada·Spt Transcriptional Regulatory Complexes* , 1998, The Journal of Biological Chemistry.

[36]  R. Aebersold,et al.  Molecular architecture of the 26S proteasome holocomplex determined by an integrative approach , 2012, Proceedings of the National Academy of Sciences.

[37]  G. Arndt,et al.  Genome‐wide screening for gene function using RNAi in mammalian cells , 2005, Immunology and cell biology.

[38]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[39]  B. Hong,et al.  Nop2p is required for pre-rRNA processing and 60S ribosome subunit synthesis in yeast , 1997, Molecular and cellular biology.

[40]  Dmitrij Frishman,et al.  MIPS: analysis and annotation of proteins from whole genomes in 2005 , 2006, Nucleic Acids Res..

[41]  CspC regulates rpoS transcript levels and complements hfq deletions. , 2010, Research in microbiology.

[42]  Kausik Si,et al.  The Saccharomyces cerevisiae TIF6 Gene Encoding Translation Initiation Factor 6 Is Required for 60S Ribosomal Subunit Biogenesis , 2001, Molecular and Cellular Biology.

[43]  P. Stadler,et al.  Centers of complex networks. , 2003, Journal of theoretical biology.

[44]  S. Gygi,et al.  Hexameric assembly of the proteasomal ATPases is templated through their C-termini , 2009, Nature.

[45]  Sanjay Kumar,et al.  Computational prediction of essential genes in an unculturable endosymbiotic bacterium, Wolbachia of Brugia malayi , 2009, BMC Microbiology.

[46]  Stefan Wuchty,et al.  Interaction and domain networks of yeast , 2002, Proteomics.

[47]  M. Zelen,et al.  Rethinking centrality: Methods and examples☆ , 1989 .

[48]  Yi Pan,et al.  Identification of Essential Proteins Based on Edge Clustering Coefficient , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[49]  M. Fromont-Racine,et al.  Nsa2 Is an Unstable, Conserved Factor Required for the Maturation of 27 SB Pre-rRNAs* , 2006, Journal of Biological Chemistry.

[50]  H. Bussey,et al.  Large‐scale essential gene identification in Candida albicans and applications to antifungal drug discovery , 2003, Molecular microbiology.

[51]  I. Moll,et al.  Effects of ribosomal proteins S1, S2 and the DeaD/CsdA DEAD‐box helicase on translation of leaderless and canonical mRNAs in Escherichia coli , 2002, Molecular microbiology.

[52]  Damian Szklarczyk,et al.  The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored , 2010, Nucleic Acids Res..

[53]  Ronald W. Davis,et al.  Functional profiling of the Saccharomyces cerevisiae genome , 2002, Nature.

[54]  L. Lindahl,et al.  Transcription of the s10 ribosomal protein operon is regulated by an attenuator in the leader , 1983, Cell.

[55]  Mark Gerstein,et al.  The Importance of Bottlenecks in Protein Networks: Correlation with Gene Essentiality and Expression Dynamics , 2007, PLoS Comput. Biol..

[56]  Shifeng Xue,et al.  Ribosome-Mediated Specificity in Hox mRNA Translation and Vertebrate Tissue Patterning , 2011, Cell.

[57]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[58]  H. Mori,et al.  Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection , 2006, Molecular systems biology.