PathFinder: mining signal transduction pathway segments from protein-protein interaction networks

BackgroundA Signal transduction pathway is the chain of processes by which a cell converts an extracellular signal into a response. In most unicellular organisms, the number of signal transduction pathways influences the number of ways the cell can react and respond to the environment. Discovering signal transduction pathways is an arduous problem, even with the use of systematic genomic, proteomic and metabolomic technologies. These techniques lead to an enormous amount of data and how to interpret and process this data becomes a challenging computational problem.ResultsIn this study we present a new framework for identifying signaling pathways in protein-protein interaction networks. Our goal is to find biologically significant pathway segments in a given interaction network. Currently, protein-protein interaction data has excessive amount of noise, e.g., false positive and false negative interactions. First, we eliminate false positives in the protein-protein interaction network by integrating the network with microarray expression profiles, protein subcellular localization and sequence information. In addition, protein families are used to repair false negative interactions. Then the characteristics of known signal transduction pathways and their functional annotations are extracted in the form of association rules.ConclusionGiven a pair of starting and ending proteins, our methodology returns candidate pathway segments between these two proteins with possible missing links (recovered false negatives). In our study, S. cerevisiae (yeast) data is used to demonstrate the effectiveness of our method.

[1]  R. Karp,et al.  From the Cover : Conserved patterns of protein interaction in multiple species , 2005 .

[2]  Yoshihiro Yamaguchi,et al.  Roles for the Two-hybrid System in Exploration of the Yeast Protein Interactome* , 2002, Molecular & Cellular Proteomics.

[3]  P. Uetz,et al.  Towards an understanding of complex protein networks. , 2001, Trends in cell biology.

[4]  J. Thorner,et al.  Sst2, a negative regulator of pheromone signaling in the yeast Saccharomyces cerevisiae: expression, localization, and genetic interaction and physical association with Gpa1 (the G-protein alpha subunit) , 1996, Molecular and cellular biology.

[5]  David Eisenberg,et al.  Bioinformatic identification of potential autocrine signaling loops in cancers from gene expression profiles , 2001, Nature Genetics.

[6]  R. Ozawa,et al.  A comprehensive two-hybrid analysis to explore the yeast protein interactome , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Jiong Yang,et al.  Analyzing and modeling large biological networks: inferring signal transduction pathways , 2007 .

[8]  K. Sachs,et al.  Causal Protein-Signaling Networks Derived from Multiparameter Single-Cell Data , 2005, Science.

[9]  J. Konopka,et al.  Genetic fine-structural analysis of the Saccharomyces cerevisiae alpha-pheromone receptor. , 1991, Cell regulation.

[10]  S. Fields,et al.  A novel genetic system to detect protein–protein interactions , 1989, Nature.

[11]  Roded Sharan,et al.  A direct comparison of protein interaction confidence assignment schemes , 2006, BMC Bioinformatics.

[12]  Thomas Lengauer,et al.  Analysis of Gene Expression Data with Pathway Scores , 2000, ISMB.

[13]  Roger E Bumgarner,et al.  Integrated genomic and proteomic analyses of a systematically perturbed metabolic network. , 2001, Science.

[14]  Gavin Sherlock,et al.  The Stanford Microarray Database accommodates additional microarray platforms and data formats , 2004, Nucleic Acids Res..

[15]  Mike Tyers,et al.  BioGRID: a general repository for interaction datasets , 2005, Nucleic Acids Res..

[16]  P. Pryciak,et al.  Interaction with the SH3 Domain Protein Bem1 Regulates Signaling by the Saccharomyces cerevisiae p21-Activated Kinase Ste20 , 2005, Molecular and Cellular Biology.

[17]  Ravi Iyengar,et al.  Modeling Signaling Networks , 2002, Science.

[18]  A. Grigoriev On the number of protein-protein interactions in the yeast proteome. , 2003, Nucleic acids research.

[19]  A. Grigoriev A relationship between gene expression and protein interactions on the proteome scale: analysis of the bacteriophage T7 and the yeast Saccharomyces cerevisiae. , 2001, Nucleic acids research.

[20]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[21]  Colin Cooper,et al.  Improved Duplication Models for Proteome Network Evolution , 2005, Systems Biology and Regulatory Genomics.

[22]  Yanjun Qi,et al.  Random Forest Similarity for Protein-Protein Interaction Prediction from Multiple Sources , 2004, Pacific Symposium on Biocomputing.

[23]  Mark Gerstein,et al.  Bridging structural biology and genomics: assessing protein interaction data with known complexes. , 2002, Drug discovery today.

[24]  Ravi Iyengar,et al.  Quantitative Information Management for the Biochemical Computation of Cellular Networks , 2004, Science's STKE.

[25]  Marc Vidal,et al.  Yeast Two-hybrid Systems and Protein Interaction Mapping Projects for Yeast and Worm , 2022 .

[26]  Dmitrij Frishman,et al.  MIPS: a database for genomes and protein sequences , 1999, Nucleic Acids Res..

[27]  Edgar Wingender,et al.  Consistent re-modeling of signaling pathways and its implementation in the TRANSPATH database. , 2004, Genome informatics. International Conference on Genome Informatics.

[28]  M. Vidal,et al.  Protein interaction mapping in C. elegans using proteins involved in vulval development. , 2000, Science.

[29]  J. Rothberg,et al.  Gaining confidence in high-throughput protein interaction networks , 2004, Nature Biotechnology.

[30]  C. Deane,et al.  Protein Interactions , 2002, Molecular & Cellular Proteomics.

[31]  H. Mewes,et al.  The FunCat, a functional annotation scheme for systematic classification of proteins from whole genomes. , 2004, Nucleic acids research.

[32]  B. Snel,et al.  Comparative assessment of large-scale data sets of protein–protein interactions , 2002, Nature.

[33]  James R. Knight,et al.  A Protein Interaction Map of Drosophila melanogaster , 2003, Science.

[34]  S. L. Wong,et al.  A Map of the Interactome Network of the Metazoan C. elegans , 2004, Science.

[35]  Jin Ho Yoon,et al.  Recruitment of the Swi/Snf Complex by Ste12-Tec1 Promotes Flo8-Mss11-Mediated Activation of STA1 Expression , 2004, Molecular and Cellular Biology.

[36]  P. Bork,et al.  Functional organization of the yeast proteome by systematic analysis of protein complexes , 2002, Nature.

[37]  Roded Sharan,et al.  Efficient Algorithms for Detecting Signaling Pathways in Protein Interaction Networks , 2006, J. Comput. Biol..

[38]  James R. Knight,et al.  A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae , 2000, Nature.

[39]  E. O’Shea,et al.  Global analysis of protein localization in budding yeast , 2003, Nature.

[40]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[41]  Martin Steffen,et al.  Automated modelling of signal transduction networks , 2002, BMC Bioinformatics.

[42]  G. Mendel,et al.  Mendel's Principles of Heredity , 1910, Nature.

[43]  Yin Liu,et al.  A computational approach for ordering signal transduction pathway components from genomics and proteomics Data , 2004, BMC Bioinformatics.

[44]  Nancy R. Gough,et al.  Focus Issue: Cell Signaling—Making New Connections , 2004, Science's STKE.

[45]  Charles Boone,et al.  Fus1p Interacts With Components of the Hog1p Mitogen-Activated Protein Kinase and Cdc42p Morphogenesis Signaling Pathways to Control Cell Fusion During Yeast Mating , 2004, Genetics.

[46]  Ting Chen,et al.  Assessment of the reliability of protein-protein interactions and protein function prediction , 2002, Pacific Symposium on Biocomputing.

[47]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[48]  J. Hudson,et al.  C. elegans ORFeome version 1.1: experimental verification of the genome annotation and resource for proteome-scale protein expression , 2003, Nature Genetics.

[49]  Gary D Bader,et al.  Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry , 2002, Nature.

[50]  Rajeev Motwani,et al.  Beyond market baskets: generalizing association rules to correlations , 1997, SIGMOD '97.

[51]  Gerald R. Fink,et al.  MAP Kinases with Distinct Inhibitory Functions Impart Signaling Specificity during Yeast Differentiation , 1997, Cell.

[52]  Roded Sharan,et al.  QPath: a method for querying pathways in a protein-protein interaction network , 2006, BMC Bioinformatics.