Topological Analysis of Biological Networks

Declaration I hereby declare that this thesis is entirely my own work except where otherwise indicated. I have used only the resources given in the list of references. Abstract An astonishing diversity of biological concepts and processes can be represented by the relatively simple formalism of networks, which consist of nodes connected by edges. Examples range from neuron communication to metabolism and gene suppression mechanisms. The pursuit of describing many interactions and dependencies between the constituents of a living organism has produced large networks of interacting proteins, coexpressed genes, metabolite transformations and other. With advances in large-scale experimental technologies, such as microarray experiments and yeast-two-hybrid screens, data are constantly increasing both in size and complexity. Additional techniques, such as literature curation and prediction methods diversify the biological data in the available repositories. The development of a unified framework for the analysis and integration of these data has appeared as a central challenge in bioinformatics research. This work presents two applications for the manipulation and investigation of biological networks. The Cytoscape plugin named NetworkAnalyzer facilitates topological analysis of large networks through computation and visualization of a variety of network parameters. Simultaneous inspection of analysis results of two or more networks enables network topology comparison. The stand-alone tool NetworkLoader compiles interaction networks from different molecular interaction datasets, providing flexible criteria for data filtering and integration of identifiers. The successful application of NetworkLoader and NetworkAnalyzer in handling protein-protein interaction data, as well as community demand for the functionality provided by NetworkAnalyzer encourages future support and development of these applications. In addition, we introduce an adaptable approach for quantifying the topological role of a subset of nodes in a network. The approach consists of estimating p-values for a collection of topological properties., It application to groups of polyanion-binding proteins within protein-protein interaction networks in yeast and human revealed intriguing topological features of some of the analyzed groups.

[1]  Shmuel Sattath,et al.  How reliable are experimental protein-protein interaction data? , 2003, Journal of molecular biology.

[2]  K. N. Chandrika,et al.  Analysis of the human protein interactome and comparison with yeast, worm and fly interaction datasets , 2006, Nature Genetics.

[3]  Markus J. Herrgård,et al.  Untangling the web of functional and physical interactions in yeast , 2005, Journal of biology.

[4]  J. Doyle,et al.  Some protein interaction data do not exhibit power law statistics , 2005, FEBS letters.

[5]  G. Sanyal,et al.  Effect of polyanions on the refolding of human acidic fibroblast growth factor. , 1991, The Journal of biological chemistry.

[6]  J. Weitzman,et al.  A road map of yeast interactions , 2005, Journal of biology.

[7]  C. Sander,et al.  The HUPO PSI's Molecular Interaction format—a community standard for the representation of protein interaction data , 2004, Nature Biotechnology.

[8]  Martin Vingron,et al.  IntAct: an open source molecular interaction database , 2004, Nucleic Acids Res..

[9]  H. Lehrach,et al.  A protein interaction network links GIT1, an enhancer of huntingtin aggregation, to Huntington's disease. , 2004, Molecular cell.

[10]  M. Gerstein,et al.  Genomic analysis of the hierarchical structure of regulatory networks , 2006, Proceedings of the National Academy of Sciences.

[11]  C. R. Middaugh,et al.  Potential use of non-classical pathways for the transport of macromolecular drugs , 2000, Expert opinion on investigational drugs.

[12]  David F. Burke,et al.  Crystal structure of fibroblast growth factor receptor ectodomain bound to ligand and heparin , 2000, Nature.

[13]  Jian Wang,et al.  Protein interaction networks of Saccharomyces cerevisiae, Caenorhabditis elegans and Drosophila melanogaster: Large‐scale organization and robustness , 2006, Proteomics.

[14]  Béla Bollobás,et al.  Robustness and Vulnerability of Scale-Free Random Graphs , 2004, Internet Math..

[15]  M. Handcock,et al.  An assessment of preferential attachment as a mechanism for human sexual network formation , 2003, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[16]  Bernhard O Palsson,et al.  Hierarchical thinking in network biology: the unbiased modularization of biochemical networks. , 2004, Trends in biochemical sciences.

[17]  L. Castagnoli,et al.  Methods to reveal domain networks. , 2005, Drug discovery today.

[18]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[19]  M. Weigt,et al.  On the properties of small-world network models , 1999, cond-mat/9903411.

[20]  P. Erdos,et al.  On the evolution of random graphs , 1984 .

[21]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[22]  A. Barabasi,et al.  Hierarchical Organization of Modularity in Metabolic Networks , 2002, Science.

[23]  Nicola J. Rinaldi,et al.  Transcriptional Regulatory Networks in Saccharomyces cerevisiae , 2002, Science.

[24]  E. Hall,et al.  The nature of biotechnology. , 1988, Journal of biomedical engineering.

[25]  Mark Gerstein,et al.  Predicting interactions in protein networks by completing defective cliques , 2006, Bioinform..

[26]  Stuart L. Schreiber,et al.  SpectralNET – an application for spectral graph analysis and visualization , 2005, BMC Bioinformatics.

[27]  Marc Vidal,et al.  Predictive models of molecular machines involved in Caenorhabditis elegans early embryogenesis , 2005, Nature.

[28]  Reinhard Diestel,et al.  Graph Theory , 1997 .

[29]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[30]  S. Wuchty Scale-free behavior in protein domain networks. , 2001, Molecular biology and evolution.

[31]  S. Tumova,et al.  Heparan sulfate proteoglycans on the cell surface: versatile coordinators of cellular functions. , 2000, The international journal of biochemistry & cell biology.

[32]  U. Alon Biological Networks: The Tinkerer as an Engineer , 2003, Science.

[33]  L Holm,et al.  Algorithms for protein interaction networks. , 2005, Biochemical Society transactions.

[34]  Gary D Bader,et al.  Systematic Genetic Analysis with Ordered Arrays of Yeast Deletion Mutants , 2001, Science.

[35]  Albert-László Barabási,et al.  Error and attack tolerance of complex networks , 2000, Nature.

[36]  M. Gerstein,et al.  Global Analysis of Protein Activities Using Proteome Chips , 2001, Science.

[37]  Lan V. Zhang,et al.  Evidence for dynamically organized modularity in the yeast protein–protein interaction network , 2004, Nature.

[38]  M. Gerstein,et al.  Genomic analysis of regulatory network dynamics reveals large topological changes , 2004, Nature.

[39]  Qiong Gao,et al.  Resources for integrative systems biology: from data through databases to networks and dynamic system models , 2006, Briefings Bioinform..

[40]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[41]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[42]  S. L. Wong,et al.  Towards a proteome-scale map of the human protein–protein interaction network , 2005, Nature.

[43]  Chris F. Taylor,et al.  Autumn 2005 Workshop of the Human Proteome Organisation Proteomics Standards Initiative (HUPO‐PSI) Geneva, September, 4–6, 2005 , 2006, Proteomics.

[44]  R. Russell,et al.  Structure‐based systems biology: a zoom lens for the cell , 2005, FEBS letters.

[45]  Shoshana J. Wodak,et al.  GenePro: a cytoscape plug-in for advanced visualization and analysis of interaction networks , 2006, Bioinform..

[46]  S. Wuchty,et al.  Evolutionary cores of domain co-occurrence networks , 2005, BMC Evolutionary Biology.

[47]  Quaid Morris,et al.  Transcriptional networks: reverse-engineering gene regulation on a global scale. , 2004, Current opinion in microbiology.

[48]  Alan M. Frieze,et al.  Random graphs , 2006, SODA '06.

[49]  M. Gerstein,et al.  Genomic analysis of essentiality within protein networks. , 2004, Trends in genetics : TIG.

[50]  M. Gerstein,et al.  Global analysis of protein phosphorylation in yeast , 2005, Nature.

[51]  T. Sittler,et al.  The Plasmodium protein network diverges from those of other eukaryotes , 2005, Nature.

[52]  Kyongbum Lee,et al.  An algorithm for modularity analysis of directed and weighted biological networks based on edge-betweenness centrality , 2006, Bioinform..

[53]  C. Russell Middaugh,et al.  Formulation Design of Acidic Fibroblast Growth Factor , 1993, Pharmaceutical Research.

[54]  An-Ping Zeng,et al.  Decomposition of metabolic network into functional modules based on the global connectivity structure of reaction graph , 2004, Bioinform..

[55]  C. Deane,et al.  Protein Interactions , 2002, Molecular & Cellular Proteomics.

[56]  Christian von Mering,et al.  STRING: known and predicted protein–protein associations, integrated and transferred across organisms , 2004, Nucleic Acids Res..

[57]  Carsten Wiuf,et al.  Subnets of scale-free networks are not scale-free: sampling properties of networks. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[58]  Michael E. Cusick,et al.  Yeast Protein Interactome topology provides framework for coordinated-functionality , 2006 .

[59]  A. Barabasi,et al.  Scale-free characteristics of random networks: the topology of the world-wide web , 2000 .

[60]  A. Elofsson,et al.  What properties characterize the hub proteins of the protein-protein interaction network of Saccharomyces cerevisiae? , 2006, Genome Biology.

[61]  Michel L. Goldstein,et al.  Problems with fitting to the power-law distribution , 2004, cond-mat/0402322.

[62]  Ioannis Xenarios,et al.  DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions , 2002, Nucleic Acids Res..

[63]  Caroline C. Friedel,et al.  Toward the complete interactome , 2006, Nature Biotechnology.

[64]  B Bhattacharyya,et al.  Chaperone-like Activity of Tubulin* , 1998, The Journal of Biological Chemistry.

[65]  Vladimir Batagelj,et al.  Pajek - Program for Large Network Analysis , 1999 .

[66]  Shoshana J. Wodak,et al.  CYGD: the Comprehensive Yeast Genome Database , 2004, Nucleic Acids Res..

[67]  Béla Bollobás,et al.  The Diameter of a Scale-Free Random Graph , 2004, Comb..

[68]  Mike Tyers,et al.  BioGRID: a general repository for interaction datasets , 2005, Nucleic Acids Res..

[69]  Zhenjun Hu,et al.  VisANT: data-integrating visual framework for biological networks and modules , 2005, Nucleic Acids Res..

[70]  K. Sneppen,et al.  Specificity and Stability in Topology of Protein Networks , 2002, Science.

[71]  D. Fell,et al.  The small world inside large metabolic networks , 2000, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[72]  M. Tyers,et al.  Osprey: a network visualization system , 2003, Genome Biology.

[73]  Andrei Z. Broder,et al.  Graph structure in the Web , 2000, Comput. Networks.

[74]  Albert-László Barabási,et al.  Internet: Diameter of the World-Wide Web , 1999, Nature.

[75]  Paul Shannon,et al.  System-based proteomic analysis of the interferon response in human liver cells , 2004, Genome Biology.

[76]  M. Vidal,et al.  Interactome: gateway into systems biology. , 2005, Human molecular genetics.

[77]  Dmitrij Frishman,et al.  MIPS: analysis and annotation of proteins from whole genomes in 2005 , 2006, Nucleic Acids Res..

[78]  BMC Bioinformatics , 2005 .

[79]  U. Stelzl,et al.  The value of high quality protein-protein interaction networks for systems biology. , 2006, Current opinion in chemical biology.

[80]  B. Bollobás The evolution of random graphs , 1984 .

[81]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[82]  J. Lunn Compartmentation in plant metabolism. , 2006, Journal of experimental botany.

[83]  Walter Willinger,et al.  Towards a Theory of Scale-Free Graphs: Definition, Properties, and Implications , 2005, Internet Math..

[84]  Dong Xu,et al.  Computational analyses of high-throughput protein-protein interaction data. , 2003, Current protein & peptide science.

[85]  Marco Grzegorczyk,et al.  Statistics for Proteomics: A Review of Tools for Analyzing Experimental Data , 2006, Proteomics.

[86]  Benno Schwikowski,et al.  Graph-based methods for analysing networks in cell biology , 2006, Briefings Bioinform..

[87]  Hanno Steen,et al.  Development of human protein reference database as an initial platform for approaching systems biology in humans. , 2003, Genome research.

[88]  R Pastor-Satorras,et al.  Dynamical and correlation properties of the internet. , 2001, Physical review letters.

[89]  S. L. Wong,et al.  Motifs, themes and thematic maps of an integrated Saccharomyces cerevisiae interaction network , 2005, Journal of biology.

[90]  E. Triphosphat,et al.  FEBS Letters , 1987, FEBS Letters.

[91]  Ian M. Donaldson,et al.  The Biomolecular Interaction Network Database and related tools 2005 update , 2004, Nucleic Acids Res..

[92]  M. Gerstein,et al.  TopNet: a tool for comparing biological sub-networks, correlating protein properties with topological statistics. , 2004, Nucleic acids research.

[93]  B. Berger,et al.  Herpesviral Protein Networks and Their Interaction with the Human Proteome , 2006, Science.

[94]  Baldomero Oliva,et al.  PIANA: protein interactions and network analysis , 2006, Bioinform..

[95]  김삼묘,et al.  “Bioinformatics” 특집을 내면서 , 2000 .

[96]  H. Lehrach,et al.  A Human Protein-Protein Interaction Network: A Resource for Annotating the Proteome , 2005, Cell.

[97]  Silvio C. E. Tosatto,et al.  Decomposing protein networks into domain-domain interactions , 2005, ECCB/JBI.

[98]  Mark Newman,et al.  Detecting community structure in networks , 2004 .

[99]  P. Bork,et al.  Dynamic Complex Formation During the Yeast Cell Cycle , 2005, Science.

[100]  M. Vidal,et al.  Effect of sampling on topology predictions of protein-protein interaction networks , 2005, Nature Biotechnology.

[101]  L. Amaral,et al.  The web of human sexual contacts , 2001, Nature.

[102]  Robert T. Sauer,et al.  Acceleration of the refolding of Arc repressor by nucleic acids and other polyanions , 1999, Nature Structural Biology.

[103]  P. O’Hare,et al.  Conformational Lability of Herpesvirus Protein VP22* , 2000, The Journal of Biological Chemistry.

[104]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[105]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[106]  A. Barabasi,et al.  A Protein–Protein Interaction Network for Human Inherited Ataxias and Disorders of Purkinje Cell Degeneration , 2006, Cell.

[107]  A. Bergman,et al.  Functional and evolutionary inference in gene networks: does topology matter? , 2006, Genetica.

[108]  Razvan C. Bunescu,et al.  Consolidating the set of known human protein-protein interactions in preparation for large-scale mapping of the human interactome , 2005, Genome Biology.

[109]  R. Karp,et al.  From the Cover : Conserved patterns of protein interaction in multiple species , 2005 .

[110]  R. Ozawa,et al.  A comprehensive two-hybrid analysis to explore the yeast protein interactome , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[111]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[112]  R. Albert Scale-free networks in cell biology , 2005, Journal of Cell Science.

[113]  Mark Gerstein,et al.  Analyzing cellular biochemistry in terms of molecular networks. , 2003, Annual review of biochemistry.

[114]  D. Volkin,et al.  Partially structured self-associating states of acidic fibroblast growth factor. , 1993, Biochemistry.

[115]  P K Tsai,et al.  Physical stabilization of acidic fibroblast growth factor by polyanions. , 1993, Archives of biochemistry and biophysics.

[116]  Mario Albrecht,et al.  A Network-based Analysis of Polyanion-binding Proteins Utilizing Human Protein Arrays* , 2007, Journal of Biological Chemistry.

[117]  J. Rothberg,et al.  Gaining confidence in high-throughput protein interaction networks , 2004, Nature Biotechnology.

[118]  J. Mattick,et al.  Genome research , 1990, Nature.

[119]  Brian E Snydsman,et al.  Assigning function to yeast proteins by integration of technologies. , 2003, Molecular cell.

[120]  John Guare,et al.  Six Degrees of Separation: A Play , 1990 .

[121]  I. Madshus,et al.  Tight folding of acidic fibroblast growth factor prevents its translocation to the cytosol with diphtheria toxin as vector. , 1992, The EMBO journal.