How to Understand the Cell by Breaking It: Network Analysis of Gene Perturbation Screens

Functional genomics has demonstratedconsiderable success in inferring the innerworking of a cell through analysis of itsresponse to various perturbations. Inrecent years several technological advanc-es have pushed gene perturbation screensto the forefront of functional genomics.Most importantly, modern technologiesmake it possible to probe gene function ona genome-wide scale in many modelorganisms and human. For example, largecollections of knock-out mutants play aprominent role in the study of Saccharomycescerevisiae [1], and RNA interference (RNAi)has become a widely used high-through-put method to knock-down target genes ina wide range of organisms, includingDrosophila melanogaster, Caenorhabditis elegans,and human [2–4].Another major advance is the develop-ment of rich phenotypic descriptions byimaging or measuring molecular featuresglobally. Observed phenotypes can revealwhich genes are essential for an organism,or work in a particular pathway, or have aspecific cellular function. Combining high-throughput screening techniques with richphenotypes enables researchers to observedetailed reactions to experimental pertur-bations on a genome-wide scale. Thismakes gene perturbation screens one ofthe most promising tools in functionalgenomics.Advances in the design and analysis ofgene perturbation screens may have animmediate impact on many areas ofbiological and medical research. Newscreening and phenotyping techniquesoften directly translate into new insightsin gene and protein functions. Results ofperturbation screens can also reveal unex-ploited areas of potential therapeuticintervention. For example, a recent RNAiscreen showed that some of the mostcritical protein kinases for the proliferationand survival of cancer cell lines are alsothe least studied [5].A goal becoming more and moreprominent in both experimental as wellas computational research is to leveragegene perturbation screens to the identifi-cation of molecular interactions, cellularpathways, and regulatory mechanisms.Research focus is shifting from under-standing the phenotypes of single proteinsto understanding how proteins fulfill theirfunction, what other proteins they interactwith, and where they act in a pathway.Novel ideas on how to use perturbationscreens to uncover cellular wiring dia-grams can lead to a better understandingof how cellular networks are deregulatedin diseases like cancer. This knowledge isindispensable for finding new drug targetsto attack the drivers of a disease and notonly the symptoms.This review surveys the current state-of-the-art in analyzing single gene perturba-tion screens from a network point of view.We describe approaches to make the stepfrom the parts list to the wiring diagram byusing phenotypes for network inferenceand integrating them with complementarydata sources.

[1]  Alfred V. Aho,et al.  The Transitive Reduction of a Directed Graph , 1972, SIAM J. Comput..

[2]  Yuanfang Guan,et al.  A Genomewide Functional Network for the Laboratory Mouse , 2008, PLoS Comput. Biol..

[3]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[4]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[5]  Ronald W. Davis,et al.  Functional characterization of the S. cerevisiae genome by gene deletion and parallel analysis. , 1999, Science.

[6]  A. Fraser,et al.  A single gene network accurately predicts phenotypic effects of gene perturbation in Caenorhabditis elegans , 2008, Nature Genetics.

[7]  Robert Gentleman,et al.  Knowledge based identification of essential signaling from genome-scale siRNA experiments , 2009, BMC Systems Biology.

[8]  Lani F. Wu,et al.  Multidimensional Drug Profiling By Automated Microscopy , 2004, Science.

[9]  Roger E Bumgarner,et al.  Integrated genomic and proteomic analyses of a systematically perturbed metabolic network. , 2001, Science.

[10]  Nir Friedman,et al.  Inferring Cellular Networks Using Probabilistic Graphical Models , 2004, Science.

[11]  Achim Tresch,et al.  Modeling the temporal interplay of molecular signaling and gene expression by using dynamic nested effects models , 2009, Proceedings of the National Academy of Sciences.

[12]  Michael Boutros,et al.  Identification of JAK/STAT signalling components by genome-wide RNA interference , 2005, Nature.

[13]  Isaac Meilijson,et al.  Can single knockouts accurately single out gene functions? , 2008, BMC Systems Biology.

[14]  Anne E Carpenter,et al.  CellProfiler: image analysis software for identifying and quantifying cell phenotypes , 2006, Genome Biology.

[15]  Kristin C. Gunsalus,et al.  RNAiDB and PhenoBlast: web tools for genome-wide phenotypic mapping projects , 2004, Nucleic Acids Res..

[16]  I. Simon,et al.  Backup in gene regulatory networks explains differences between binding and knockout results , 2009, Molecular systems biology.

[17]  Fengzhu Sun,et al.  A network-based integrative approach to prioritize reliable hits from multiple genome-wide RNAi screens in Drosophila , 2009, BMC Genomics.

[18]  Eytan Ruppin,et al.  Inferring Functional Pathways from Multi-Perturbation Data , 2006, ISMB.

[19]  Achim Tresch,et al.  Discrimination of Direct and Indirect Interactions in a Network of Regulatory Effects , 2007, J. Comput. Biol..

[20]  Holger Fröhlich,et al.  Large scale statistical inference of signaling pathways from RNAi and microarray data , 2007, BMC Bioinformatics.

[21]  Xiaobo Zhou,et al.  Using iterative cluster merging with improved gap statistics to perform online phenotype discovery in the context of high-throughput RNAi screens , 2008, BMC Bioinformatics.

[22]  Ruth R. Montgomery,et al.  RNA interference screen for human genes associated with West Nile virus infection , 2008, Nature.

[23]  Christian von Mering,et al.  STRING 8—a global view on proteins and their functional interactions in 630 organisms , 2008, Nucleic Acids Res..

[24]  Edward M. Marcotte,et al.  Mechanisms of Cell Cycle Control Revealed by a Systematic and Quantitative Overexpression Screen in S. cerevisiae , 2008, PLoS genetics.

[25]  Prahlad T. Ram,et al.  Formation of Regulatory Patterns During Signal Propagation in a Mammalian Cellular Network , 2005, Science.

[26]  Nir Friedman,et al.  Inferring subnetworks from perturbed expression profiles , 2001, ISMB.

[27]  C. Bakal,et al.  Quantitative Morphological Signatures Define Local Signaling Networks Regulating Cell Morphology , 2007, Science.

[28]  Joshua M. Stuart,et al.  A Factor Graph Nested Effects Model To Identify Networks from Genetic Perturbations , 2009, PLoS Comput. Biol..

[29]  G. Sumara,et al.  A Probabilistic Functional Network of Yeast Genes , 2004 .

[30]  K. Sachs,et al.  Causal Protein-Signaling Networks Derived from Multiparameter Single-Cell Data , 2005, Science.

[31]  Michael Boutros,et al.  The art and design of genetic screens: RNA interference , 2008, Nature Reviews Genetics.

[32]  Tommi S. Jaakkola,et al.  Physical Network Models , 2004, J. Comput. Biol..

[33]  R. König,et al.  Global Analysis of Host-Pathogen Interactions that Regulate Early-Stage HIV-1 Replication , 2008, Cell.

[34]  Markus J. Herrgård,et al.  Network-based prediction of human tissue-specific metabolism , 2008, Nature Biotechnology.

[35]  Roland Eils,et al.  RNAither, an automated pipeline for the statistical analysis of high-throughput RNAi screens , 2009, Bioinform..

[36]  H. Erfle,et al.  High-throughput RNAi screening by time-lapse imaging of live human cells , 2006, Nature Methods.

[37]  C. Pál,et al.  Metabolic network analysis of the causes and evolution of enzyme dispensability in yeast , 2004, Nature.

[38]  Robert M. Seymour,et al.  Using large-scale perturbations in gene network reconstruction , 2005, BMC Bioinformatics.

[39]  Olga G. Troyanskaya,et al.  GOLEM: an interactive graph-based gene-ontology navigation and analysis tool , 2006, BMC Bioinformatics.

[40]  Wolfgang Huber,et al.  Analysis of cell-based RNAi screens , 2006, Genome Biology.

[41]  Michael Boutros,et al.  Cellular phenotyping by RNAi. , 2006, Briefings in functional genomics & proteomics.

[42]  David M. Sabatini,et al.  Building mammalian signalling pathways with RNAi screens , 2006, Nature Reviews Molecular Cell Biology.

[43]  Polina Golland,et al.  Scoring diverse cellular morphologies in image-based screens with iterative feedback and machine learning , 2009, Proceedings of the National Academy of Sciences.

[44]  E. Schadt,et al.  Integrating siRNA and protein-protein interaction data to identify an expanded insulin signaling network. , 2009, Genome research.

[45]  Daniel E. Zak,et al.  Importance of input perturbations and stochastic gene expression in the reverse engineering of genetic regulatory networks: insights from an identifiability analysis of an in silico network. , 2003, Genome research.

[46]  Gordon K. Smyth,et al.  limma: Linear Models for Microarray Data , 2005 .

[47]  Bonnie Berger,et al.  RNAiCut: automated detection of significant genes from functional genomic screens , 2009, Nature Methods.

[48]  N. Hacohen,et al.  Highly parallel identification of essential genes in cancer cells , 2008, Proceedings of the National Academy of Sciences.

[49]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[50]  Benno Schwikowski,et al.  Discovering regulatory and signalling circuits in molecular interaction networks , 2002, ISMB.

[51]  A. Barabasi,et al.  Predicting synthetic rescues in metabolic networks , 2008, Molecular systems biology.

[52]  Ronald W. Davis,et al.  Functional profiling of the Saccharomyces cerevisiae genome , 2002, Nature.

[53]  T. Hughes,et al.  Mapping pathways and phenotypes by systematic gene overexpression. , 2006, Molecular cell.

[54]  Stephen T. C. Wong,et al.  Cellular Phenotype Recognition for High-Content RNA Interference Genome-Wide Screening , 2008, Journal of biomolecular screening.

[55]  Achim Tresch,et al.  Nested effects models for learning signaling networks from perturbation data , 2009, Biometrical journal. Biometrische Zeitschrift.

[56]  M. Vingron,et al.  Elucidating regulatory mechanisms downstream of a signaling pathway using informative experiments , 2009, Molecular systems biology.

[57]  Matthew A. Hibbs,et al.  Discovery of biological networks from diverse functional genomic data , 2005, Genome Biology.

[58]  Wolfgang Huber,et al.  Statistical methods and software for the analysis of highthroughput reverse genetic assays using flow cytometry readouts , 2006, Genome Biology.

[59]  Holger Fröhlich,et al.  Analyzing gene perturbation screens with nested effects models in R and bioconductor , 2008, Bioinform..

[60]  Kara Dolinski,et al.  The BioGRID Interaction Database: 2008 update , 2008, Nucleic Acids Res..

[61]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[62]  Jaak Vilo,et al.  Building and analysing genome-wide gene disruption networks , 2002, ECCB.

[63]  R. Aebersold,et al.  Applying mass spectrometry-based proteomics to genetics, genomics and network biology , 2009, Nature Reviews Genetics.

[64]  R. König,et al.  A probability-based approach for the analysis of large-scale RNAi screens , 2007, Nature Methods.

[65]  V. Thorsson,et al.  Discovery of regulatory interactions through perturbation: inference and experimental design. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[66]  Ron Shamir,et al.  A Probabilistic Methodology for Integrating Knowledge and Experiments on Biological Networks , 2006, J. Comput. Biol..

[67]  C. Sander,et al.  Models from experiments: combinatorial drug perturbations of cancer cells , 2008, Molecular systems biology.

[68]  Rafael A. Irizarry,et al.  Bioinformatics and Computational Biology Solutions using R and Bioconductor , 2005 .

[69]  Ron Shamir,et al.  Identification of functional modules using network topology and high-throughput data , 2007, BMC Systems Biology.

[70]  J. Davis Bioinformatics and Computational Biology Solutions Using R and Bioconductor , 2007 .

[71]  M. Xiong,et al.  Identification of genetic networks. , 2004, Genetics.

[72]  H. Lehrach,et al.  A Human Protein-Protein Interaction Network: A Resource for Annotating the Proteome , 2005, Cell.

[73]  Bernhard O. Palsson,et al.  Context-Specific Metabolic Networks Are Consistent with Experiments , 2008, PLoS Comput. Biol..

[74]  N. Perrimon,et al.  Sequential activation of signaling pathways during innate immune responses in Drosophila. , 2002, Developmental cell.

[75]  Martin Vingron,et al.  Ontologizer 2.0 - a multifunctional tool for GO term enrichment analysis and data exploration , 2008, Bioinform..

[76]  Andreas Wagner,et al.  How to reconstruct a large genetic network from n gene perturbations in fewer than n2 easy steps , 2001, Bioinform..

[77]  Randall T Moon,et al.  Integrative Analysis of Genome-Wide RNA Interference Screens , 2009, Science Signaling.

[78]  Aideen Long,et al.  Statistical methods for analysis of high-throughput RNA interference screens , 2009, Nature Methods.

[79]  F. Markowetz,et al.  Systems-level dynamic analyses of fate change in murine embryonic stem cells , 2009, Nature.

[80]  Yudong D. He,et al.  Functional Discovery via a Compendium of Expression Profiles , 2000, Cell.

[81]  Achim Tresch,et al.  Structure Learning in Nested Effects Models , 2007, Statistical applications in genetics and molecular biology.

[82]  Rainer Spang,et al.  Probabilistic Soft Interventions in Conditional Gaussian Networks , 2005, AISTATS.

[83]  Holger Fröhlich,et al.  Estimating large-scale signaling networks through nested effect models with intervention effects from microarray data , 2008, Bioinform..

[84]  Gavin Sherlock,et al.  Global analysis of gene function in yeast by quantitative phenotypic profiling , 2006, Molecular systems biology.

[85]  Stefan Thurner,et al.  A fast and efficient gene-network reconstruction method from multiple over-expression experiments , 2009, BMC Bioinformatics.

[86]  Gary D Bader,et al.  Global Mapping of the Yeast Genetic Interaction Network , 2004, Science.

[87]  Arun K. Ramani,et al.  Protein interaction networks from yeast to human. , 2004, Current opinion in structural biology.

[88]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[89]  Gregory F. Cooper,et al.  An evaluation of a system that recommends microarray experiments to perform to discover gene-regulation pathways , 2004, Artif. Intell. Medicine.

[90]  J. Collins,et al.  A network biology approach to aging in yeast , 2009, Proceedings of the National Academy of Sciences.

[91]  Lorenz Wernisch,et al.  Reconstruction of gene networks using Bayesian learning and manipulation experiments , 2004, Bioinform..

[92]  Radu Dobrin,et al.  Dissecting self-renewal in stem cells with RNA interference , 2006, Nature.

[93]  Lars Kaderali,et al.  Reconstructing signaling pathways from RNAi data using probabilistic Boolean threshold networks , 2009, Bioinform..

[94]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[95]  T. Jaakkola,et al.  Validation and refinement of gene-regulatory pathways on a network of physical interactions , 2005, Genome Biology.

[96]  Tobias Müller,et al.  Identifying functional modules in protein–protein interaction networks: an integrated exact approach , 2008, ISMB.

[97]  Olga G. Troyanskaya,et al.  Nested effects models for high-dimensional phenotyping screens , 2007, ISMB/ECCB.

[98]  Bassem A. Hassan,et al.  Gene prioritization through genomic data fusion , 2006, Nature Biotechnology.

[99]  J. Hasty,et al.  Reverse engineering gene networks: Integrating genetic perturbations with dynamical modeling , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[100]  Marco Grzegorczyk,et al.  Comparative evaluation of reverse engineering gene regulatory networks with relevance networks, graphical gaussian models and bayesian networks , 2006, Bioinform..

[101]  W. Wong,et al.  Learning Causal Bayesian Network Structures From Experimental Data , 2008 .

[102]  Monica L. Mo,et al.  Global reconstruction of the human metabolic network based on genomic and bibliomic data , 2007, Proceedings of the National Academy of Sciences.

[103]  Zachary D. Smith,et al.  Unbiased Reconstruction of a Mammalian Transcriptional Network Mediating Pathogen Responses , 2009 .

[104]  Stephen S Fong,et al.  Metabolic gene–deletion strains of Escherichia coli evolve to computationally predicted growth phenotypes , 2004, Nature Genetics.

[105]  Simon Rogers,et al.  A Bayesian regression approach to the inference of regulatory networks from gene expression data , 2005, Bioinform..

[106]  Rainer Spang,et al.  Inferring cellular networks – a review , 2007, BMC Bioinformatics.

[107]  M. Rockman,et al.  Reverse engineering the genotype–phenotype map with natural genetic variation , 2008, Nature.

[108]  Markus J. Herrgård,et al.  A consensus yeast metabolic network reconstruction obtained from a community approach to systems biology , 2008, Nature Biotechnology.

[109]  Roded Sharan,et al.  SPINE: a framework for signaling-regulatory pathway inference from cause-effect experiments , 2007, ISMB/ECCB.

[110]  Rainer Spang,et al.  Non-transcriptional pathway features reconstructed from secondary effects of RNA interference , 2005, Bioinform..

[111]  Thomas Lengauer,et al.  Improved scoring of functional groups from gene expression data by decorrelating GO graph structure , 2006, Bioinform..

[112]  J. Collins,et al.  Chemogenomic profiling on a genome-wide scale using reverse-engineered gene networks , 2005, Nature Biotechnology.

[113]  J. Collins,et al.  Inferring Genetic Networks and Identifying Compound Mode of Action via Expression Profiling , 2003, Science.

[114]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.