Posterior Association Networks and Functional Modules Inferred from Rich Phenotypes of Gene Perturbations

Combinatorial gene perturbations provide rich information for a systematic exploration of genetic interactions. Despite successful applications to bacteria and yeast, the scalability of this approach remains a major challenge for higher organisms such as humans. Here, we report a novel experimental and computational framework to efficiently address this challenge by limiting the ‘search space’ for important genetic interactions. We propose to integrate rich phenotypes of multiple single gene perturbations to robustly predict functional modules, which can subsequently be subjected to further experimental investigations such as combinatorial gene silencing. We present posterior association networks (PANs) to predict functional interactions between genes estimated using a Bayesian mixture modelling approach. The major advantage of this approach over conventional hypothesis tests is that prior knowledge can be incorporated to enhance predictive power. We demonstrate in a simulation study and on biological data, that integrating complementary information greatly improves prediction accuracy. To search for significant modules, we perform hierarchical clustering with multiscale bootstrap resampling. We demonstrate the power of the proposed methodologies in applications to Ewing's sarcoma and human adult stem cells using publicly available and custom generated data, respectively. In the former application, we identify a gene module including many confirmed and highly promising therapeutic targets. Genes in the module are also significantly overrepresented in signalling pathways that are known to be critical for proliferation of Ewing's sarcoma cells. In the latter application, we predict a functional network of chromatin factors controlling epidermal stem cell fate. Further examinations using ChIP-seq, ChIP-qPCR and RT-qPCR reveal that the basis of their genetic interactions may arise from transcriptional cross regulation. A Bioconductor package implementing PAN is freely available online at http://bioconductor.org/packages/release/bioc/html/PANR.html.

[1]  C. Bakal,et al.  Quantitative Morphological Signatures Define Local Signaling Networks Regulating Cell Morphology , 2007, Science.

[2]  Hidetoshi Shimodaira An approximately unbiased test of phylogenetic tree selection. , 2002, Systematic biology.

[3]  E. Kleinerman,et al.  Delta-Like Ligand 4 Plays a Critical Role in Pericyte/Vascular Smooth Muscle Cell Formation during Vasculogenesis and Tumor Vessel Expansion in Ewing's Sarcoma , 2010, Clinical Cancer Research.

[4]  Weiwei Zhong,et al.  Genome-Wide Prediction of C. elegans Genetic Interactions , 2006, Science.

[5]  N. Perrimon,et al.  High-throughput RNAi screening in cultured cells: a user's guide , 2006, Nature Reviews Genetics.

[6]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[7]  Hidetoshi Shimodaira,et al.  Pvclust: an R package for assessing the uncertainty in hierarchical clustering , 2006, Bioinform..

[8]  Gary D Bader,et al.  Global Mapping of the Yeast Genetic Interaction Network , 2004, Science.

[9]  Eric D Brown,et al.  Chemical probes of Escherichia coli uncovered through chemical-chemical interaction profiling with compounds of known biological activity. , 2010, Chemistry & biology.

[10]  David Baltimore,et al.  Cooperation of multiple signaling pathways in CD40-regulated gene expression in B lymphocytes , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[11]  S. Oliver,et al.  An integrated approach to characterize genetic interaction networks in yeast metabolism , 2011, Nature Genetics.

[12]  Brian D. Ripley,et al.  Modern applied statistics with S, 4th Edition , 2002, Statistics and computing.

[13]  Kobe Chuo-ku,et al.  Inhibition of PKC · Activation in Human Bone and Soft Tissue Sarcoma Cells by the Selective PKC Inhibitor PKC412 , 2008 .

[14]  Thomas Horn,et al.  GenomeRNAi: a database for cell-based RNAi phenotypes. 2009 update , 2009, Nucleic Acids Res..

[15]  Michael Boutros,et al.  The art and design of genetic screens: RNA interference , 2008, Nature Reviews Genetics.

[16]  Robert P. St.Onge,et al.  Defining genetic interaction , 2008, Proceedings of the National Academy of Sciences.

[17]  C. D. Litton,et al.  Theory of Probability (3rd Edition) , 1984 .

[18]  Nir Hacohen,et al.  Minimizing the risk of reporting false positives in large-scale RNAi screens , 2006, Nature Methods.

[19]  J. Bader,et al.  Finding friends and enemies in an enemies-only network: a graph diffusion kernel for predicting novel genetic interactions and co-complex membership from yeast genetic interactions. , 2008, Genome research.

[20]  Yuan Ji,et al.  Applications of beta-mixture models in bioinformatics , 2005, Bioinform..

[21]  M. Boutros,et al.  Clustering phenotype populations by genome-wide RNAi and multiparametric imaging , 2010, Molecular systems biology.

[22]  I. Olkin,et al.  Generating Correlation Matrices , 1984 .

[23]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[24]  Thomas Horn,et al.  GenomeRNAi: a database for cell-based RNAi phenotypes , 2006, Nucleic Acids Res..

[25]  Gary D Bader,et al.  Quantitative analysis of fitness and genetic interactions in yeast on a genome scale , 2010, Nature Methods.

[26]  William N. Venables,et al.  Modern Applied Statistics with S , 2010 .

[27]  F. Markowetz,et al.  RedeR: R/Bioconductor package for representing modular structures, nested networks and multiple levels of hierarchical associations , 2012, Genome Biology.

[28]  D. Koller,et al.  Automated identification of pathways from quantitative genetic interaction data , 2010, Molecular systems biology.

[29]  Wei Pan,et al.  Bioinformatics Original Paper Incorporating Gene Functions as Priors in Model-based Clustering of Microarray Gene Expression Data , 2022 .

[30]  Manuel Hidalgo,et al.  Phase I dose escalation study of the oral multi-CDK inhibitor PHA-848125 , 2008 .

[31]  Piero Picci,et al.  Contribution of MEK/MAPK and PI3‐K signaling pathway to the malignant behavior of Ewing's sarcoma cells: Therapeutic prospects , 2004, International journal of cancer.

[32]  Sandhya Rani,et al.  Human Protein Reference Database—2009 update , 2008, Nucleic Acids Res..

[33]  J. Felsenstein CONFIDENCE LIMITS ON PHYLOGENIES: AN APPROACH USING THE BOOTSTRAP , 1985, Evolution; international journal of organic evolution.

[34]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[35]  Jonathan Flint,et al.  Genetic architecture of quantitative traits in mice, flies, and humans. , 2009, Genome research.

[36]  Wolfgang Huber,et al.  Mapping of signaling networks through synthetic genetic interaction analysis by RNAi , 2011, Nature Methods.

[37]  A. Rzhetsky,et al.  Self-Correcting Maps of Molecular Pathways , 2006, PloS one.

[38]  R. Gentleman,et al.  Modeling synthetic lethality , 2008, Genome Biology.

[39]  Xin Wang,et al.  Bioinformatics Applications Note Systems Biology Htsanalyzer: an R/bioconductor Package for Integrated Network Analysis of High-throughput Screens , 2022 .

[40]  P. Deb Finite Mixture Models , 2008 .

[41]  Kara Dolinski,et al.  The BioGRID Interaction Database: 2011 update , 2010, Nucleic Acids Res..

[42]  T. Ideker,et al.  Systematic interpretation of genetic interactions using protein networks , 2005, Nature Biotechnology.

[43]  F. Piano,et al.  A High-Resolution C. elegans Essential Gene Network Based on Phenotypic Profiling of a Complex Tissue , 2011, Cell.

[44]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[45]  N. Marina,et al.  Angiogenesis and vascular targeting in Ewing sarcoma , 2010, Cancer.

[46]  A. Fraser,et al.  Predicting genetic modifier loci using functional gene networks. , 2010, Genome research.

[47]  F. Markowetz,et al.  Diverse epigenetic strategies interact to control epidermal differentiation , 2012, Nature Cell Biology.

[48]  Anastasia A Samsonova,et al.  False negative rates in Drosophila cell-based RNAi screens: a case study , 2011, BMC Genomics.

[49]  Satoru Miyano,et al.  Statistical analysis of a small set of time-ordered gene expression data using linear splines , 2002, Bioinform..

[50]  Chao Sima,et al.  RNAi phenotype profiling of kinases identifies potential therapeutic targets in Ewing's sarcoma , 2010, Molecular Cancer.

[51]  H. Jeffreys,et al.  Theory of probability , 1896 .

[52]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[53]  Paul Tempst,et al.  PINdb: a database of nuclear protein complexes from human and yeast , 2004, Bioinform..

[54]  S. L. Wong,et al.  Combining biological networks to predict genetic interactions. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[55]  Masahiro Kurosaka,et al.  Inhibition of PKCalpha activation in human bone and soft tissue sarcoma cells by the selective PKC inhibitor PKC412. , 2008, Anticancer research.

[56]  L. M. M.-T. Theory of Probability , 1929, Nature.

[57]  Michael D. Wilson,et al.  ChIP-seq: using high-throughput sequencing to discover protein-DNA interactions. , 2009, Methods.

[58]  Grant W. Brown,et al.  Functional dissection of protein complexes involved in yeast chromosome biology using a genetic interaction map , 2007, Nature.

[59]  Gary D Bader,et al.  The Genetic Landscape of a Cell , 2010, Science.