Revealing Molecular Mechanisms by Integrating High-Dimensional Functional Screens with Protein Interaction Data

Functional genomics screens using multi-parametric assays are powerful approaches for identifying genes involved in particular cellular processes. However, they suffer from problems like noise, and often provide little insight into molecular mechanisms. A bottleneck for addressing these issues is the lack of computational methods for the systematic integration of multi-parametric phenotypic datasets with molecular interactions. Here, we present Integrative Multi Profile Analysis of Cellular Traits (IMPACT). The main goal of IMPACT is to identify the most consistent phenotypic profile among interacting genes. This approach utilizes two types of external information: sets of related genes (IMPACT-sets) and network information (IMPACT-modules). Based on the notion that interacting genes are more likely to be involved in similar functions than non-interacting genes, this data is used as a prior to inform the filtering of phenotypic profiles that are similar among interacting genes. IMPACT-sets selects the most frequent profile among a set of related genes. IMPACT-modules identifies sub-networks containing genes with similar phenotype profiles. The statistical significance of these selections is subsequently quantified via permutations of the data. IMPACT (1) handles multiple profiles per gene, (2) rescues genes with weak phenotypes and (3) accounts for multiple biases e.g. caused by the network topology. Application to a genome-wide RNAi screen on endocytosis showed that IMPACT improved the recovery of known endocytosis-related genes, decreased off-target effects, and detected consistent phenotypes. Those findings were confirmed by rescreening 468 genes. Additionally we validated an unexpected influence of the IGF-receptor on EGF-endocytosis. IMPACT facilitates the selection of high-quality phenotypic profiles using different types of independent information, thereby supporting the molecular interpretation of functional screens.

[1]  Joachim M. Buhmann,et al.  The Balanced Accuracy and Its Posterior Distribution , 2010, 2010 20th International Conference on Pattern Recognition.

[2]  Pablo Tamayo,et al.  ATARiS: Computational quantification of gene suppression phenotypes from multisample RNAi screens , 2013, Genome research.

[3]  Qisheng Li,et al.  A genome-wide genetic screen for host factors required for hepatitis C virus propagation , 2009, Proceedings of the National Academy of Sciences.

[4]  J. Watts,et al.  Chemically modified siRNA: tools and applications. , 2008, Drug discovery today.

[5]  S. Polo,et al.  Endocytosis Conducts the Cell Signaling Orchestra , 2006, Cell.

[6]  Xavier Robin,et al.  pROC: an open-source package for R and S+ to analyze and compare ROC curves , 2011, BMC Bioinformatics.

[7]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[8]  Michael Boutros,et al.  Cellular phenotyping by RNAi. , 2006, Briefings in functional genomics & proteomics.

[9]  Marc Vidal,et al.  Predictive models of molecular machines involved in Caenorhabditis elegans early embryogenesis , 2005, Nature.

[10]  K. L. Pierce,et al.  Transactivation of the EGF Receptor Mediates IGF-1-stimulated Shc Phosphorylation and ERK1/2 Activation in COS-7 Cells* , 2000, The Journal of Biological Chemistry.

[11]  M. Boutros,et al.  Clustering phenotype populations by genome-wide RNAi and multiparametric imaging , 2010, Molecular systems biology.

[12]  N. Hacohen,et al.  Highly parallel identification of essential genes in cancer cells , 2008, Proceedings of the National Academy of Sciences.

[13]  Ron Shamir,et al.  Identification of functional modules using network topology and high-throughput data , 2007, BMC Systems Biology.

[14]  Amy S. Espeseth,et al.  Host Cell Factors in HIV Replication: Meta-Analysis of Genome-Wide Studies , 2009, PLoS pathogens.

[15]  M. Howell,et al.  Genome‐wide siRNA screen reveals amino acid starvation‐induced autophagy requires SCOC and WAC , 2012, The EMBO journal.

[16]  Rafael C. Jimenez,et al.  The IntAct molecular interaction database in 2012 , 2011, Nucleic Acids Res..

[17]  Ralf Zimmer,et al.  Contextual analysis of RNAi-based functional screens using interaction networks , 2011, Bioinform..

[18]  J. Swedlow,et al.  Phosphorylation of threonine 156 of the μ2 subunit of the AP2 complex is essential for endocytosis in vitro and in vivo , 2001, Current Biology.

[19]  Xin Wang,et al.  Bioinformatics Applications Note Systems Biology Htsanalyzer: an R/bioconductor Package for Integrated Network Analysis of High-throughput Screens , 2022 .

[20]  F. Maxfield,et al.  Endocytic recycling , 2004, Nature Reviews Molecular Cell Biology.

[21]  Ryuji Kobayashi,et al.  Insulin-like growth factor-I receptor/human epidermal growth factor receptor 2 heterodimerization contributes to trastuzumab resistance of breast cancer cells. , 2005, Cancer research.

[22]  F. Piano,et al.  Gene Clustering Based on RNAi Phenotypes of Ovary-Enriched Genes in C. elegans , 2002, Current Biology.

[23]  Pier Paolo Di Fiore,et al.  The endocytic matrix , 2010, Nature.

[24]  M. Howarth,et al.  Type 1 insulin-like growth factor receptor translocates to the nucleus of human tumor cells. , 2010, Cancer research.

[25]  Yan Wang,et al.  Genome-wide functional analysis of human cell-cycle regulators , 2006, Proceedings of the National Academy of Sciences.

[26]  P. Linsley,et al.  Recognizing and avoiding siRNA off-target effects for target identification and therapeutic application , 2010, Nature Reviews Drug Discovery.

[27]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[28]  G. Berchem,et al.  The acquisition of resistance to TNFα in breast cancer cells is associated with constitutive activation of autophagy as revealed by a transcriptome analysis using a custom microarray , 2011, Autophagy.

[29]  Ben S. Wittner,et al.  Systematic RNA interference reveals that oncogenic KRAS-driven cancers require TBK1 , 2009, Nature.

[30]  Fengzhu Sun,et al.  A network-based integrative approach to prioritize reliable hits from multiple genome-wide RNAi screens in Drosophila , 2009, BMC Genomics.

[31]  Neville E. Sanjana,et al.  Genome-Scale CRISPR-Cas9 Knockout Screening in Human Cells , 2014, Science.

[32]  Hans-Werner Mewes,et al.  CORUM: the comprehensive resource of mammalian protein complexes , 2007, Nucleic Acids Res..

[33]  U. Lendahl,et al.  Cross-talk between the Notch and TGF-β signaling pathways mediated by interaction of the Notch intracellular domain with Smad3 , 2003, The Journal of cell biology.

[34]  F. Anania,et al.  Bidirectional crosstalk between leptin and insulin-like growth factor-I signaling promotes invasion and migration of breast cancer cells via transactivation of epidermal growth factor receptor. , 2008, Cancer research.

[35]  John A. Tallarico,et al.  Integrating high-content screening and ligand-target prediction to identify mechanism of action. , 2008, Nature chemical biology.

[36]  David J. Reiss,et al.  Integrated biclustering of heterogeneous genome-wide datasets for the inference of global regulatory networks , 2006, BMC Bioinformatics.

[37]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[38]  Tobias Müller,et al.  Identifying functional modules in protein–protein interaction networks: an integrated exact approach , 2008, ISMB.

[39]  Paul Pavlidis,et al.  The role of indirect connections in gene networks in predicting function , 2011, Bioinform..

[40]  N. Perrimon,et al.  High-throughput RNAi screening in cultured cells: a user's guide , 2006, Nature Reviews Genetics.

[41]  Christina Backes,et al.  An integer linear programming approach for finding deregulated subgraphs in regulatory networks , 2011, Nucleic acids research.

[42]  Yoshihiro Yamanishi,et al.  KEGG for linking genomes to life and the environment , 2007, Nucleic Acids Res..

[43]  M. Leslie EGF is internalized and degraded , 2005, The Journal of Cell Biology.

[44]  Lawrence M. Lifshitz,et al.  Sorting of EGF and transferrin at the plasma membrane and by cargo-specific signaling to EEA1-enriched endosomes , 2008, Journal of Cell Science.

[45]  Edward S. Kim,et al.  Heterodimerization of insulin-like growth factor receptor/epidermal growth factor receptor and induction of survivin expression counteract the antitumor action of erlotinib. , 2006, Cancer research.

[46]  M. Zerial,et al.  Purification and identification of novel Rab effectors using affinity chromatography. , 2000, Methods.

[47]  J. Hanley,et al.  A method of comparing the areas under receiver operating characteristic curves derived from the same cases. , 1983, Radiology.

[48]  David Zhang,et al.  The crosstalk between EGF, IGF, and Insulin cell signaling pathways - computational and experimental analysis , 2009, BMC Systems Biology.

[49]  Y. Kalaidzidis,et al.  Systems survey of endocytosis by multiparametric image analysis , 2010, Nature.

[50]  J. Schelter,et al.  Genome-wide resources of endoribonuclease-prepared short interfering RNAs for specific loss-of-function studies , 2007, Nature Methods.

[51]  Y. Kalaidzidis,et al.  Rab Conversion as a Mechanism of Progression from Early to Late Endosomes , 2005, Cell.

[52]  A. Sorkin,et al.  Endocytosis and signalling: intertwining molecular networks , 2009, Nature Reviews Molecular Cell Biology.

[53]  P. Liberali,et al.  Population context determines cell-to-cell variability in endocytosis and virus infection , 2009, Nature.

[54]  E. Schadt,et al.  Integrating siRNA and protein-protein interaction data to identify an expanded insulin signaling network. , 2009, Genome research.

[55]  Benno Schwikowski,et al.  Discovering regulatory and signalling circuits in molecular interaction networks , 2002, ISMB.

[56]  Nir Hacohen,et al.  Minimizing the risk of reporting false positives in large-scale RNAi screens , 2006, Nature Methods.

[57]  Randall T Moon,et al.  Integrative Analysis of Genome-Wide RNA Interference Screens , 2009, Science Signaling.

[58]  Aideen Long,et al.  Statistical methods for analysis of high-throughput RNA interference screens , 2009, Nature Methods.

[59]  T. Ideker,et al.  Network-based classification of breast cancer metastasis , 2007, Molecular systems biology.

[60]  Damian Szklarczyk,et al.  STRING v9.1: protein-protein interaction networks, with increased coverage and integration , 2012, Nucleic Acids Res..

[61]  A. Hyman,et al.  Genome-scale RNAi profiling of cell division in human tissue culture cells , 2007, Nature Cell Biology.