A genome-wide almanac of co-essential modules assigns function to uncharacterized genes

A central remaining question in the post-genomic era is how genes interact to form biological pathways. Measurements of gene dependency across hundreds of cell lines have been used to cluster genes into ‘co-essential’ pathways, but this approach has been limited by ubiquitous false positives. Here, we develop a statistical method that enables robust identification of gene co-essentiality and yields a genome-wide set of functional modules. This almanac recapitulates diverse pathways and protein complexes and predicts the functions of 102 uncharacterized genes. Validating top predictions, we show that TMEM189 encodes plasmanylethanolamine desaturase, the long-sought key enzyme for plasmalogen synthesis. We also show that C15orf57 binds the AP2 complex, localizes to clathrin-coated pits, and enables efficient transferrin uptake. Finally, we provide an interactive web tool for the community to explore the results (coessentiality.net). Our results establish co-essentiality profiling as a powerful resource for biological pathway identification and discovery of novel gene functions.
