Integrative analysis and refined design of CRISPR knockout screens

Genome-wide CRISPR-Cas9 screen has been widely used to interrogate gene functions. However, the analysis remains challenging and rules to design better libraries beg further refinement. Here we present MAGeCK-NEST, which integrates protein-protein interaction (PPI), improves the inference accuracy when fewer guide-RNAs (sgRNAs) are available, and assesses screen qualities using information on PPI. MAGeCK-NEST also adopts a maximum-likelihood approach to remove sgRNA outliers, which are characterized with higher G-nucleotide counts, especially in regions distal from the PAM motif. Using MAGeCK-NEST, we found that choosing non-targeting sgRNAs as negative controls lead to strong bias, which can be mitigated by sgRNAs targeting the “safe harbor” regions. Custom-designed screens confirmed our findings, and further revealed that 19nt sgRNAs consistently gave the best signal-to-noise separation. Collectively, our method enabled robust calling of CRISPR screen hits and motivated the design of an improved genome-wide CRISPR screen library.

[1]  Jun S. Liu,et al.  Quality control, modeling, and visualization of CRISPR screens with MAGeCK-VISPR , 2015, Genome Biology.

[2]  J. Doudna,et al.  A Programmable Dual-RNA–Guided DNA Endonuclease in Adaptive Bacterial Immunity , 2012, Science.

[3]  Matthew C. Canver,et al.  BCL11A enhancer dissection by Cas9-mediated in situ saturating mutagenesis , 2015, Nature.

[4]  N. Dubrawsky Cancer statistics , 1989, CA: a cancer journal for clinicians.

[5]  David A. Orlando,et al.  Quantitative ChIP-Seq normalization reveals global modulation of the epigenome. , 2014, Cell reports.

[6]  R. Barrangou,et al.  CRISPR Provides Acquired Resistance Against Viruses in Prokaryotes , 2007, Science.

[7]  E. Crawford,et al.  Characterising the castration‐resistant prostate cancer population: a systematic review , 2011, International journal of clinical practice.

[8]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[9]  Tim J. Wigle,et al.  A selective inhibitor of EZH2 blocks H3K27 methylation and kills mutant lymphoma cells. , 2012, Nature chemical biology.

[10]  R. Terns,et al.  Cas9 function and host genome sampling in Type II-A CRISPR–Cas adaptation , 2015, Genes & development.

[11]  D. Morgens,et al.  Systematic comparison of CRISPR-Cas9 and RNAi screens for essential genes , 2016, Nature Biotechnology.

[12]  H. Xiang,et al.  Adaptation of the Haloarcula hispanica CRISPR-Cas system to a purified virus strictly requires a priming process , 2013, Nucleic acids research.

[13]  R. Mehra,et al.  Polycomb protein EZH2 regulates tumor invasion via the transcriptional repression of the metastasis suppressor RKIP in breast and prostate cancer. , 2012, Cancer research.

[14]  H. Lepor,et al.  Androgen deprivation therapy in the treatment of advanced prostate cancer. , 2007, Reviews in urology.

[15]  Bin Li,et al.  A new class of temporarily phenotypic enhancers identified by CRISPR/Cas9-mediated genetic screening , 2016, Genome research.

[16]  H. Hieronymus,et al.  Androgen receptor signaling regulates DNA repair in prostate cancers. , 2013, Cancer discovery.

[17]  C. Sander,et al.  Integrative genomic profiling of human prostate cancer. , 2010, Cancer cell.

[18]  J. García-Martínez,et al.  Intervening Sequences of Regularly Spaced Prokaryotic Repeats Derive from Foreign Genetic Elements , 2005, Journal of Molecular Evolution.

[19]  F. Rodríguez-Valera,et al.  Long stretches of short tandem repeats are present in the largest replicons of the Archaea Haloferax mediterranei and Haloferax volcanii and could be involved in replicon partitioning , 1995, Molecular microbiology.

[20]  T. Thompson,et al.  Prostate cancer progression after androgen deprivation therapy: mechanisms of castrate resistance and novel therapeutic approaches , 2013, Oncogene.

[21]  R. Tjian,et al.  Dynamics of CRISPR-Cas9 genome interrogation in living cells , 2015, Science.

[22]  Stan J. J. Brouns,et al.  Small CRISPR RNAs Guide Antiviral Defense in Prokaryotes , 2008, Science.

[23]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[24]  B. Li,et al.  Expression profiling reveals off-target gene regulation by RNAi , 2003, Nature Biotechnology.

[25]  Lior Pachter,et al.  Sequence Analysis , 2020, Definitions.

[26]  M. Henry,et al.  Epithelial-to-mesenchymal transition in prostate cancer: paradigm or puzzle? , 2011, Nature Reviews Urology.

[27]  D van Soolingen,et al.  Insertion element IS987 from Mycobacterium bovis BCG is located in a hot-spot integration region for insertion elements in Mycobacterium tuberculosis complex strains , 1991, Infection and immunity.

[28]  F. Stegmeier,et al.  Abstract B21: CRISPR screens provide a comprehensive assessment of cancer vulnerabilities but generate false-positive hits for highly amplified genomic regions , 2017 .

[29]  G. Church,et al.  CAS9 transcriptional activators for target specificity screening and paired nickases for cooperative genome engineering , 2013, Nature Biotechnology.

[30]  Luciano A. Marraffini,et al.  Cas9 specifies functional viral targets during CRISPR-Cas adaptation , 2015, Nature.

[31]  Neville E. Sanjana,et al.  Genome-Scale CRISPR-Cas9 Knockout Screening in Human Cells , 2014, Science.

[32]  S. Dhanasekaran,et al.  The polycomb group protein EZH2 is involved in progression of prostate cancer , 2002, Nature.

[33]  Jun S. Song,et al.  HiTSelect: a comprehensive tool for high-complexity-pooled screen analysis , 2014, Nucleic acids research.

[34]  H. Scher,et al.  Development of a Second-Generation Antiandrogen for Treatment of Advanced Prostate Cancer , 2009, Science.

[35]  E. Lander,et al.  Identification and characterization of essential genes in the human genome , 2015, Science.

[36]  Zhongzheng Cao,et al.  Genome-scale deletion screening of human long non-coding RNAs using a paired-guide RNA CRISPR–Cas9 library , 2016, Nature Biotechnology.

[37]  T. Golub,et al.  Genomic Copy Number Dictates a Gene-Independent Cell Response to CRISPR/Cas9 Targeting. , 2016, Cancer discovery.

[38]  S. Gabriel,et al.  Discovery and saturation analysis of cancer genes across 21 tumor types , 2014, Nature.

[39]  R. Elkon,et al.  Functional genetic screens for enhancer elements in the human genome using CRISPR-Cas9 , 2016, Nature Biotechnology.

[40]  T. Guthrie,et al.  Prostate cancer. , 2020, American family physician.

[41]  Chun Jimmie Ye,et al.  A Genome-wide CRISPR Screen in Primary Immune Cells to Dissect Regulatory Networks , 2015, Cell.

[42]  J. Moffat,et al.  Measuring error rates in genomic perturbation screens: gold standards for human functional genomics , 2014, Molecular systems biology.

[43]  E. Lander,et al.  Genetic Screens in Human Cells Using the CRISPR-Cas9 System , 2013, Science.

[44]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[45]  Jun S. Liu,et al.  MAGeCK enables robust identification of essential genes from genome-scale CRISPR/Cas9 knockout screens , 2014, Genome Biology.

[46]  Morten Nielsen,et al.  Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion , 2012, Nucleic Acids Res..

[47]  J. Kinney,et al.  Discovery of cancer drug targets by CRISPR-Cas9 screening of protein domains , 2015, Nature Biotechnology.

[48]  Shiyou Zhu,et al.  High-throughput screening of a CRISPR/Cas9 library for functional genomics in human cells , 2014, Nature.

[49]  Adam A. Margolin,et al.  The Cancer Cell Line Encyclopedia enables predictive modeling of anticancer drug sensitivity , 2012, Nature.

[50]  Jon C. Aster,et al.  Network analysis of gene essentiality in functional genomics experiments , 2015, Genome Biology.

[51]  Damian Szklarczyk,et al.  STRING v9.1: protein-protein interaction networks, with increased coverage and integration , 2012, Nucleic Acids Res..

[52]  Philippe Horvath,et al.  The CRISPR/Cas bacterial immune system cleaves bacteriophage and plasmid DNA , 2010, Nature.

[53]  M. Vetter,et al.  PRC2 during vertebrate organogenesis: a complex in transition. , 2012, Developmental biology.

[54]  S. Ehrlich,et al.  Clustered regularly interspaced short palindrome repeats (CRISPRs) have spacers of extrachromosomal origin. , 2005, Microbiology.

[55]  M. Loda,et al.  EZH2 Oncogenic Activity in Castration-Resistant Prostate Cancer Cells Is Polycomb-Independent , 2012, Science.

[56]  J. Moffat,et al.  Measuring error rates in genomic perturbation screens: gold standards for human functional genomics , 2014, bioRxiv.

[57]  Clifford A. Meyer,et al.  Androgen Receptor Regulates a Distinct Transcription Program in Androgen-Independent Prostate Cancer , 2009, Cell.

[58]  L. Schouls,et al.  Identification of genes that are associated with DNA repeats in prokaryotes , 2002, Molecular microbiology.

[59]  Stan J. J. Brouns,et al.  CRISPR Interference Directs Strand Specific Spacer Acquisition , 2012, PloS one.

[60]  J. Keith Joung,et al.  Improving CRISPR-Cas nuclease specificity using truncated guide RNAs , 2014, Nature Biotechnology.

[61]  R. König,et al.  A probability-based approach for the analysis of large-scale RNAi screens , 2007, Nature Methods.

[62]  Sven Laur,et al.  Robust rank aggregation for gene list integration and meta-analysis , 2012, Bioinform..

[63]  Qingzhong Hu,et al.  CYP17 inhibitors—abiraterone, C17,20-lyase inhibitors and multi-targeting agents , 2014, Nature Reviews Urology.

[64]  S. Varambally,et al.  Genomic Loss of microRNA-101 Leads to Overexpression of Histone Methyltransferase EZH2 in Cancer , 2008, Science.

[65]  Rameen Beroukhim,et al.  An oncogene–tumor suppressor cascade drives metastatic prostate cancer by coordinately activating Ras and nuclear factor-κB , 2010, Nature Medicine.

[66]  N. Hacohen,et al.  Highly parallel identification of essential genes in cancer cells , 2008, Proceedings of the National Academy of Sciences.

[67]  D. Durocher,et al.  High-Resolution CRISPR Screens Reveal Fitness Genes and Genotype-Specific Cancer Liabilities , 2015, Cell.

[68]  J. Licht,et al.  Epigenetic therapy of hematological malignancies: where are we now? , 2013, Therapeutic advances in hematology.

[69]  M. Dawson,et al.  Cancer Epigenetics: From Mechanism to Therapy , 2012, Cell.

[70]  L. Marraffini,et al.  Mature clustered, regularly interspaced, short palindromic repeats RNA (crRNA) length is measured by a ruler mechanism anchored at the precursor processing site , 2011, Proceedings of the National Academy of Sciences.

[71]  A. Fire,et al.  Potent and specific genetic interference by double-stranded RNA in Caenorhabditis elegans , 1998, Nature.

[72]  Mingming Jia,et al.  COSMIC: mining complete cancer genomes in the Catalogue of Somatic Mutations in Cancer , 2010, Nucleic Acids Res..

[73]  Meagan E. Sullender,et al.  Optimized sgRNA design to maximize activity and minimize off-target effects of CRISPR-Cas9 , 2015, Nature Biotechnology.

[74]  Eli J. Fine,et al.  DNA targeting specificity of RNA-guided Cas9 nucleases , 2013, Nature Biotechnology.

[75]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[76]  G. Calin,et al.  Regulation of tumor angiogenesis by EZH2. , 2010, Cancer cell.

[77]  Jiyang Yu,et al.  ScreenBEAM: a novel meta-analysis algorithm for functional genomics screens via Bayesian hierarchical modeling , 2015, Bioinform..

[78]  T. Kanda,et al.  Identification of an Insulator in AAVS1, a Preferred Region for Integration of Adeno-Associated Virus DNA , 2003, Journal of Virology.

[79]  Darren J. Wilkinson,et al.  Bayesian methods in bioinformatics and computational systems biology , 2006, Briefings Bioinform..

[80]  P. Linsley,et al.  Recognizing and avoiding siRNA off-target effects for target identification and therapeutic application , 2010, Nature Reviews Drug Discovery.

[81]  Joshua M. Korn,et al.  CRISPR Screens Provide a Comprehensive Assessment of Cancer Vulnerabilities but Generate False-Positive Hits for Highly Amplified Genomic Regions. , 2016, Cancer discovery.

[82]  C. Daniel,et al.  Therapeutic opportunities for manipulating TReg cells in autoimmunity and cancer , 2012, Nature Reviews Drug Discovery.

[83]  Martin J. Aryee,et al.  GUIDE-Seq enables genome-wide profiling of off-target cleavage by CRISPR-Cas nucleases , 2014, Nature Biotechnology.

[84]  L. Marraffini,et al.  CRISPR Interference Limits Horizontal Gene Transfer in Staphylococci by Targeting DNA , 2008, Science.

[85]  Joshua M. Stuart,et al.  The Cancer Genome Atlas Pan-Cancer analysis project , 2013, Nature Genetics.

[86]  J. Trapman,et al.  The androgen receptor in prostate cancer. , 1996, Pathology, research and practice.

[87]  Lei Zhang,et al.  Functional genomics, proteomics, and regulatory DNA analysis in isogenic settings using zinc finger nuclease-driven transgenesis into a safe harbor locus in the human genome. , 2010, Genome research.

[88]  Debashis Ghosh,et al.  EZH2 is a marker of aggressive breast cancer and promotes neoplastic transformation of breast epithelial cells , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[89]  Le Cong,et al.  Multiplex Genome Engineering Using CRISPR/Cas Systems , 2013, Science.

[90]  Yan Liu,et al.  EZH2 inhibition as a therapeutic strategy for lymphoma with EZH2-activating mutations , 2012, Nature.

[91]  R. Ketting The many faces of RNAi. , 2011, Developmental cell.

[92]  E. Chan,et al.  Identification of EZH2 and EZH1 small molecule inhibitors with selective impact on diffuse large B cell lymphoma cell growth. , 2013, Chemistry & biology.

[93]  Juri Rappsilber,et al.  A model for transmission of the H3K27me3 epigenetic mark , 2008, Nature Cell Biology.

[94]  H. Scher,et al.  Expanding androgen- and androgen receptor signaling-directed therapies for castration-resistant prostate cancer. , 2014, Oncology.

[95]  Clifford A. Meyer,et al.  Sequence determinants of improved CRISPR sgRNA design , 2015, Genome research.

[96]  M. Lomax,et al.  Biological consequences of radiation-induced DNA damage: relevance to radiotherapy. , 2013, Clinical oncology (Royal College of Radiologists (Great Britain)).

[97]  P. Atadja,et al.  Selective inhibition of Ezh2 by a small molecule inhibitor blocks tumor cells proliferation , 2012, Proceedings of the National Academy of Sciences.

[98]  A. Chase,et al.  Aberrations of EZH2 in Cancer , 2011, Clinical Cancer Research.

[99]  Meagan E. Sullender,et al.  Rational design of highly active sgRNAs for CRISPR-Cas9–mediated gene inactivation , 2014, Nature Biotechnology.

[100]  Jong-il Kim,et al.  Digenome-seq: genome-wide profiling of CRISPR-Cas9 off-target effects in human cells , 2015, Nature Methods.

[101]  Yilong Li,et al.  Genome-wide recessive genetic screening in mammalian cells with a lentiviral CRISPR-guide RNA library , 2013, Nature Biotechnology.

[102]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[103]  M. Lübbert,et al.  Histone deacetylase (HDAC) inhibitors in recent clinical trials for cancer therapy , 2010, Clinical Epigenetics.

[104]  Michel Sadelain,et al.  Safe harbours for the integration of new DNA in the human genome , 2011, Nature Reviews Cancer.

[105]  Yong Jiang,et al.  Mutation of A677 in histone methyltransferase EZH2 in human B-cell lymphoma promotes hypertrimethylation of histone H3 on lysine 27 (H3K27) , 2012, Proceedings of the National Academy of Sciences.