Unsupervised correction of gene-independent cell responses to CRISPR-Cas9 targeting

Background: Genome editing by CRISPR-Cas9 technology allows large-scale screening of gene essentiality in cancer. A confounding factor when interpreting CRISPR-Cas9 screens is the high false-positive rate in detecting essential genes within copy number amplified regions of the genome. We have developed the computational tool CRISPRcleanR which is capable of identifying and correcting gene-independent responses to CRISPR-Cas9 targeting. CRISPRcleanR uses an unsupervised approach based on the segmentation of single-guide RNA fold change values across the genome, without making any assumption about the copy number status of the targeted genes. Results Applying our method to existing and newly generated genome-wide essentiality profiles from 15 cancer cell lines, we demonstrate that CRISPRcleanR reduces false positives when calling essential genes, correcting biases within and outside of amplified regions, while maintaining true positive rates. Established cancer dependencies and essentiality signals of amplified cancer driver genes are detectable post-correction. CRISPRcleanR reports sgRNA fold changes and normalised read counts, is therefore compatible with downstream analysis tools, and works with multiple sgRNA libraries. Conclusions CRISPRcleanR is a versatile open-source tool for the analysis of CRISPR-Cas9 knockout screens to identify essential genes.

[1]  Eric S. Lander,et al.  Gene Essentiality Profiling Reveals Gene Networks and Synthetic Lethal Interactions with Oncogenic Ras , 2017, Cell.

[2]  Julio Saez-Rodriguez,et al.  A CRISPR Dropout Screen Identifies Genetic Vulnerabilities and Therapeutic Targets in Acute Myeloid Leukemia , 2016, Cell reports.

[3]  Nuno A. Fonseca,et al.  Transcription factor activities enhance markers of drug response in cancer , 2017, bioRxiv.

[4]  T. Golub,et al.  Genomic Copy Number Dictates a Gene-Independent Cell Response to CRISPR/Cas9 Targeting. , 2016, Cancer discovery.

[5]  Hans Clevers,et al.  Genome-wide CRISPR screens reveal a Wnt–FZD5 signaling circuit as a druggable vulnerability of RNF43-mutant pancreatic tumors , 2016, Nature Medicine.

[6]  M. Wigler,et al.  Circular binary segmentation for the analysis of array-based DNA copy number data. , 2004, Biostatistics.

[7]  W. Huber,et al.  which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. MAnorm: a robust model for quantitative comparison of ChIP-Seq data sets , 2011 .

[8]  Nir Hacohen,et al.  A genome-wide CRISPR screen identifies a restricted set of HIV host dependency factors , 2016, Nature Genetics.

[9]  Adam A. Margolin,et al.  The Cancer Cell Line Encyclopedia enables predictive modeling of anticancer drug sensitivity , 2012, Nature.

[10]  E. Lander,et al.  Identification and characterization of essential genes in the human genome , 2015, Science.

[11]  D. Durocher,et al.  High-Resolution CRISPR Screens Reveal Fitness Genes and Genotype-Specific Cancer Liabilities , 2015, Cell.

[12]  S. Ramaswamy,et al.  Systematic identification of genomic markers of drug sensitivity in cancer cells , 2012, Nature.

[13]  Neville E. Sanjana,et al.  Genome-Scale CRISPR-Cas9 Knockout Screening in Human Cells , 2014, Science.

[14]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Emanuel J. V. Gonçalves,et al.  Tandem duplications lead to loss of fitness effects in CRISPR-Cas9 data , 2018, bioRxiv.

[16]  Kosuke Yusa,et al.  Optimised metrics for CRISPR-KO screens with second-generation gRNA libraries , 2017, Scientific Reports.

[17]  James E. DiCarlo,et al.  RNA-Guided Human Genome Engineering via Cas9 , 2013, Science.

[18]  Le Cong,et al.  Multiplex Genome Engineering Using CRISPR/Cas Systems , 2013, Science.

[19]  James M. McFarland,et al.  Computational correction of copy-number effect improves specificity of CRISPR-Cas9 essentiality screens in cancer cells , 2017, bioRxiv.

[20]  Meagan E. Sullender,et al.  Optimized sgRNA design to maximize activity and minimize off-target effects of CRISPR-Cas9 , 2015, Nature Biotechnology.

[21]  Emanuel J. V. Gonçalves,et al.  A Landscape of Pharmacogenomic Interactions in Cancer , 2016, Cell.

[22]  S. Swamy,et al.  PICNIC: an algorithm to predict absolute allelic copy number variation with microarray cancer data , 2009, Biostatistics.

[23]  Xavier Robin,et al.  pROC: an open-source package for R and S+ to analyze and compare ROC curves , 2011, BMC Bioinformatics.

[24]  Alvis Brazma,et al.  The BioStudies database , 2015, Molecular Systems Biology.

[25]  N. Kenmochi,et al.  The human ribosomal protein genes: sequencing and comparative analysis of 73 genes. , 2002, Genome research.

[26]  Benjamin E. Gross,et al.  Integrative Analysis of Complex Cancer Genomics and Clinical Profiles Using the cBioPortal , 2013, Science Signaling.

[27]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[28]  E. S. Venkatraman,et al.  A faster circular binary segmentation algorithm for the analysis of array CGH data , 2007, Bioinform..

[29]  Yilong Li,et al.  Genome-wide recessive genetic screening in mammalian cells with a lentiviral CRISPR-guide RNA library , 2013, Nature Biotechnology.

[30]  E. Lander,et al.  Genetic Screens in Human Cells Using the CRISPR-Cas9 System , 2013, Science.

[31]  G. Traver Hart,et al.  BAGEL: a computational framework for identifying essential genes from pooled library screens , 2015, BMC Bioinformatics.

[32]  Julio Saez-Rodriguez,et al.  Abstract A44: A landscape of pharmacogenomic interactions in cancer , 2017 .

[33]  John G. Doench,et al.  In vivo CRISPR screening identifies Ptpn2 as a cancer immunotherapy target , 2017, Nature.

[34]  G. Getz,et al.  GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers , 2011, Genome Biology.

[35]  Joshua M. Korn,et al.  CRISPR Screens Provide a Comprehensive Assessment of Cancer Vulnerabilities but Generate False-Positive Hits for Highly Amplified Genomic Regions. , 2016, Cancer discovery.

[36]  K. Kinzler,et al.  Cancer Genome Landscapes , 2013, Science.

[37]  Jun S. Liu,et al.  MAGeCK enables robust identification of essential genes from genome-scale CRISPR/Cas9 knockout screens , 2014, Genome Biology.