Integrative analysis of pooled CRISPR genetic screens using MAGeCKFlute

Genome-wide screening using CRISPR coupled with nuclease Cas9 (CRISPR–Cas9) is a powerful technology for the systematic evaluation of gene function. Statistically principled analysis is needed for the accurate identification of gene hits and associated pathways. Here, we describe how to perform computational analysis of CRISPR screens using the MAGeCKFlute pipeline. MAGeCKFlute combines the MAGeCK and MAGeCK-VISPR algorithms and incorporates additional downstream analysis functionalities. MAGeCKFlute is distinguished from other currently available tools by its comprehensive pipeline, which contains a series of functions for analyzing CRISPR screen data. This protocol explains how to use MAGeCKFlute to perform quality control (QC), normalization, batch effect removal, copy-number bias correction, gene hit identification and downstream functional enrichment analysis for CRISPR screens. We also describe gene identification and data analysis in CRISPR screens involving drug treatment. Completing the entire MAGeCKFlute pipeline requires ~3 h on a desktop computer running Linux or Mac OS with R support.MAGeCKFlute is an algorithm for the analysis and visualization of CRISPR screen data. Starting from sequencing reads and an sgRNA library, the algorithm normalizes results and represents them as pathway enrichment classifications.

[1]  Jun S. Liu,et al.  MAGeCK enables robust identification of essential genes from genome-scale CRISPR/Cas9 knockout screens , 2014, Genome Biology.

[2]  J. Kinney,et al.  Discovery of cancer drug targets by CRISPR-Cas9 screening of protein domains , 2015, Nature Biotechnology.

[3]  Robert Gentleman,et al.  Using GOstats to test gene lists for GO term association , 2007, Bioinform..

[4]  Adam A. Margolin,et al.  The Cancer Cell Line Encyclopedia enables predictive modeling of anticancer drug sensitivity , 2012, Nature.

[5]  Andrew E. Jaffe,et al.  Bioinformatics Applications Note Gene Expression the Sva Package for Removing Batch Effects and Other Unwanted Variation in High-throughput Experiments , 2022 .

[6]  Weijun Luo,et al.  Pathview: an R/Bioconductor package for pathway-based data integration and visualization , 2013, Bioinform..

[7]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[8]  N. Hacohen,et al.  Highly parallel identification of essential genes in cancer cells , 2008, Proceedings of the National Academy of Sciences.

[9]  Jun S. Liu,et al.  Quality control, modeling, and visualization of CRISPR screens with MAGeCK-VISPR , 2015, Genome Biology.

[10]  Guangchuang Yu,et al.  clusterProfiler: an R package for comparing biological themes among gene clusters. , 2012, Omics : a journal of integrative biology.

[11]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Le Cong,et al.  Multiplex Genome Engineering Using CRISPR/Cas Systems , 2013, Science.

[13]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[14]  Jiyang Yu,et al.  Abstract PR10: ScreenBEAM: a Novel Meta-Analysis Algorithm for Functional Genomics Screens via Bayesian Hierarchical Modeling , 2017 .

[15]  Max A. Horlbeck,et al.  Genome-Scale CRISPR-Mediated Control of Gene Repression and Activation , 2014, Cell.

[16]  M. Daly,et al.  PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes , 2003, Nature Genetics.

[17]  中尾 光輝,et al.  KEGG(Kyoto Encyclopedia of Genes and Genomes)〔和文〕 (特集 ゲノム医学の現在と未来--基礎と臨床) -- (データベース) , 2000 .

[18]  Lei Zhang,et al.  Functional genomics, proteomics, and regulatory DNA analysis in isogenic settings using zinc finger nuclease-driven transgenesis into a safe harbor locus in the human genome. , 2010, Genome research.

[19]  Peng Jiang,et al.  Improved design and analysis of CRISPR knockout screens , 2018, Bioinform..

[20]  Cheng Li,et al.  Adjusting batch effects in microarray expression data using empirical Bayes methods. , 2007, Biostatistics.

[21]  Neville E. Sanjana,et al.  High-throughput functional genomics using CRISPR–Cas9 , 2015, Nature Reviews Genetics.

[22]  Julio Saez-Rodriguez,et al.  A CRISPR Dropout Screen Identifies Genetic Vulnerabilities and Therapeutic Targets in Acute Myeloid Leukemia , 2016, Cell reports.

[23]  Emily J. Girard,et al.  Genome-wide CRISPR-Cas9 Screens Reveal Loss of Redundancy between PKMYT1 and WEE1 in Glioblastoma Stem-like Cells. , 2015, Cell reports.

[24]  D. Largaespada,et al.  Using genome-wide CRISPR library screening with library resistant DCK to find new sources of Ara-C drug resistance in AML , 2016, Scientific Reports.

[25]  Gaelen T. Hess,et al.  Synergistic drug combinations for cancer identified in a CRISPR screen for pairwise genetic interactions , 2017, Nature Biotechnology.

[26]  D. Morgens,et al.  Systematic comparison of CRISPR-Cas9 and RNAi screens for essential genes , 2016, Nature Biotechnology.

[27]  Hakho Lee,et al.  Genome-wide CRISPR Screen in a Mouse Model of Tumor Growth and Metastasis , 2015, Cell.

[28]  Zhi-hua Chen,et al.  Kyoto Encyclopedia of Genes and Genomes were used for functional enrichment analysis of differentially expressed genes (DEGs). A protein‐protein interaction network was constructed, and the hub genes were subjected to module analysis and identification using Search Tool for the Retrieval , 2019 .

[29]  E. Lander,et al.  Genetic Screens in Human Cells Using the CRISPR-Cas9 System , 2013, Science.

[30]  Jason Moffat,et al.  BAGEL: A computational framework for identifying essential genes from pooled library screens , 2015 .

[31]  R. Jaenisch,et al.  Efficient targeting of expressed and silent genes in human ESCs and iPSCs using zinc-finger nucleases , 2009, Nature Biotechnology.

[32]  James M. Roberts,et al.  CDK inhibitors: positive and negative regulators of G1-phase progression. , 1999, Genes & development.

[33]  Jon C. Aster,et al.  Network analysis of gene essentiality in functional genomics experiments , 2015, Genome Biology.

[34]  Eric S. Lander,et al.  Gene Essentiality Profiling Reveals Gene Networks and Synthetic Lethal Interactions with Oncogenic Ras , 2017, Cell.

[35]  Neville E. Sanjana,et al.  Genome-Scale CRISPR-Cas9 Knockout Screening in Human Cells , 2014, Science.

[36]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[37]  Yilong Li,et al.  Genome-wide recessive genetic screening in mammalian cells with a lentiviral CRISPR-guide RNA library , 2013, Nature Biotechnology.

[38]  S. Dawson,et al.  CMTM6 maintains the expression of PD-L1 and regulates anti-tumour immunity , 2017, Nature.

[39]  D. Durocher,et al.  High-Resolution CRISPR Screens Reveal Fitness Genes and Genotype-Specific Cancer Liabilities , 2015, Cell.

[40]  Alexandro E. Trevino,et al.  Genome-scale transcriptional activation by an engineered CRISPR-Cas9 complex , 2014, Nature.

[41]  E. Lander,et al.  Identification and characterization of essential genes in the human genome , 2015, Science.

[42]  Raphaël Guérois,et al.  PPI4DOCK: large scale assessment of the use of homology models in free docking over more than 1000 realistic targets , 2016, Bioinform..

[43]  R. König,et al.  A probability-based approach for the analysis of large-scale RNAi screens , 2007, Nature Methods.

[44]  T. Golub,et al.  Genomic Copy Number Dictates a Gene-Independent Cell Response to CRISPR/Cas9 Targeting. , 2016, Cancer discovery.

[45]  John G. Doench,et al.  In vivo CRISPR screening identifies Ptpn2 as a cancer immunotherapy target , 2017, Nature.

[46]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[47]  Jiyang Yu,et al.  ScreenBEAM: a novel meta-analysis algorithm for functional genomics screens via Bayesian hierarchical modeling , 2015, Bioinform..

[48]  James E. DiCarlo,et al.  RNA-Guided Human Genome Engineering via Cas9 , 2013, Science.