Optimized sgRNA design by deep learning to balance the off-target effects and on-target activity of CRISPR/Cas9

The CRISPR/Cas9 system derived from bacteria especially Streptococcus pyogenes (SpyCas9) is currently considered as the most advanced tool used for numerous areas of biological study in which it is useful to target or modify specific DNA sequences. However, low on-target cleavage efficiency and off-target effects impede its wide application. Several different sgRNA design tools for SpyCas9 by using various algorithms have been developed, including linear regression model, support vector machine (SVM) model and convolutional neuron network model. While the deep insight into the sgRNA features contributing for both on-target activity and off-target still remains to be determined. Here, with public large-scale CRISPR screen data, we evaluated contribution of different features influence sgRNA activity and off-target effects, and developed models for sgRNA off-target evaluation and on-target activity prediction. In addition, we combined both activity and off-target prediction models and packaged them as an online sgRNA design tool, OPT-sgRNA.

[1]  J. L. Mateo,et al.  Refined sgRNA efficacy prediction improves large- and small-scale CRISPR–Cas9 applications , 2017, Nucleic acids research.

[2]  J. Keith Joung,et al.  High frequency off-target mutagenesis induced by CRISPR-Cas nucleases in human cells , 2013, Nature Biotechnology.

[3]  Wing Hung Wong,et al.  SeqMap: mapping massive amount of oligonucleotides to the genome , 2008, Bioinform..

[4]  Jan Winter,et al.  GenomeCRISPR - a database for high-throughput CRISPR/Cas9 screens , 2016, Nucleic Acids Res..

[5]  Kira S. Makarova,et al.  Crystal Structure of Cpf1 in Complex with Guide RNA and Target DNA , 2016, Cell.

[6]  Eli J. Fine,et al.  DNA targeting specificity of RNA-guided Cas9 nucleases , 2013, Nature Biotechnology.

[7]  Jennifer A. Doudna,et al.  Enhanced proofreading governs CRISPR-Cas9 targeting accuracy , 2017, Nature.

[8]  Dimitri Perrin,et al.  A benchmark of computational CRISPR-Cas9 guide design methods , 2018, bioRxiv.

[9]  Tao Zhang,et al.  Computational approaches for effective CRISPR guide RNA design and evaluation , 2019, Computational and structural biotechnology journal.

[10]  Bo Zhang,et al.  CasOT: a genome-wide Cas9/gRNA off-target searching tool , 2014, Bioinform..

[11]  Jennifer A. Doudna,et al.  Biology and Applications of CRISPR Systems: Harnessing Nature’s Toolbox for Genome Engineering , 2016, Cell.

[12]  Margus Lukk,et al.  Crisflash: open-source software to generate CRISPR guide RNAs against genomes annotated with individual variation , 2019, Bioinform..

[13]  J. Lan,et al.  Direct observation of the formation of a CRISPR-Cas12a R-loop complex at the single-molecule level. , 2020, Chemical communications.

[14]  N. Perrimon,et al.  Identification of potential drug targets for tuberous sclerosis complex by synthetic screens combining CRISPR-based knockouts with RNAi , 2015, Science Signaling.

[15]  E. Lander,et al.  Genetic Screens in Human Cells Using the CRISPR-Cas9 System , 2013, Science.

[16]  Jennifer Listgarten,et al.  Prediction of off-target activities for the end-to-end design of CRISPR guide RNAs , 2018, Nature Biomedical Engineering.

[17]  Le Cong,et al.  Multiplex Genome Engineering Using CRISPR/Cas Systems , 2013, Science.

[18]  Jin-Soo Kim,et al.  Web-based design and analysis tools for CRISPR base editing , 2018, BMC Bioinformatics.

[19]  Martin J. Aryee,et al.  Open-source guideseq software for analysis of GUIDE-seq data , 2016, Nature Biotechnology.

[20]  Meagan E. Sullender,et al.  Rational design of highly active sgRNAs for CRISPR-Cas9–mediated gene inactivation , 2014, Nature Biotechnology.

[21]  J. Keith Joung,et al.  731. High-Fidelity CRISPR-Cas9 Nucleases with No Detectable Genome-Wide Off-Target Effects , 2016 .

[22]  Xiaowei Wang,et al.  WU-CRISPR: characteristics of functional guide RNAs for the CRISPR/Cas9 system , 2015, Genome Biology.

[23]  Itay Mayrose,et al.  A machine learning approach for predicting CRISPR-Cas9 cleavage efficiencies and patterns underlying its mechanism of action , 2017, PLoS Comput. Biol..

[24]  G. Church,et al.  Unraveling CRISPR-Cas9 genome engineering parameters via a library-on-library approach , 2015, Nature Methods.

[25]  Jennifer A. Doudna,et al.  Enhanced Proofreading Governs CRISPR-Cas9 Targeting Accuracy , 2018 .

[26]  T. Takumi,et al.  Sequence features associated with the cleavage efficiency of CRISPR/Cas9 system , 2016, Scientific Reports.

[27]  Md. Khaledur Rahman,et al.  CRISPRpred: A flexible and efficient tool for sgRNAs on-target activity prediction in CRISPR/Cas9 systems , 2017, PloS one.

[28]  Yanhui Hu,et al.  Enhanced specificity and efficiency of the CRISPR/Cas9 system with optimized sgRNA parameters in Drosophila. , 2014, Cell reports.

[29]  T. Golub,et al.  Genomic Copy Number Dictates a Gene-Independent Cell Response to CRISPR/Cas9 Targeting. , 2016, Cancer discovery.

[30]  M. Depken,et al.  Hybridization Kinetics Explains CRISPR-Cas Off-Targeting Rules. , 2018, Cell reports.

[31]  Hui Chen,et al.  The initiation, propagation and dynamics of CRISPR-SpyCas9 R-loop complex , 2017, Nucleic acids research.

[32]  H. Nishimasu,et al.  Structural Basis for the Canonical and Non-canonical PAM Recognition by CRISPR-Cpf1. , 2017, Molecular cell.

[33]  Jin-Soo Kim,et al.  Cas-OFFinder: a fast and versatile algorithm that searches for potential off-target sites of Cas9 RNA-guided endonucleases , 2014, Bioinform..

[34]  Luke A. Gilbert,et al.  CRISPR-Mediated Modular RNA-Guided Regulation of Transcription in Eukaryotes , 2013, Cell.

[35]  Feng Zhang,et al.  Crystal Structure of Cas9 in Complex with Guide RNA and Target DNA , 2014, Cell.

[36]  Jan Gorodkin,et al.  CRISPR-Cas9 off-targeting assessment with nucleic acid duplex energy parameters , 2018, Genome Biology.

[37]  Alejandro Chavez,et al.  sgRNA Scorer 2.0: A Species-Independent Model To Predict CRISPR/Cas9 Activity. , 2017, ACS synthetic biology.

[38]  M. Boutros,et al.  E-CRISP: fast CRISPR target site identification , 2014, Nature Methods.

[39]  Sungroh Yoon,et al.  Deep learning improves prediction of CRISPR–Cpf1 guide RNA activity , 2018, Nature Biotechnology.

[40]  Jennifer A. Doudna,et al.  Conformational control of DNA target cleavage by CRISPR–Cas9 , 2015, Nature.

[41]  Denis C. Bauer,et al.  High Activity Target-Site Identification Using Phenotypic Independent CRISPR-Cas9 Core Functionality. , 2018, The CRISPR journal.

[42]  Charles E. Vejnar,et al.  CRISPRscan: designing highly efficient sgRNAs for CRISPR/Cas9 targeting in vivo , 2015, Nature Methods.

[43]  J. Doudna,et al.  The new frontier of genome engineering with CRISPR-Cas9 , 2014, Science.

[44]  Xingxu Huang,et al.  sgRNAcas9: A Software Package for Designing CRISPR sgRNA and Evaluating Potential Off-Target Cleavage Sites , 2014, PloS one.

[45]  Mazhar Adli,et al.  Cas9-chromatin binding information enables more accurate CRISPR off-target prediction , 2015, Nucleic acids research.

[46]  Jennifer A. Doudna,et al.  Structures of a CRISPR-Cas9 R-loop complex primed for DNA cleavage , 2016, Science.

[47]  Guohui Chuai,et al.  DeepCRISPR: optimized CRISPR guide RNA design by deep learning , 2018, Genome Biology.

[48]  Gang Bao,et al.  Quantifying on- and off-target genome editing. , 2015, Trends in biotechnology.

[49]  Chris M. Brown,et al.  CRISPRTarget , 2013, RNA biology.

[50]  Clifford A. Meyer,et al.  Sequence determinants of improved CRISPR sgRNA design , 2015, Genome research.

[51]  Jennifer A. Doudna,et al.  Structures of Cas9 Endonucleases Reveal RNA-Mediated Conformational Activation , 2014, Science.

[52]  Denis C. Bauer,et al.  The Current State and Future of CRISPR-Cas9 gRNA Design Tools , 2018, Front. Pharmacol..

[53]  Neville E. Sanjana,et al.  Genome-Scale CRISPR-Cas9 Knockout Screening in Human Cells , 2014, Science.

[54]  Jennifer A. Doudna,et al.  A Cas9–guide RNA complex preorganized for target DNA recognition , 2015, Science.

[55]  J. Joung,et al.  High-fidelity CRISPR-Cas9 variants with undetectable genome-wide off-targets , 2015, Nature.

[56]  David R. Liu,et al.  High-throughput profiling of off-target DNA cleavage reveals RNA-programmed Cas9 nuclease specificity , 2013, Nature Biotechnology.

[57]  J. Doudna,et al.  A Programmable Dual-RNA–Guided DNA Endonuclease in Adaptive Bacterial Immunity , 2012, Science.

[58]  Dongsheng Duan,et al.  Unified energetics analysis unravels SpCas9 cleavage activity for optimal gRNA design , 2019, Proceedings of the National Academy of Sciences.

[59]  Meagan E. Sullender,et al.  Optimized sgRNA design to maximize activity and minimize off-target effects of CRISPR-Cas9 , 2015, Nature Biotechnology.

[60]  E. Lander,et al.  Development and Applications of CRISPR-Cas9 for Genome Engineering , 2014, Cell.

[61]  Yi Zheng,et al.  CRISPR/Cas9 cleavage efficiency regression through boosting algorithms and Markov sequence profiling , 2018, Bioinform..

[62]  P. Hsu,et al.  Methods for Optimizing CRISPR-Cas9 Genome Editing Specificity. , 2016, Molecular cell.

[63]  A. McKenna,et al.  FlashFry: a fast and flexible tool for large-scale CRISPR target design , 2017, BMC Biology.