Deep learning improves prediction of CRISPR–Cpf1 guide RNA activity

We present two algorithms to predict the activity of AsCpf1 guide RNAs. Indel frequencies for 15,000 target sequences were used in a deep-learning framework based on a convolutional neural network to train Seq-deepCpf1. We then incorporated chromatin accessibility information to create the better-performing DeepCpf1 algorithm for cell lines for which such information is available and show that both algorithms outperform previous machine learning algorithms on our own and published data sets.

[1]  Xiaowei Wang,et al.  WU-CRISPR: characteristics of functional guide RNAs for the CRISPR/Cas9 system , 2015, Genome Biology.

[2]  Ciaran M Lee,et al.  Examination of CRISPR/Cas9 design tools and the effect of target site accessibility on Cas9 activity , 2017, Experimental physiology.

[3]  G. Church,et al.  Unraveling CRISPR-Cas9 genome engineering parameters via a library-on-library approach , 2015, Nature Methods.

[4]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[5]  Clifford A. Meyer,et al.  Sequence determinants of improved CRISPR sgRNA design , 2015, Genome research.

[6]  B. Frey,et al.  Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning , 2015, Nature Biotechnology.

[7]  E. M. DeGennaro,et al.  Multiplex gene editing by CRISPR-Cpf1 through autonomous processing of a single crRNA array , 2016, Nature Biotechnology.

[8]  Feng Zhang,et al.  Crystal Structure of Cas9 in Complex with Guide RNA and Target DNA , 2014, Cell.

[9]  Charles E. Vejnar,et al.  CRISPRscan: designing highly efficient sgRNAs for CRISPR/Cas9 targeting in vivo , 2015, Nature Methods.

[10]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[11]  Jin-Soo Kim,et al.  Genome-wide analysis reveals specificities of Cpf1 endonucleases in human cells , 2016, Nature Biotechnology.

[12]  A. Regev,et al.  Cpf1 Is a Single RNA-Guided Endonuclease of a Class 2 CRISPR-Cas System , 2015, Cell.

[13]  Jin-Wu Nam,et al.  In vivo high-throughput profiling of CRISPR–Cpf1 activity , 2016, Nature Methods.

[14]  Meagan E. Sullender,et al.  Optimized sgRNA design to maximize activity and minimize off-target effects of CRISPR-Cas9 , 2015, Nature Biotechnology.

[15]  Martin J. Aryee,et al.  Genome-wide specificities of CRISPR-Cas Cpf1 nucleases in human cells , 2016, Nature Biotechnology.

[16]  E. Lander,et al.  Genetic Screens in Human Cells Using the CRISPR-Cas9 System , 2013, Science.

[17]  J. Kent,et al.  Evaluation of off-target and on-target scoring algorithms and integration into the guide RNA selection tool CRISPOR , 2016, Genome Biology.

[18]  Yongsub Kim,et al.  Generation of knockout mice by Cpf1-mediated gene targeting , 2016, Nature Biotechnology.

[19]  Meagan E. Sullender,et al.  Rational design of highly active sgRNAs for CRISPR-Cas9–mediated gene inactivation , 2014, Nature Biotechnology.

[20]  Hao Li,et al.  Generation of targeted mutant rice using a CRISPR‐Cpf1 system , 2017, Plant biotechnology journal.

[21]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[22]  David R. Kelley,et al.  Basset: learning the regulatory code of the accessible genome with deep convolutional neural networks , 2015, bioRxiv.

[23]  Alejandro Chavez,et al.  sgRNA Scorer 2.0: A Species-Independent Model To Predict CRISPR/Cas9 Activity. , 2017, ACS synthetic biology.

[24]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[25]  Byunghan Lee,et al.  Deep learning in bioinformatics , 2016, Briefings Bioinform..

[26]  Jin-Soo Kim,et al.  Targeted mutagenesis in mice by electroporation of Cpf1 ribonucleoproteins , 2016, Nature Biotechnology.