AlleleAnalyzer: a tool for personalized and allele-specific sgRNA design

The CRISPR/Cas system is a highly specific genome editing tool capable of distinguishing alleles differing by even a single base pair. Target sites might carry genetic variations that are not distinguishable by sgRNA designing tools based on one reference genome. AlleleAnalyzer is an open-source software that incorporates single-nucleotide variants and short insertions and deletions to design sgRNAs for precisely editing 1 or multiple haplotypes of a sequenced genome, currently supporting 11 Cas proteins. It also leverages patterns of shared genetic variation to optimize sgRNA design for different human populations. AlleleAnalyzer is available at https://github.com/keoughkath/AlleleAnalyzer.

[1]  Bartek Wilczynski,et al.  Biopython: freely available Python tools for computational molecular biology and bioinformatics , 2009, Bioinform..

[2]  Nir Hacohen,et al.  A genome-wide CRISPR screen identifies a restricted set of HIV host dependency factors , 2016, Nature Genetics.

[3]  Joshua J. Breunig,et al.  In Vivo CRISPR/Cas9 Gene Editing Corrects Retinal Dystrophy in the S334ter-3 Rat Model of Autosomal Dominant Retinitis Pigmentosa , 2015, Molecular therapy : the journal of the American Society of Gene Therapy.

[4]  Tanneguy Redarce,et al.  Automatic Lip-Contour Extraction and Mouth-Structure Segmentation in Images , 2011, Computing in Science & Engineering.

[5]  Jean-Claude Tardif,et al.  Human genetic variation alters CRISPR-Cas9 on- and off-targeting specificity at therapeutically implicated loci , 2017, Proceedings of the National Academy of Sciences.

[6]  John D. Hunter,et al.  Matplotlib: A 2D Graphics Environment , 2007, Computing in Science & Engineering.

[7]  Hanspeter Pfister,et al.  UpSet: Visualization of Intersecting Sets , 2014, IEEE Transactions on Visualization and Computer Graphics.

[8]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[9]  P. Wipf,et al.  New Pyrazolopyrimidine Inhibitors of Protein Kinase D as Potent Anticancer Agents for Prostate Cancer Cells , 2013, PloS one.

[10]  George M. Church,et al.  In vivo gene editing in dystrophic mouse muscle and muscle stem cells , 2016, Science.

[11]  S. Tsang,et al.  BEST1: the Best Target for Gene and Cell Therapies. , 2015, Molecular therapy : the journal of the American Society of Gene Therapy.

[12]  Brent S. Pedersen,et al.  Efficient "pythonic" access to FASTA files using pyfaidx , 2015, PeerJ Prepr..

[13]  Qiaobing Xu,et al.  Treatment of autosomal dominant hearing loss by in vivo delivery of genome editing agents , 2017, Nature.

[14]  Meagan E. Sullender,et al.  Rational design of highly active sgRNAs for CRISPR-Cas9–mediated gene inactivation , 2014, Nature Biotechnology.

[15]  David A. Scott,et al.  Implications of human genetic variation in CRISPR-based therapeutic genome editing , 2017, Nature Medicine.

[16]  Xuezhu Feng,et al.  Dual sgRNA-directed gene knockout using CRISPR/Cas9 technology in Caenorhabditis elegans , 2014, Scientific Reports.

[17]  A. Hyman,et al.  Stem cells: the new “model organism” , 2017, Molecular biology of the cell.

[18]  Iain Dunning,et al.  PuLP : A Linear Programming Toolkit for Python , 2011 .

[19]  Hilde van der Togt,et al.  Publisher's Note , 2003, J. Netw. Comput. Appl..

[20]  M. Nesbit,et al.  Towards personalised allele-specific CRISPR gene editing to treat autosomal dominant disorders , 2017, Scientific Reports.

[21]  Max A. Horlbeck,et al.  Compact and highly active next-generation libraries for CRISPR-mediated gene repression and activation , 2016, eLife.

[22]  Wes McKinney,et al.  Data Structures for Statistical Computing in Python , 2010, SciPy.

[23]  Gaelen T. Hess,et al.  Genome-scale measurement of off-target activity using Cas9 toxicity in high-throughput screens , 2017, Nature Communications.

[24]  Gang Wang,et al.  Targeted and genome-wide sequencing reveal single nucleotide variations impacting specificity of Cas9 in human stem cells , 2014, Nature Communications.

[25]  Kenneth L. Clarkson,et al.  Algorithms for Polytope Covering and Approximation , 1993, WADS.

[26]  Terrence S. Furey,et al.  The UCSC Table Browser data retrieval tool , 2004, Nucleic Acids Res..

[27]  B. Browning,et al.  Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. , 2007, American journal of human genetics.

[28]  Michael L. Waskom,et al.  mwaskom/seaborn: v0.9.0 (July 2018) , 2018 .

[29]  J. Kent,et al.  Evaluation of off-target and on-target scoring algorithms and integration into the guide RNA selection tool CRISPOR , 2016, Genome Biology.

[30]  Gaël Varoquaux,et al.  The NumPy Array: A Structure for Efficient Numerical Computation , 2011, Computing in Science & Engineering.

[31]  Meagan E. Sullender,et al.  Optimized sgRNA design to maximize activity and minimize off-target effects of CRISPR-Cas9 , 2015, Nature Biotechnology.

[32]  Tammy Gillis,et al.  Permanent inactivation of Huntington's disease mutation by personalized allele-specific CRISPR/Cas9. , 2016, Human molecular genetics.

[33]  Mauricio O. Carneiro,et al.  From FastQ Data to High‐Confidence Variant Calls: The Genome Analysis Toolkit Best Practices Pipeline , 2013, Current protocols in bioinformatics.