CUT&RUNTools: a flexible pipeline for CUT&RUN processing and footprint analysis

We introduce CUT&RUNTools, a flexible, general pipeline that facilitates the identification of chromatin-associated protein binding and genomic footprinting analysis from antibody-targeted CUT&RUN primary cleavage data. CUT&RUNTools extracts endonuclease cut-site information from the sequences of short read fragments and produces single-locus binding estimates, aggregate motif footprints, and informative visualizations that support the high-resolution mapping capability of CUT&RUN. CUT&RUNTools is available at https://bitbucket.org/qzhudfci/cutruntools/.
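
As a rough illustration of what extracting cut sites and building an aggregate motif footprint can involve, the following Python sketch treats the two ends of each aligned fragment as cut sites and sums cut counts at fixed offsets around a set of motif centers. It is a minimal sketch under stated assumptions, not the CUT&RUNTools implementation: the function names `cut_sites` and `aggregate_footprint`, the 0-based half-open fragment coordinates, and the toy data are all hypothetical.

```python
from collections import Counter

def cut_sites(fragments):
    """Yield (chrom, position) for both ends of each fragment.

    Assumption: fragments are (chrom, start, end) tuples in 0-based,
    half-open coordinates, and each fragment end marks one cut.
    """
    for chrom, start, end in fragments:
        yield chrom, start      # left cut: first base of the fragment
        yield chrom, end - 1    # right cut: last base of the fragment

def aggregate_footprint(fragments, motif_centers, flank=5):
    """Sum cut counts at each offset in [-flank, +flank] over all motif centers."""
    counts = Counter(cut_sites(fragments))
    profile = [0] * (2 * flank + 1)
    for chrom, center in motif_centers:
        for offset in range(-flank, flank + 1):
            # Counter returns 0 for positions with no observed cuts.
            profile[offset + flank] += counts[(chrom, center + offset)]
    return profile

# Hypothetical toy data: three fragments near one motif centered at chr1:100.
fragments = [("chr1", 90, 98), ("chr1", 103, 150), ("chr1", 60, 98)]
motif_centers = [("chr1", 100)]
profile = aggregate_footprint(fragments, motif_centers, flank=5)
```

In this toy profile, cuts pile up on either side of the motif center while the center itself has none, which is the depletion pattern that footprint analysis looks for. A real analysis would read fragments from alignment files and would account for strand and sequence bias.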

Citation: Guo-Cheng Yuan, et al. CUT&RUNTools: a flexible pipeline for CUT&RUN processing and footprint analysis. Genome Biology, 2019.