A Bioconductor Based Workflow for Z-DNA Region Detection and Biological Inference

Z-DNA is an alternative conformation of the DNA molecule implied in regulation of gene expression. However, the exact role of this structure in cell metabolism is not yet fully understood. Here we present a novel Z-DNA analysis workflow using the R software environment which aims to investigate Z-DNA forming regions (ZDRs) throughout the genome. It combines thermodynamic analysis of the well-known software Z-Catcher with biological data manipulation capabilities of several Bioconductor packages. We employed our methodology in the human chromosome 14 as a case study. With that, we established a correlation of ZDRs with transcription start sites (TSSs) which is in agreement with previous reports. In addition, our workflow was able to show that ZDRs which are positioned inside genes tend to occur in intronic sequences rather than exonic and that ZDRs upstream to TSSs may have a positive correlation with the up-regulation of RNA polymerase activity.

[1]  A. Rich,et al.  The Biology of Left-handed Z-DNA (*) , 1996, The Journal of Biological Chemistry.

[2]  N. Carter,et al.  A Complete Map of the Human Immunoglobulin VH Locus , 1995, Annals of the New York Academy of Sciences.

[3]  J. Wang,et al.  Supercoiling of the DNA template during transcription. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[4]  A. Rich,et al.  A computer aided thermodynamic approach for predicting the formation of Z‐DNA in naturally occurring sequences. , 1986, The EMBO journal.

[5]  N. D. Clarke,et al.  Integrative model of genomic factors for determining binding site selection by estrogen receptor-α , 2010, Molecular systems biology.

[6]  P. Seeburg,et al.  RNA editing in brain controls a determinant of ion flow in glutamate-gated channels , 1991, Cell.

[7]  A. Rich,et al.  Chicken double-stranded RNA adenosine deaminase has apparent specificity for Z-DNA. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[9]  Hideaki Sugawara,et al.  The Sequence Read Archive , 2010, Nucleic Acids Res..

[10]  Alfred Nordheim,et al.  THE CHEMISTRY AND BIOLOGY OF LEFT-HANDED Z-DNA , 1984 .

[11]  W. Huber,et al.  which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. MAnorm: a robust model for quantitative comparison of ChIP-Seq data sets , 2011 .

[12]  Alexander Rich,et al.  Z-DNA: the long road to biological function , 2003, Nature Reviews Genetics.

[13]  A. Rich,et al.  Zuotin, a putative Z‐DNA binding protein in Saccharomyces cerevisiae. , 1992, The EMBO journal.

[14]  Simon Tavaré,et al.  BayesPeak—an R package for analysing ChIP-seq data , 2011, Bioinform..

[15]  A. Mortazavi,et al.  Computation for ChIP-seq and RNA-seq studies , 2009, Nature Methods.

[16]  A Herbert,et al.  RNA editing, introns and evolution. , 1996, Trends in genetics : TIG.

[17]  Ting Wang,et al.  ENCODE whole-genome data in the UCSC Genome Browser , 2009, Nucleic Acids Res..