Linking the DNA strand asymmetry to the spatio-temporal replication program

In paper I, we addressed the impact of the spatio-temporal program on the DNA composition evolution in the case of time homogeneous and neighbor-independent substitution rates. But substitution rates do depend on the flanking nucleotides as exemplified in vertebrates where CpG sites are hypermutable so that the substitution rate $C \rightarrow T$ depends dramatically (ten fold) on whether the cytosine belongs to a CG dinucleotide or not. With the specific goal to account for neighbor-dependence, we revisit our minimal modeling of neutral substitution rates in the human genome. When assuming that $r=CpG \rightarrow TpG$ and its reverse complement $r^{c} =CpG \rightarrow CpA$are (by far) the main neighbor-dependent substitution rates, we demonstrate, using perturbative analysis, that neighbor-dependence does not affect the decomposition of the compositional asymmetry into a transcription- and a replication-associated components, the former increases in magnitude with transcription rate and changes sign with gene orientation, whereas the latter is proportional to the replication fork polarity. Indeed the neighbor dependence case differs from the neighbor-independent model by an additional source term related to the CG dinucleotide content in both the transcription and replication-associated components. We finally discuss the case of time-dependent substitution rates confirming as a very general result the fact that the skew can still be decomposed into a transcription- and a replication-associated components.Graphical abstract

[1]  Dirk Schübeler,et al.  Global Reorganization of Replication Domains During Embryonic Stem Cell Differentiation , 2008, PLoS biology.

[2]  M. Méchali,et al.  Eukaryotic DNA replication origins: many choices for appropriate answers , 2010, Nature Reviews Molecular Cell Biology.

[3]  H. Ochman,et al.  Strand asymmetries in DNA evolution. , 1997, Trends in genetics : TIG.

[4]  A. Danchin,et al.  Universal replication biases in bacteria , 1999, Molecular microbiology.

[5]  William Stafford Noble,et al.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project , 2007, Nature.

[6]  M. Touchon,et al.  Similar compositional biases are caused by very different mutational effects. , 2006, Genome research.

[7]  Alain Arneodo,et al.  Evidence for Sequential and Increasing Activation of Replication Origins along Replication Timing Gradients in the Human Genome , 2011, PLoS Comput. Biol..

[8]  Benjamin Audit,et al.  Replication Fork Polarity Gradients Revealed by Megabase-Sized U-Shaped Replication Timing Domains in Human Cell Lines , 2012, PLoS Comput. Biol..

[9]  I. Amit,et al.  Comprehensive mapping of long range interactions reveals folding principles of the human genome , 2011 .

[10]  J Bechhoefer,et al.  Inferring where and when replication initiates from genome-wide replication timing data. , 2012, Physical review letters.

[11]  Jacek Majewski,et al.  Dependence of mutational asymmetry on gene-expression levels in the human genome. , 2003, American journal of human genetics.

[12]  P. Green,et al.  Bayesian Markov chain Monte Carlo sequence analysis reveals varying neutral substitution patterns in mammalian evolution. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Alain Arneodo,et al.  3D chromatin conformation correlates with replication timing and is conserved in resting cells , 2012, Nucleic acids research.

[14]  A. Bhagwat,et al.  Correlation between transcription and C to T mutations in the non-transcribed DNA strand. , 1998, Biological chemistry.

[15]  Zohar Yakhini,et al.  Global organization of replication time zones of the mouse genome. , 2008, Genome research.

[16]  D. Sherrington Stochastic Processes in Physics and Chemistry , 1983 .

[17]  Jean Thierry-Mieg,et al.  Predictable dynamic program of timing of DNA replication in human cells. , 2009, Genome research.

[18]  S Nicolay,et al.  Transcription‐coupled TA and GC strand asymmetries in the human genome , 2003, FEBS letters.

[19]  P. Green,et al.  Genomic signatures of germline gene expression. , 2010, Genome research.

[20]  A. Bhagwat,et al.  Transcription-induced mutations: increase in C to T mutations in the nontranscribed strand during transcription in Escherichia coli. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[21]  M. Debatisse,et al.  Replication fork movement sets chromatin loop size and origin choice in mammalian cells , 2008, Nature.

[22]  David M. Gilbert,et al.  Evaluating genome-scale approaches to eukaryotic DNA replication , 2010, Nature Reviews Genetics.

[23]  Alain Arneodo,et al.  Open chromatin encoded in DNA sequence is the signature of ‘master’ replication origins in human cells , 2009, Nucleic acids research.

[24]  P. Polak,et al.  Transcription induces strand-specific mutations at the 5' end of human genes. , 2008, Genome research.

[25]  N. Kampen,et al.  Stochastic processes in physics and chemistry , 1981 .

[26]  Laurent Farinelli,et al.  Impact of replication timing on non-CpG and CpG substitution rates in mammalian genomes. , 2010, Genome research.

[27]  H. Ochman,et al.  Asymmetries Generated by Transcription-Coupled Repair in Enterobacterial Genes , 1996, Science.

[28]  A. Bird,et al.  DNA methylation landscapes: provocative insights from epigenomics , 2008, Nature Reviews Genetics.

[29]  S Nicolay,et al.  DNA replication timing data corroborate in silico human replication origin predictions. , 2007, Physical review letters.

[30]  T. Kunkel,et al.  A sensitive genetic assay for the detection of cytosine deamination: determination of rate constants and the activation energy. , 1990, Biochemistry.

[31]  J. Lobry,et al.  Asymmetric substitution patterns: a review of possible underlying mutational or selective mechanisms. , 1999, Gene.

[32]  Temple F. Smith,et al.  Patterns of Genome Organization in Bacteria , 1998, Science.

[33]  Anindya Dutta,et al.  DNA replication in eukaryotic cells. , 2002, Annual review of biochemistry.

[34]  Olivier Hyrien,et al.  Paradoxes of eukaryotic DNA replication: MCM proteins and the random completion problem , 2003, BioEssays : news and reviews in molecular, cellular and developmental biology.

[35]  Terence Hwa,et al.  Substantial Regional Variation in Substitution Rates in the Human Genome: Importance of GC Content, Gene Density, and Telomere-Specific Effects , 2005, Journal of Molecular Evolution.

[36]  Mirit I. Aladjem,et al.  Replication in context: dynamic regulation of DNA replication patterns in metazoans , 2007, Nature Reviews Genetics.

[37]  Christopher B. Burge,et al.  DNA Sequence Evolution with Neighbor-Dependent Mutation , 2003, J. Comput. Biol..

[38]  S Nicolay,et al.  From DNA sequence analysis to modeling replication in the human genome. , 2005, Physical review letters.

[39]  A. Goldar,et al.  DNA replication induces compositional biases in yeast. , 2012, Molecular biology and evolution.

[40]  Michael O Dorschner,et al.  Sequencing newly replicated DNA reveals widespread plasticity in human replication timing , 2009, Proceedings of the National Academy of Sciences.

[41]  Martin Peifer,et al.  Transcription-induced mutational strand bias and its effect on substitution rates in human genes. , 2008, Molecular biology and evolution.

[42]  D. Gilbert,et al.  Epigenomic replication: linking epigenetics to DNA replication. , 2003, BioEssays : news and reviews in molecular, cellular and developmental biology.

[43]  Françoise Argoul,et al.  Multi-scale coding of genomic information: From DNA sequence to genome structure and function , 2011 .

[44]  Gene W. Yeo,et al.  Divergent Transcription from Active Promoters , 2008, Science.

[45]  Alain Arneodo,et al.  Human gene organization driven by the coordination of replication and transcription. , 2007, Genome research.

[46]  Laurent Duret,et al.  Biased gene conversion and the evolution of mammalian genomic landscapes. , 2009, Annual review of genomics and human genetics.

[47]  A. Grigoriev Strand-specific compositional asymmetries in double-stranded DNA viruses. , 1999, Virus research.

[48]  Laurent Duret,et al.  Evolution of synonymous codon usage in metazoans. , 2002, Current opinion in genetics & development.

[49]  E. Chargaff,et al.  Separation of B. subtilis DNA into complementary strands. 3. Direct analysis. , 1968, Proceedings of the National Academy of Sciences of the United States of America.

[50]  Leighton J. Core,et al.  Nascent RNA Sequencing Reveals Widespread Pausing and Divergent Initiation at Human Promoters , 2008, Science.

[51]  Alain Arneodo,et al.  Transcription-coupled and splicing-coupled strand asymmetries in eukaryotic genomes. , 2004, Nucleic acids research.

[52]  L. Duret,et al.  The relationship between DNA replication and human genome organization. , 2009, Molecular biology and evolution.

[53]  S Nicolay,et al.  Bifractality of human DNA strand-asymmetry profiles results from transcription. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[54]  C Saccone,et al.  Asymmetrical directional mutation pressure in the mitochondrial genome of mammals. , 1998, Molecular biology and evolution.

[55]  Alain Arneodo,et al.  Replication-associated strand asymmetries in mammalian genomes: toward detection of replication origins. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[56]  N. Sueoka Intrastrand parity rules of DNA base composition and usage biases of synonymous codons , 1995, Journal of Molecular Evolution.

[57]  Amos Tanay,et al.  Comparative Analysis of DNA Replication Timing Reveals Conserved Large-Scale Chromosomal Architecture , 2010, PLoS genetics.

[58]  M. Méchali,et al.  DNA replication origins: from sequence specificity to epigenetics , 2001, Nature Reviews Genetics.

[59]  Renata Retkute,et al.  Mathematical modelling of whole chromosome replication , 2010, Nucleic acids research.

[60]  J. Lobry Asymmetric substitution patterns in the two DNA strands of bacteria. , 1996, Molecular biology and evolution.

[61]  David M. Gilbert,et al.  Making Sense of Eukaryotic DNA Replication Origins , 2001, Science.

[62]  J. Lobry,et al.  Origin of Replication of Mycoplasma genitalium , 1996, Science.

[63]  S. Karlin,et al.  Over- and under-representation of short oligonucleotides in DNA sequences. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[64]  S. Dalton,et al.  Evolutionarily conserved replication timing profiles predict long-range chromatin interactions and distinguish closely related cell types. , 2010, Genome research.

[65]  S. Cebrat,et al.  Is there replication-associated mutational pressure in the Saccharomyces cerevisiae genome? , 2000, Journal of theoretical biology.

[66]  Marie-France Sagot,et al.  Analysis of fine-scale mammalian evolutionary breakpoints provides new insight into their relation to genome organisation , 2009 .

[67]  Ronald Berezney,et al.  Heterogeneity of eukaryotic replicons, replicon clusters, and replication foci , 2000, Chromosoma.

[68]  M. Kimura Evolutionary Rate at the Molecular Level , 1968, Nature.

[69]  L. Duret,et al.  GC-content evolution in mammalian genomes: the biased gene conversion hypothesis. , 2001, Genetics.

[70]  Alain Arneodo,et al.  Replication-associated mutational asymmetry in the human genome. , 2011, Molecular biology and evolution.

[71]  G. Helt,et al.  Transcriptional Maps of 10 Human Chromosomes at 5-Nucleotide Resolution , 2005, Science.

[72]  Peter F. Arndt,et al.  Identification and Measurement of Neigbor Dependent Nucleotide Substitution Processes , 2005, German Conference on Bioinformatics.

[73]  John Bechhoefer,et al.  Modeling genome-wide replication kinetics reveals a mechanism for regulation of replication timing , 2010, Molecular systems biology.

[74]  S. Karlin,et al.  Strand compositional asymmetry in bacterial and large viral genomes. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[75]  C. Lobry,et al.  Evolution of DNA base composition under no-strand-bias conditions when the substitution rates are not constant. , 1999, Molecular biology and evolution.

[76]  J. M. Comeron,et al.  Selective and Mutational Patterns Associated With Gene Expression in Humans , 2004, Genetics.

[77]  P. Polak,et al.  Long-Range Bidirectional Strand Asymmetries Originate at CpG Islands in the Human Genome , 2009, Genome biology and evolution.

[78]  Mikkel H. Schierup,et al.  RNA Exosome Depletion Reveals Transcription Upstream of Active Human Promoters , 2008, Science.

[79]  J. Hamlin,et al.  A revisionist replicon model for higher eukaryotic genomes , 2008, Journal of cellular biochemistry.

[80]  T. Kunkel,et al.  Dividing the workload at a eukaryotic replication fork. , 2008, Trends in cell biology.

[81]  J. Stamatoyannopoulos,et al.  Human mutation rate associated with DNA replication timing , 2009, Nature Genetics.

[82]  Olivier Hyrien,et al.  Mathematical modelling of eukaryotic DNA replication , 2009, Chromosome Research.

[83]  Terrence S. Furey,et al.  The UCSC Genome Browser Database , 2003, Nucleic Acids Res..

[84]  J. Svejstrup Transcription: Mechanisms of transcription-coupled DNA repair , 2002, Nature Reviews Molecular Cell Biology.

[85]  J.-B. Gouéré,et al.  Solvable models of neighbor-dependent substitution processes. , 2008, Mathematical biosciences.

[86]  François Jacob,et al.  On the Regulation of DNA Replication in Bacteria , 1963 .

[87]  N. G. Van Kampen,et al.  Chapter III – STOCHASTIC PROCESSES , 2007 .

[88]  P. Green,et al.  Transcription-associated mutational asymmetry in mammalian evolution , 2003, Nature Genetics.

[89]  S T Hess,et al.  Wide variations in neighbor-dependent substitution rates. , 1994, Journal of molecular biology.

[90]  H. Ellegren,et al.  Conservation of Neutral Substitution Rate and Substitutional Asymmetries in Mammalian Genes , 2010, Genome biology and evolution.

[91]  Alain Arneodo,et al.  Wavelet-based method to disentangle transcription- and replication-associated strand asymmetries in mammalian genomes , 2010 .

[92]  K. Kinzler,et al.  The Antisense Transcriptomes of Human Cells , 2008, Science.

[93]  Laurent Duret,et al.  The Impact of Recombination on Nucleotide Substitutions in the Human Genome , 2008, PLoS genetics.

[94]  S Nicolay,et al.  Low frequency rhythms in human DNA sequences: a key to the organization of gene location and orientation? , 2004, Physical review letters.

[95]  A Grigoriev,et al.  Mutations induced by bacteriophage T7 RNA polymerase and their effects on the composition of the T7 genome. , 2000, Journal of molecular biology.

[96]  A. Arneodo,et al.  Linking the DNA strand asymmetry to the spatio-temporal replication program , 2012, The European physical journal. E, Soft matter.

[97]  J. R. Lobry,et al.  Properties of a general model of DNA evolution under no-strand-bias conditions , 1995, Journal of Molecular Evolution.

[98]  E. Tillier,et al.  The Contributions of Replication Orientation, Gene Direction, and Signal Sequences to Base-Composition Asymmetries in Bacterial Genomes , 2000, Journal of Molecular Evolution.

[99]  H. Ochman,et al.  Deamination as the basis of strand-asymmetric evolution in transcribed Escherichia coli sequences. , 2001, Molecular biology and evolution.