Large-scale analysis of the human genome: From DNA sequence analysis to the modeling of replication in higher eukaryotes

We explore large-scale nucleotide compositional fluctuations along the human genome through the optics of the wavelet transform microscope. Analysis of the TA and GC skews reveals the existence of strand asymmetries associated to transcription and/or replication. The investigation of 14854 intron-containing genes shows that both skews display a characteristic step-like profile exhibiting sharp transitions between transcribed (finite bias) and non-transcribed (zero bias) regions. As we observe for 7 out of 9 origins of replication experimentally identified so far, the (AT+GC) skew exhibits rather sharp upward jumps, with a linear decreasing profile in between two successive jumps. We describe a multi-scale methodology that allows us to predict 1012 replication origins in the 22 human autosomal chromosomes. We present a model of replication with well-positioned replication origins and random termination sites that accounts for the observed characteristic serrated skew profiles. We emphasize these putative replication initiation zones as regions where the chromatin fiber is likely to be more open so that DNA be easily accessible. In the crowded environment of the cell nucleus, these intrinsic decondensed structural defects actually predisposes the fiber to spontaneously form rosette-like structures that provide an attractive description of genome organization into replication foci that are observed in interphase mammalian nuclei.

[1]  S Nicolay,et al.  Low frequency rhythms in human DNA sequences: a key to the organization of gene location and orientation? , 2004, Physical review letters.

[2]  Caroline Demeret,et al.  Chromatin remodelling and DNA replication: from nucleosomes to loop domains , 2001, Oncogene.

[3]  S. Karlin,et al.  Strand compositional asymmetry in bacterial and large viral genomes. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[4]  J. Lobry,et al.  Asymmetric substitution patterns: a review of possible underlying mutational or selective mechanisms. , 1999, Gene.

[5]  S Nicolay,et al.  Transcription‐coupled TA and GC strand asymmetries in the human genome , 2003, FEBS letters.

[6]  D. Mindell Fundamentals of molecular evolution , 1991 .

[7]  J. R. Lobry,et al.  Properties of a general model of DNA evolution under no-strand-bias conditions , 1995, Journal of Molecular Evolution.

[8]  Aaron Bensimon,et al.  Dynamics of DNA Replication in Mammalian Somatic Cells Nucleotide Pool Modulates Origin Choice and Interorigin Spacing , 2003, Cell.

[9]  Peter R. Cook,et al.  Predicting three-dimensional genome structure from transcriptional activity , 2002, Nature Genetics.

[10]  Ronald Berezney,et al.  Heterogeneity of eukaryotic replicons, replicon clusters, and replication foci , 2000, Chromosoma.

[11]  S. Cebrat,et al.  Is there replication-associated mutational pressure in the Saccharomyces cerevisiae genome? , 2000, Journal of theoretical biology.

[12]  M. Méchali,et al.  Vertebrate HoxB gene expression requires DNA replication , 2003, The EMBO journal.

[13]  A Grigoriev,et al.  Mutations induced by bacteriophage T7 RNA polymerase and their effects on the composition of the T7 genome. , 2000, Journal of molecular biology.

[14]  H. G. Callan Review Lecture - Replication of DNA in the chromosomes of eukaryotes , 1972, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[15]  Y. Yurov,et al.  The units of DNA replication in the mammalian chromosomes: Evidence for a large size of replication units , 1977, Chromosoma.

[17]  M. Bulmer,et al.  Strand symmetry of mutation rates in theβ-globin region , 1991, Journal of Molecular Evolution.

[18]  M. DePamphilis,et al.  Mapping an origin of DNA replication at a single-copy locus in exponentially proliferating mammalian cells , 1990, Molecular and cellular biology.

[19]  Ana Pombo,et al.  Replicon Clusters Are Stable Units of Chromosome Structure: Evidence That Nuclear Organization Contributes to the Efficient Activation and Propagation of S Phase in Human Cells , 1998, The Journal of cell biology.

[20]  E. Rocha Is there a role for replication fork asymmetry in the distribution of genes in bacterial genomes? , 2002, Trends in microbiology.

[21]  David M. Gilbert,et al.  Making Sense of Eukaryotic DNA Replication Origins , 2001, Science.

[22]  Emmanuel Bacry,et al.  Wavelet based fractal analysis of DNA sequences , 1996 .

[23]  E. Tillier,et al.  The Contributions of Replication Orientation, Gene Direction, and Signal Sequences to Base-Composition Asymmetries in Bacterial Genomes , 2000, Journal of Molecular Evolution.

[24]  E. Chargaff Structure and function of nucleic acids as cell constituents. , 1951, Federation proceedings.

[25]  S Nicolay,et al.  From DNA sequence analysis to modeling replication in the human genome. , 2005, Physical review letters.

[26]  M. Méchali,et al.  DNA REPLICATION AND CHROMATIN ASSEMBLY USING XENOPUS EGGS OR EMBRYOS , 1999 .

[27]  Anindya Dutta,et al.  DNA replication in eukaryotic cells. , 2002, Annual review of biochemistry.

[28]  Alain Arneodo,et al.  Transcription-coupled and splicing-coupled strand asymmetries in eukaryotic genomes. , 2004, Nucleic acids research.

[29]  H. Ochman,et al.  Deamination as the basis of strand-asymmetric evolution in transcribed Escherichia coli sequences. , 2001, Molecular biology and evolution.

[30]  H. Ochman,et al.  Strand symmetry around the beta-globin origin of replication in primates. , 2000, Molecular biology and evolution.

[31]  Armin Bunde,et al.  The science of disasters : climate disruptions, heart attacks, and market crashes , 2002 .

[32]  J. Bechhoefer,et al.  Diffusion-limited loop formation of semiflexible polymers: Kramers theory and the intertwined time scales of chain relaxation and closing , 2003, cond-mat/0306546.

[33]  Alain Arneodo,et al.  Replication-associated strand asymmetries in mammalian genomes: toward detection of replication origins. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[34]  François Jacob,et al.  On the Regulation of DNA Replication in Bacteria , 1963 .

[35]  Zlatko Trajanoski,et al.  Generic Features of Tertiary Chromatin Structure as Detected in Natural Chromosomes , 2004, Molecular and Cellular Biology.

[36]  Charles Kooperberg,et al.  Genome-wide DNA replication profile for Drosophila melanogaster: a link between transcription and replication timing , 2002, Nature Genetics.

[37]  Howard Ochman,et al.  Strand Symmetry Around the β-Globin Origin of Replication in Primates , 2000 .

[38]  Wen-Hsiung Li,et al.  Fundamentals of molecular evolution , 1990 .

[39]  Jurg Ott,et al.  Nucleotide frequency variation across human genes. , 2003, Genome research.

[40]  P. Green,et al.  Transcription-associated mutational asymmetry in mammalian evolution , 2003, Nature Genetics.

[41]  M. Méchali,et al.  Chromosomal replication initiates and terminates at random sequences but at regular intervals in the ribosomal DNA of Xenopus early embryos. , 1993, The EMBO journal.

[42]  H. Philippe,et al.  Composition strand asymmetries in prokaryotic genomes: mutational bias and biased gene orientation. , 2001, Comptes rendus de l'Academie des sciences. Serie III, Sciences de la vie.

[43]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[44]  Entropically Driven Helix Formation , 2005, Science.