GC composition of the human genome: in search of isochores.

The isochore theory, proposed nearly three decades ago, depicts the mammalian genome as a mosaic of long, fairly homogeneous genomic regions that are characterized by their guanine and cytosine (GC) content. The human genome, for instance, was claimed to consist of five distinct isochore families: L1, L2, H1, H2, and H3, with GC contents of <37%, 37%-42%, 42%-47%, 47%-52%, and >52%, respectively. In this paper, we address the question of the validity of the isochore theory through a rigorous sequence-based analysis of the human genome. Toward this end, we adopt a set of six attributes that are generally claimed to characterize isochores and statistically test their veracity against the available draft sequence of the complete human genome. By the selection criteria used in this study: distinctiveness, homogeneity, and minimal length of 300 kb, we identify 1,857 genomic segments that warrant the label "isochore." These putative isochores are nonuniformly scattered throughout the genome and cover about 41% of the human genome. We found that a four-family model of putative isochores is the most parsimonious multi-Gaussian model that can be fitted to the empirical data. These families, however, are GC poor, with mean GC contents of 35%, 38%, 41%, and 48% and do not resemble the five isochore families in the literature. Moreover, due to large overlaps among the families, it is impossible to classify genomic segments into isochore families reliably, according to compositional properties alone. These findings undermine the utility of the isochore theory and seem to indicate that the theory may have reached the limits of its usefulness as a description of genomic compositional structures.

[1]  G. Bernardi,et al.  Codon usage and genome composition , 2005, Journal of Molecular Evolution.

[2]  G. Bernardi,et al.  Compositional gene landscapes in vertebrates. , 2004, Genome research.

[3]  I. Dunham,et al.  DNA sequence and analysis of human chromosome 9 , 2003, Nature.

[4]  G. Bernardi,et al.  Compositional Mapping of Mouse Chromosomes and Identification of the Gene-Rich Regions , 1997, Chromosome Research.

[5]  L. Duret,et al.  Statistical analysis of vertebrate sequences reveals that long genes are scarce in GC-rich isochores , 1995, Journal of Molecular Evolution.

[6]  R. Storn,et al.  Differential Evolution , 2004 .

[7]  G. Bernardi,et al.  Genes, isochores and bands in human chromosomes 21 and 22 , 2004, Chromosome Research.

[8]  Dan Graur,et al.  kinase localization of a novel catalytic subunit of casein Translated Alu sequence determines nuclear , 2002 .

[9]  Chun-Ting Zhang,et al.  Identification of isochore boundaries in the human genome using the technique of wavelet multiresolution analysis. , 2003, Biochemical and biophysical research communications.

[10]  Ren Zhang,et al.  An isochore map of the human genome based on the Z curve method. , 2003, Gene.

[11]  Alexander E Vinogradov,et al.  Isochores and tissue-specificity. , 2003, Nucleic acids research.

[12]  James M. Eldred,et al.  The DNA sequence of human chromosome 7 , 2003, Nature.

[13]  Guy Perrière,et al.  G+C3 structuring along the genome: a common feature in prokaryotes. , 2003, Molecular biology and evolution.

[14]  Patrick Wincker,et al.  The DNA sequence and analysis of human chromosome 14 , 2023, Nature.

[15]  I. Dunham,et al.  The DNA sequence and analysis of human chromosome 6 , 2003, Nature.

[16]  L. Duret,et al.  Vanishing GC-rich isochores in mammalian genomes. , 2002, Genetics.

[17]  G. Bernardi,et al.  Isochores, GC3 and mutation biases in the human genome. , 2002, Gene.

[18]  Ramón Román-Roldán,et al.  Isochore chromosome maps of the human genome. , 2002, Gene.

[19]  Wentian Li,et al.  Are isochore sequences homogeneous? , 2002, Gene.

[20]  Wentian Li,et al.  Isochores Merit the Prefix 'Iso' , 2002, Genome Biology.

[21]  Ivo Grosse,et al.  Applications of Recursive Segmentation to the Analysis of DNA Sequences , 2002, Comput. Chem..

[22]  I. Grosse,et al.  Analysis of symbolic sequences using the Jensen-Shannon divergence. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[23]  Jan Paces,et al.  A compact view of isochores in the draft human genome sequence , 2002, FEBS letters.

[24]  Laurent Duret,et al.  Expected Relationship Between the Silent Substitution Rate and the GC Content: Implications for the Evolution of Isochores , 2002, Journal of Molecular Evolution.

[25]  D R Bentley,et al.  The DNA sequence and comparative analysis of human chromosome 20 , 2004, Nature.

[26]  G Bernardi,et al.  Misunderstandings about isochores. Part 1. , 2001, Gene.

[27]  G Bernardi,et al.  Compositional heterogeneity within and among isochores in mammalian genomes. II. Some general comments. , 2001, Gene.

[28]  G Bernardi,et al.  Compositional heterogeneity within and among isochores in mammalian genomes. I. CsCl and sequence analyses. , 2001, Gene.

[29]  L. Duret,et al.  GC-content evolution in mammalian genomes: the biased gene conversion hypothesis. , 2001, Genetics.

[30]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[31]  D Häring,et al.  No isochores in the human chromosomes 21 and 22? , 2001, Biochemical and biophysical research communications.

[32]  The Wellcome Trust Sanger Institute The DNA sequence and comparative analysis of human chromosome 20 , 2001 .

[33]  International Human Genome Sequencing Consortium Initial sequencing and analysis of the human genome , 2001, Nature.

[34]  A. Nekrutenko,et al.  Assessment of compositional heterogeneity within and between eukaryotic genomes. , 2000, Genome research.

[35]  M. Hattori,et al.  The DNA sequence of human chromosome 21 , 2000, Nature.

[36]  G Bernardi,et al.  Isochores and the evolutionary genomics of vertebrates. , 2000, Gene.

[37]  Melanie E. Goward,et al.  The DNA sequence of human chromosome 22 , 1999, Nature.

[38]  D. Higgs,et al.  The pattern of replication at a human telomeric region (16p13.3): its relationship to chromosome structure and gene expression. , 1999, Human molecular genetics.

[39]  H. Stanley,et al.  Analysis of DNA sequences using methods of statistical physics , 1998 .

[40]  G Bernardi,et al.  Methylation patterns in the isochores of vertebrate genomes. , 1997, Gene.

[41]  G Bernardi,et al.  The gene distribution of the human genome. , 1996, Gene.

[42]  P. Bernaola-Galván,et al.  Compositional segmentation and long-range fractal correlations in DNA sequences. , 1996, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[43]  G. Bernardi,et al.  Regional specificity of HTLV-I proviral integration in the human genome. , 1994, Gene.

[44]  C. Peng,et al.  Mosaic organization of DNA nucleotides. , 1994, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[45]  G. Bernardi,et al.  Nonrandom distribution of MMTV proviral sequences in the mouse genome. , 1987, Nucleic acids research.

[46]  G Bernardi,et al.  The mosaic genome of warm-blooded vertebrates. , 1985, Science.

[47]  G. Bernardi,et al.  The major components of the mouse and human genomes. 2. Reassociation kinetics. , 1981, European journal of biochemistry.

[48]  G Bernardi,et al.  The major components of the mouse and human genomes. 1. Preparation, basic properties and compositional heterogeneity. , 1981, European journal of biochemistry.

[49]  G Bernardi,et al.  An approach to the organization of eukaryotic genomes at a macromolecular level. , 1976, Journal of molecular biology.

[50]  G Bernardi,et al.  An analysis of eukaryotic genomes by density gradient centrifugation. , 1976, Journal of molecular biology.

[51]  G Bernardi,et al.  An analysis of the bovine genome by Cs2SO4-Ag density gradient centrifugation. , 1973, Journal of molecular biology.

[52]  Ross B. Inman,et al.  A denaturation map of the λ phage DNA molecule determined by electron microscopy , 1966 .

[53]  R B Inman,et al.  A denaturation map of the lambda phage DNA molecule determined by electron microscopy. , 1966, Journal of molecular biology.

[54]  R. Sonder SOME GENERAL COMMENTS , 1965 .