Base composition and expression level of human genes.

Genes are not uniformly distributed in the human genome: their concentration is highest in the GC-rich isochores. The high GC level of genes located in GC-rich isochores also affects the amino acid frequencies and the hydrophobicity of the proteins they encode. It has been hypothesized that gene expression level, too, is higher in GC-rich than in GC-poor isochores [Mol. Biol. Evol. 10 (1993) 186]. In the present paper, several features of human genes and proteins were investigated, namely expression level, coding and non-coding lengths, and hydrophobicity. The results support this hypothesis: all the parameters studied converge on the same conclusion, that the average expression level of GC-rich genes is significantly higher than that of GC-poor genes.
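The comparison described above can be illustrated with a minimal sketch. It assumes each gene is given as a coding sequence paired with a single expression value. The function names, the 0.5 GC threshold used to split the genes, and the toy sequences and expression values are all hypothetical and are not taken from the paper, which does not state its method in this abstract.

```python
from statistics import mean


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C among the A, C, G, T bases of seq."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        raise ValueError("sequence contains no A, C, G or T")
    return (seq.count("G") + seq.count("C")) / acgt


def mean_expression_by_gc(genes, threshold=0.5):
    """Split genes into GC-poor and GC-rich groups and average expression.

    genes: iterable of (sequence, expression) pairs.
    threshold: hypothetical cut-off; genes at or above it count as GC-rich.
    Returns (mean_gc_poor, mean_gc_rich).
    """
    rich, poor = [], []
    for seq, expression in genes:
        group = rich if gc_fraction(seq) >= threshold else poor
        group.append(expression)
    if not rich or not poor:
        raise ValueError("both groups need at least one gene")
    return mean(poor), mean(rich)


# Invented toy data, for illustration only (not results from the paper).
toy_genes = [
    ("ATGGCCGCGCTGGGC", 12.0),
    ("ATGAAATTTGATTAA", 3.0),
    ("ATGCCCGGGCGCTGA", 9.0),
    ("ATGTTTAAATATTAG", 5.0),
]

mean_poor, mean_rich = mean_expression_by_gc(toy_genes)
print(f"GC-poor mean: {mean_poor}, GC-rich mean: {mean_rich}")
```

A real analysis would also need a significance test and a justified way of assigning genes to compositional classes; this sketch only shows the grouping and averaging step.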
