The base composition of the genes is correlated with the secondary structures of the encoded proteins.

The analysis of a non-redundant set of human proteins, for which both the crystallographic structures and the corresponding gene sequences are available, shows that bases at the third codon position are non-uniformly distributed along the coding sequences. Significant compositional differences are found when the gene regions corresponding to the different secondary structures of the proteins are compared. Inter- and intra-structure differences are most pronounced in the GC-richest genes. These results are not compatible with any of the proposed hypotheses based on a neutral process of formation/maintenance of the high GC(3) levels of the genes localized in the GC-richest isochores of the human genome.
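As an illustration of the kind of comparison described above, the following is a minimal sketch, not the authors' actual procedure, of how the GC level at third codon positions could be computed separately for each secondary-structure class. It assumes a coding sequence whose codons align one-to-one with the residues of the protein, and a per-residue string of single-letter structure labels (here H for helix and E for strand, chosen for illustration). The function name `gc3_by_structure` and its parameters are hypothetical.

```python
from collections import defaultdict


def gc3_by_structure(cds, ss):
    """Return {structure_label: fraction of codons with G or C at position 3}.

    cds: coding sequence (string of A/C/G/T), codon i encodes residue i.
    ss:  per-residue secondary-structure labels, one character per residue.
    Both inputs and the label alphabet are assumptions of this sketch.
    """
    if len(cds) < 3 * len(ss):
        raise ValueError("coding sequence is shorter than 3 x number of residues")

    counts = defaultdict(lambda: [0, 0])  # label -> [GC count, codon count]
    for i, label in enumerate(ss):
        third_base = cds[3 * i + 2].upper()  # third position of codon i
        counts[label][1] += 1
        if third_base in "GC":
            counts[label][0] += 1

    return {label: gc / total for label, (gc, total) in counts.items()}


# Toy input: four alanine codons, two labelled helix (H), two strand (E).
example = gc3_by_structure("GCCGCGGCAGCT", "HHEE")
print(example)
```

Comparing the per-class values across many genes, grouped by overall GC level, would correspond to the inter-structure comparison described in the abstract; any statistical testing is left out of this sketch.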
