Analysis of synonymous codon usage in SARS Coronavirus and other viruses in the Nidovirales

Abstract: In this study, we calculated the codon usage bias in severe acute respiratory syndrome Coronavirus (SARSCoV) and performed a comparative analysis of synonymous codon usage patterns in SARSCoV and 10 other evolutionarily related viruses in the Nidovirales. Although codon usage bias varies significantly among different SARSCoV genes, the overall bias in SARSCoV is slight and is mainly determined by the base composition at the third codon position. By comparing synonymous codon usage patterns in different viruses, we observed that the synonymous codon usage pattern in these virus genes is virus specific and phylogenetically conserved, but not host specific. Phylogenetic analysis based on codon usage patterns suggested that SARSCoV diverged far from all three known groups of Coronavirus. Compositional constraints could explain most of the variation in synonymous codon usage among these virus genes, while gene function is also correlated with synonymous codon usage to a certain extent. However, translational selection and gene length have no effect on the variation in synonymous codon usage in these virus genes.
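The abstract does not say which measures were used to quantify codon usage bias. As a minimal sketch, assuming relative synonymous codon usage (RSCU) and the G+C fraction at third codon positions (GC3) as illustrative measures, the kind of calculation involved might look like the following. The function names `codons`, `rscu`, and `gc3` are hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict

# Standard genetic code in the conventional TCAG ordering (assumption:
# the standard table applies to the sequences being analysed).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def codons(seq):
    """Split an in-frame coding sequence into codons; RNA 'U' is read as 'T'."""
    seq = seq.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def rscu(seq):
    """RSCU per codon: observed count divided by the count expected if all
    synonymous codons of that amino acid were used equally.
    Stop codons and single-codon amino acids (Met, Trp) are skipped."""
    counts = Counter(c for c in codons(seq) if c in CODON_TABLE)
    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        families[aa].append(codon)
    values = {}
    for aa, family in families.items():
        if aa == "*" or len(family) == 1:
            continue
        total = sum(counts[c] for c in family)
        if total == 0:
            continue  # amino acid absent from this sequence
        for c in family:
            values[c] = counts[c] * len(family) / total
    return values


def gc3(seq):
    """Fraction of G or C at the third position, over all recognised codons.
    This is a simplification: it does not exclude Met, Trp, or stop codons."""
    third = [c[2] for c in codons(seq) if c in CODON_TABLE]
    return sum(b in "GC" for b in third) / len(third) if third else 0.0
```

An RSCU value of 1 means a codon is used as often as expected under equal usage of its synonyms; values above or below 1 indicate over- or under-representation. Comparing per-gene RSCU vectors across viruses is one common way to compare synonymous codon usage patterns, though the specific comparison method here is not stated in the abstract.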
