An Improved Implementation of Codon Adaptation Index

Codon adaptation index is a widely used index for characterizing gene expression in general and translation efficiency in particular. Current computational implementations have a number of problems leading to various systematic biases. I illustrate these problems and provide a better computer implementation to solve these problems. The improved CAI can predict protein production better than CAI from other commonly used implementations.

[1]  I. Longden,et al.  EMBOSS: the European Molecular Biology Open Software Suite. , 2000, Trends in genetics : TIG.

[2]  Hanah Margalit,et al.  Relative predicted protein levels of functionally associated proteins are conserved across organisms , 2005, Nucleic acids research.

[3]  X. Xia,et al.  DAMBE: software package for data analysis in molecular biology and evolution. , 2001, The Journal of heredity.

[4]  Alessandra Carbone,et al.  Codon adaptation index as a measure of dominating codon bias , 2003, Bioinform..

[5]  Xuhua Xia,et al.  Data Analysis in Molecular Biology and Evolution , 2002, Springer US.

[6]  Josep M. Comeron,et al.  An Evaluation of Measures of Synonymous Codon Usage Bias , 1998, Journal of Molecular Evolution.

[7]  Nobuyoshi Sugaya,et al.  Causes for the large genome size in a cyanobacterium Anabaena sp. PCC7120. , 2004, Genome informatics. International Conference on Genome Informatics.

[8]  B. Futcher,et al.  A Sampling of the Yeast Proteome , 1999, Molecular and Cellular Biology.

[9]  Weiwen Zhang,et al.  Predicted highly expressed genes in the genomes of Streptomyces coelicolor and Streptomyces avermitilis and the implications for their metabolism. , 2005, Microbiology.

[10]  J. Bennetzen,et al.  Codon selection in yeast. , 1982, The Journal of biological chemistry.

[11]  K. H. Wolfe,et al.  Relationship of codon bias to mRNA concentration and protein length in Saccharomyces cerevisiae , 2000, Yeast.

[12]  P. Sharp,et al.  The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications. , 1987, Nucleic acids research.

[13]  Antonio Lazcano,et al.  Comparative Analysis of Methodologies for the Detection of Horizontally Transferred Genes: A Reassessment of First-Order Markov Models , 2005, Silico Biol..

[14]  T. Ikemura Codon usage and tRNA content in unicellular and multicellular organisms. , 1985, Molecular biology and evolution.

[15]  F. Wright The 'effective number of codons' used in a gene. , 1990, Gene.

[16]  Aristotelis Tsirigos,et al.  A new computational method for the detection of horizontal gene transfer events , 2005, Nucleic acids research.

[17]  S. Barray,et al.  Molecular evolution of the major outer-membrane protein gene (oprF) of Pseudomonas. , 2006, Microbiology.

[18]  Aristotelis Tsirigos,et al.  A sensitive, support-vector-machine method for the detection of horizontal gene transfers in viral, archaeal and bacterial genomes , 2005, Nucleic acids research.

[19]  G. Armengol,et al.  A theoretical analysis of codon adaptation index of the Boophilus microplus bm86 gene directed to the optimization of a DNA vaccine. , 2006, Journal of theoretical biology.

[20]  Paul M. Sharp,et al.  Codon usage in yeast: cluster analysis clearly differentiates highly and lowly expressed genes , 1986, Nucleic Acids Res..

[21]  S. Gygi,et al.  Correlation between Protein and mRNA Abundance in Yeast , 1999, Molecular and Cellular Biology.