Incorporation of transition to transversion ratio and nonsense mutations, improves the estimation of the number of synonymous and non-synonymous sites in codons

Abstract A common approach to estimate the strength and direction of selection acting on protein coding sequences is to calculate the dN/dS ratio. The method to calculate dN/dS has been widely used by many researchers and many critical reviews have been made on its application after the proposition by Nei and Gojobori in 1986. However, the method is still evolving considering the non-uniform substitution rates and pretermination codons. In our study of SNPs in 586 genes across 156 Escherichia coli strains, synonymous polymorphism in 2-fold degenerate codons were higher in comparison to that in 4-fold degenerate codons, which could be attributed to the difference between transition (Ti) and transversion (Tv) substitution rates where the average rate of a transition is four times more than that of a transversion in general. We considered both the Ti/Tv ratio, and nonsense mutation in pretermination codons, to improve estimates of synonymous (S) and non-synonymous (NS) sites. The accuracy of estimating dN/dS has been improved by considering the Ti/Tv ratio and nonsense substitutions in pretermination codons. We showed that applying the modified approach based on Ti/Tv ratio and pretermination codons results in higher values of dN/dS in 29 common genes of equal reading-frames between E. coli and Salmonella enterica. This study emphasizes the robustness of amino acid composition with varying codon degeneracy, as well as the pretermination codons when calculating dN/dS values.

[1]  E. Feil,et al.  Stem Region of tRNA Genes Favors Transition Substitution Towards Keto Bases in Bacteria , 2022, Journal of Molecular Evolution.

[2]  Sergei L. Kosakovsky Pond,et al.  Weak selection on synonymous codons substantially inflates dN/dS estimates in bacteria , 2021, Proceedings of the National Academy of Sciences.

[3]  A. Waris,et al.  A Web Portal to Calculate Codon Adaptation Index (CAI) with Organism Specific Reference Set of High Expression Genes for Diverse Bacteria Species , 2019 .

[4]  Daniel M Lyons,et al.  Evidence for the Selective Basis of Transition-to-Transversion Substitution Bias in Two RNA Viruses , 2017, Molecular biology and evolution.

[5]  Harry A. Thorpe,et al.  Comparative Analyses of Selection Operating on Nontranslated Intergenic Regions of Diverse Bacterial Species , 2017, Genetics.

[6]  S. Ouellette,et al.  Tryptophan Codon-Dependent Transcription in Chlamydia pneumoniae during Gamma Interferon-Mediated Tryptophan Limitation , 2016, Infection and Immunity.

[7]  Stephanie J. Spielman,et al.  The relationship between dN/dS and scaled selection coefficients. , 2015, Molecular biology and evolution.

[8]  H. Ellegren,et al.  Kr/Kc but not dN/dS correlates positively with body mass in birds, raising implications for inferring lineage-specific selection , 2014, Genome Biology.

[9]  S. K. Ray,et al.  Variable correlation of genome GC% with transfer RNA number as well as with transfer RNA diversity among bacterial groups: alpha-proteobacteria and tenericutes exhibit strong positive correlation. , 2010, Microbiological research.

[10]  J. Plotkin,et al.  The Population Genetics of dN/dS , 2008, PLoS genetics.

[11]  T. Ohama,et al.  Evolution of the mitochondrial genetic code II. Reassignment of codon AUA from isoleucine to methionine , 1989, Journal of Molecular Evolution.

[12]  M. Touchon,et al.  Similar compositional biases are caused by very different mutational effects. , 2006, Genome research.

[13]  Douglas W. Raiford,et al.  Amino acid cost and codon-usage biases in 6 prokaryotic genomes: a whole-genome analysis. , 2006, Molecular biology and evolution.

[14]  M. Kimura A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences , 1980, Journal of Molecular Evolution.

[15]  H. Kishino,et al.  Dating of the human-ape splitting by a molecular clock of mitochondrial DNA , 2005, Journal of Molecular Evolution.

[16]  Takashi Gojobori,et al.  Patterns of nucleotide substitution in pseudogenes and functional genes , 2005, Journal of Molecular Evolution.

[17]  Y. Ina,et al.  New methods for estimating the numbers of synonymous and nonsynonymous substitutions , 1995, Journal of Molecular Evolution.

[18]  L. Hurst The Ka/Ks ratio: diagnosing the form of sequence evolution. , 2002, Trends in genetics : TIG.

[19]  R. Nielsen,et al.  Codon-substitution models for detecting molecular adaptation at individual sites along specific lineages. , 2002, Molecular biology and evolution.

[20]  N. Goldman,et al.  Codon-substitution models for heterogeneous selection pressure at amino acid sites. , 2000, Genetics.

[21]  Ziheng Yang,et al.  Estimation of the Transition/Transversion Rate Bias and Species Sampling , 1999, Journal of Molecular Evolution.

[22]  D. Petrov,et al.  Patterns of nucleotide substitution in Drosophila and mammalian genomes. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Z. Yang,et al.  Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution. , 1998, Molecular biology and evolution.

[24]  C. G. Km-land An Extreme Codon Preference Strategy : Codon Reassignment , 1998 .

[25]  M. Nei,et al.  Variance and covariances of the numbers of synonymous and nonsynonymous substitutions per site. , 1994, Molecular biology and evolution.

[26]  R. Jackson,et al.  Efficient initiation of mammalian mRNA translation at a CUG codon. , 1989, Nucleic acids research.

[27]  Wen-Hsiung Li,et al.  The rate of synonymous substitution in enterobacterial genes is inversely related to codon usage bias. , 1987, Molecular biology and evolution.

[28]  M. Nei,et al.  Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. , 1986, Molecular biology and evolution.

[29]  S. Jeffery Evolution of Protein Molecules , 1979 .