NucAmino: a nucleotide to amino acid alignment optimized for virus gene sequences

BackgroundCurrent nucleotide-to-amino acid alignment software programs were developed primarily for detecting gene exons within eukaryotic genomes and were therefore optimized for speed across long genetic sequences. We developed a nucleotide-to-amino acid alignment program NucAmino optimized for virus sequencing.ResultsNucAmino is an open source program written in the high-level language Go. NucAmino is more likely to align codons flush with a reference sequence’s amino acids and can be modified to facilitate the placement of insertions and deletions at specific positions. We compared NucAmino to the nucleotide to amino acid alignment program Local Alignment Program (LAP) using 115,118 human immunodeficiency virus type 1 (HIV-1) protease, reverse transcriptase, and integrase sequences—three genes that are commonly sequenced in clinical laboratories. Discordances between NucAmino and LAP occurred in 512 (16.9%) of the 3,029 sequences containing gaps but in none of 112,910 sequences without gaps. For 242 of the sequences with discordances, NucAmino produced an alignment that was preferable to that found by LAP in that it was more likely to codon align insertions and deletions and to facilitate the placement of an important drug-resistance associated insertion at the position at which most laboratories expect it to occur.ConclusionsNucAmino is a nucleotide-to-amino acid alignment program with several advantages for clinical laboratories performing virus sequencing compared with older programs designed for gene finding.

[1]  R. Durbin,et al.  Using GeneWise in the Drosophila annotation experiment. , 2000, Genome research.

[2]  J. Zhang,et al.  Methods for comparing a DNA sequence with a protein sequence , 1996, Comput. Appl. Biosci..

[3]  J. H. Chen,et al.  Molecular epidemiology and divergence of HIV type 1 protease codon 35 inserted strains among treatment-naive patients in Hong Kong. , 2008, AIDS research and human retroviruses.

[4]  Bryan Chan,et al.  Human immunodeficiency virus reverse transcriptase and protease sequence database , 2003, Nucleic Acids Res..

[5]  J. Mendieta,et al.  Thymidine Analogue Excision and Discrimination Modulated by Mutational Complexes Including Single Amino Acid Deletions of Asp-67 or Thr-69 in HIV-1 Reverse Transcriptase* , 2011, The Journal of Biological Chemistry.

[6]  D. Katzenstein,et al.  A 6-basepair insert in the reverse transcriptase gene of human immunodeficiency virus type 1 confers resistance to multiple nucleoside inhibitors. , 1998, The Journal of clinical investigation.

[7]  V. Calvez,et al.  Genotypic and Phenotypic Resistance Patterns of Human Immunodeficiency Virus Type 1 Variants with Insertions or Deletions in the Reverse Transcriptase (RT): Multicenter Study of Patients Treated with RT Inhibitors , 2001, Antimicrobial Agents and Chemotherapy.

[8]  T. F. Rinke de Wit,et al.  Automated sequence analysis and editing software for HIV drug resistance testing. , 2012, Journal of clinical virology : the official publication of the Pan American Society for Clinical Virology.

[9]  Eugene W. Myers,et al.  Optimal alignments in linear space , 1988, Comput. Appl. Biosci..