Positive Selection Detection in 40,000 HumanImmunodeficiency Virus (HIV) Type 1 Sequences Automatically IdentifiesDrug Resistance and Positive Fitness Mutations in HIV Proteaseand ReverseTranscriptase

ABSTRACT Drug resistance is a major problem in the treatment of AIDS, due to the very high mutation rate of human immunodeficiency virus (HIV) and subsequent rapid development of resistance to new drugs. Identification of mutations associated with drug resistance is critical for both individualized treatment selection and new drug design. We have performed an automated mutation analysis of HIV Type 1 (HIV-1) protease and reverse transcriptase (RT) from approximately 40,000 AIDS patient plasma samples sequenced by Specialty Laboratories Inc. from 1999 to mid-2002. This data set provides a nearly complete mutagenesis of HIV protease and enables the calculation of statistically significant Ka/Ks values for each individual amino acid mutation in protease and RT. Positive selection (i.e., a Ka/Ks ratio of> 1, indicating increased reproductive fitness) detected 19 of 23 known drug-resistant mutation positions in protease and 20 of 34 such positions in RT. We also discovered 163 new amino acid mutations in HIV protease and RT that are strong candidates for drug resistance or fitness. Our results match available independent data on protease mutations associated with specific drug treatments and mutations with positive reproductive fitness, with high statistical significance (the P values for the observed matches to occur by random chance are 10−5.2 and 10−16.6, respectively). Our mutation analysis provides a valuable resource for AIDS research and will be available to academic researchers upon publication at http://www.bioinformatics.ucla.edu/HIV . Our data indicate that positive selection mapping is an analysis that can yield powerful insights from high-throughput sequencing of rapidly mutating pathogens.

[1]  Marianne Manchester,et al.  Complete mutagenesis of the HIV-1 protease , 1989, Nature.

[2]  J. Coffin,et al.  HIV population dynamics in vivo: implications for genetic variation, pathogenesis, and therapy , 1995, Science.

[3]  L. M. Mansky,et al.  Lower in vivo mutation rate of human immunodeficiency virus type 1 than that predicted from the fidelity of purified reverse transcriptase , 1995, Journal of virology.

[4]  A. Hughes,et al.  Natural selection on the gag, pol, and env genes of human immunodeficiency virus 1 (HIV-1). , 1995, Molecular biology and evolution.

[5]  B. Larder,et al.  Mutations in Retroviral Genes Associated with Drug Resistance , 1996 .

[6]  A. Perelson,et al.  HIV-1 Dynamics in Vivo: Virion Clearance Rate, Infected Cell Life-Span, and Viral Generation Time , 1996, Science.

[7]  H. Mitsuya,et al.  HIV-1 acquires resistance to two classes of antiviral drugs through homologous recombination. , 1997, Antiviral research.

[8]  W. Fitch,et al.  Long term trends in the evolution of H(3) HA1 human influenza type A. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[9]  R. Nielsen,et al.  Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene. , 1998, Genetics.

[10]  P. Green,et al.  Base-calling of automated sequencer traces using phred. I. Accuracy assessment. , 1998, Genome research.

[11]  T Gojobori,et al.  A method for detecting positive selection at single amino acid sites. , 1999, Molecular biology and evolution.

[12]  A. Telenti,et al.  Polymorphism of HIV type 1 gag p7/p1 and p1/p6 cleavage sites: clinical significance and implications for resistance to protease inhibitors. , 2000, AIDS research and human retroviruses.

[13]  P. Sharp,et al.  Origins and evolution of AIDS viruses: estimating the time-scale. , 2000, Biochemical Society transactions.

[14]  D. Persing,et al.  Comparative Evaluation of Three Human Immunodeficiency Virus Genotyping Systems: the HIV-GenotypR Method, the HIV PRT GeneChip Assay, and the HIV-1 RT Line Probe Assay , 2000, Journal of Clinical Microbiology.

[15]  N. Goldman,et al.  Codon-substitution models for heterogeneous selection pressure at amino acid sites. , 2000, Genetics.

[16]  Christopher J. Lee,et al.  Genome-wide analysis of single-nucleotide polymorphisms in human expressed sequences , 2000, Nature Genetics.

[17]  T. Merigan,et al.  Variants Other than Aspartic Acid at Codon 69 of the Human Immunodeficiency Virus Type 1 Reverse Transcriptase Gene Affect Susceptibility to Nucleoside Analogs , 2001, Antimicrobial Agents and Chemotherapy.

[18]  Richard A. Rode,et al.  Identification of Genotypic Changes in Human Immunodeficiency Virus Protease That Correlate with Reduced Susceptibility to the Protease Inhibitor Lopinavir among Viral Isolates from Protease Inhibitor-Experienced Patients , 2001, Journal of Virology.

[19]  J Licinio,et al.  Single nucleotide polymorphism identification in candidate gene systems of obesity , 2001, The Pharmacogenomics Journal.

[20]  R. Shafer,et al.  A Guide to HIV-1 Reverse Transcriptase and Protease Sequencing for Drug Resistance Studies. , 2001, HIV sequence compendium.

[21]  B. Korber,et al.  HIV sequence compendium 2002 , 2002 .

[22]  L. Peltonen,et al.  Efficient discovery of single-nucleotide polymorphisms in coding regions of human genes , 2002, The Pharmacogenomics Journal.

[23]  Christopher J. Lee,et al.  Multiple sequence alignment using partial order graphs , 2002, Bioinform..

[24]  C. Moore,et al.  Evidence of HIV-1 Adaptation to HLA-Restricted Immune Responses at a Population Level , 2002, Science.

[25]  Bradley J. Betts,et al.  Human immunodeficiency virus reverse transcriptase and protease sequence database. , 2003, Nucleic acids research.

[26]  Thomas D. Wu,et al.  Mutation Patterns and Structural Correlates in Human Immunodeficiency Virus Type 1 Protease following Different Protease Inhibitor Treatments , 2003, Journal of Virology.

[27]  Wen-Hsiung Li Unbiased estimation of the rates of synonymous and nonsynonymous substitution , 2006, Journal of Molecular Evolution.