Positive selection on the H3 hemagglutinin gene of human influenza virus A.

The hemagglutinin (HA) gene of influenza viruses encodes the major surface antigen against which neutralizing antibodies are produced during infection or vaccination. We examined temporal variation in the HA1 domain of HA genes of human influenza A (H3N2) viruses in order to identify positively selected codons. Positive selection is defined for our purposes as a significant excess of nonsilent over silent nucleotide substitutions. If past mutations at positively selected codons conferred a selective advantage on the virus, then additional changes at these positions may predict which emerging strains will predominate and cause epidemics. We previously reported that a 38% excess of mutations occurred on the tip or terminal branches of the phylogenetic tree of 254 HA genes of influenza A (H3N2) viruses. Possible explanations for this excess include processes other than viral evolution during replication in human hosts. Of particular concern are mutations that occur during adaptation of viruses for growth in embryonated chicken eggs in the laboratory. Because the present study includes 357 HA sequences (a 40% increase), we were able to separately analyze those mutations assigned to internal branches. This allowed us to determine whether mutations on terminal and internal branches exhibit different patterns of selection at the level of individual codons. Additional improvements over our previous analysis include correction for a skew in the distribution of amino acid replacements across codons and analysis of a population of phylogenetic trees rather than a single tree. The latter improvement allowed us to ascertain whether minor variation in tree structure had a significant effect on our estimate of the codons under positive selection. This method also estimates that 75.6% of the nonsilent mutations are deleterious and have been removed by selection prior to sampling. Using the larger data set and the modified methods, we confirmed a large (40%) excess of changes on the terminal branches. We also found an excess of changes on branches leading to egg-grown isolates. Furthermore, 9 of the 18 amino acid codons, identified as being under positive selection to change when we used only mutations assigned to internal branches, were not under positive selection on the terminal branches. Thus, although there is overlap between the selected codons on terminal and internal branches, the codons under positive selection on the terminal branches differ from those on the internal branches. We also observed that there is an excess of positively selected codons associated with the receptor-binding site and with the antibody-combining sites. This association may explain why the positively selected codons are restricted in their distribution along the sequence. Our results suggest that future studies of positive selection should focus on changes assigned to the internal branches, as certain of these changes may have predictive value for identifying future successful epidemic variants.

[1]  J. Yewdell,et al.  The antigenic structure of the influenza virus A/PR/8/34 hemagglutinin (H1 subtype) , 1982, Cell.

[2]  N. Cox,et al.  Comparison of 10 influenza A (H1N1 and H3N2) haemagglutinin sequences obtained directly from clinical specimens to those of MDCK cell- and egg-grown viruses. , 1993, The Journal of general virology.

[3]  I. Wilson,et al.  Single amino acid substitutions in influenza haemagglutinin change receptor binding specificity , 1983, Nature.

[4]  C. Naeve,et al.  Egg fluids and cells of the chorioallantoic membrane of embryonated chicken eggs can select different variants of influenza A (H3N2) viruses. , 1995, Virology.

[5]  A. Kendal,et al.  Identification of the binding sites to monoclonal antibodies on A/USSR/90/77 (H1N1) hemagglutinin and their involvement in antigenic drift in H1N1 influenza viruses. , 1983, Virology.

[6]  J. Robertson Clinical influenza virus and the embryonated Hen's egg , 1993 .

[7]  C. Aquadro,et al.  Sequence evolution within populations under multiple types of mutation. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[8]  W. Messier,et al.  Episodic adaptive evolution of primate lysozymes , 1997, Nature.

[9]  C. Luo,et al.  A new method for estimating synonymous and nonsynonymous rates of nucleotide substitution considering the relative likelihood of nucleotide and codon changes. , 1985, Molecular biology and evolution.

[10]  W. Fitch,et al.  Long term trends in the evolution of H(3) HA1 human influenza type A. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[11]  R. Webster,et al.  Codominant mixtures of viruses in reference strains of influenza virus due to host cell variation. , 1994, Virology.

[12]  Conrad C. Huang,et al.  The MIDAS display system , 1988 .

[13]  I. Wilson,et al.  Structural identification of the antibody-binding sites of Hong Kong influenza haemagglutinin and their involvement in antigenic variation , 1981, Nature.

[14]  G. B. Golding The detection of deleterious selection using ancestors inferred from a phylogenetic history. , 1987, Genetical research.

[15]  T Gojobori,et al.  Statistical analysis of nucleotide sequences of the hemagglutinin gene of human influenza A viruses. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Golding Gb The detection of deleterious selection using ancestors inferred from a phylogenetic history. , 1987 .

[17]  I. Wilson,et al.  Structural basis of immune recognition of influenza virus hemagglutinin. , 1990, Annual review of immunology.

[18]  W. Fitch,et al.  Positive Darwinian evolution in human influenza A viruses. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Y. Kawaoka,et al.  Differences in sialic acid-galactose linkages in the chicken egg amnion and allantois influence human influenza virus receptor specificity and variant selection , 1997, Journal of virology.

[20]  M. Nei,et al.  Pattern of nucleotide substitution at major histocompatibility complex class I loci reveals overdominant selection , 1988, Nature.

[21]  A. Hughes Positive selection and interallelic recombination at the merozoite surface antigen-1 (MSA-1) locus of Plasmodium falciparum. , 1992, Molecular biology and evolution.

[22]  Z. Yang,et al.  Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution. , 1998, Molecular biology and evolution.

[23]  K. Mullis,et al.  Primer-directed enzymatic amplification of DNA with a thermostable DNA polymerase. , 1988, Science.

[24]  J. Skehel,et al.  The structure and function of the hemagglutinin membrane glycoprotein of influenza virus. , 1987, Annual review of biochemistry.