ABSTRACT In the last few years, genomic sequence data have become publicly available for thousands of influenza A virus strains, including the 1918 pandemic strain, and for hundreds of isolates of the avian influenza virus H5N1, which is causing an increasing number of human fatalities. This large quantity of sequence data allows comparative genomics of the human and avian versions of the virus. We find that the nucleotide compositions of influenza A viruses infecting the two hosts are sufficiently different that we can determine the host with almost 100% accuracy. This assignment works at the segment level, which allows us to reconstruct the reassortment history of individual segments within each strain. We suggest that the different nucleotide compositions can be explained by a host-dependent mutation bias. To support this idea, we estimate the fixation rates for the different polymerase segments and the ratios of synonymous to nonsynonymous changes. Additionally, we provide evidence supporting the hypothesis that the H1N1 influenza virus entered the human population just prior to the 1918 outbreak, with 1910 as the earliest bound.
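As a rough illustration of what host assignment from nucleotide composition could look like, the sketch below computes the base frequencies of a segment and assigns it to the host whose average composition is nearest. This is not the method of the paper: the nearest-centroid rule, the function names (`composition`, `centroid`, `classify`), and the toy sequences are all assumptions made for illustration. The sequences are arbitrary strings, not real viral data, and do not reflect the direction of any real compositional bias.

```python
from collections import Counter
import math

BASES = "ACGU"

def composition(seq):
    """Return the fractions of A, C, G, U in a sequence (T is counted as U)."""
    seq = seq.upper().replace("T", "U")
    counts = Counter(b for b in seq if b in BASES)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A/C/G/U bases")
    return [counts[b] / total for b in BASES]

def centroid(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]

def classify(seq, centroids):
    """Assign seq to the host whose mean composition is closest (Euclidean)."""
    comp = composition(seq)
    return min(centroids, key=lambda host: math.dist(comp, centroids[host]))

# Hypothetical toy training data: arbitrary strings, NOT real influenza segments.
training = {
    "human": ["AAUAAUGAUA", "UAAAUAAGAU"],
    "avian": ["GCGGCAGCCG", "CGGCGACGGC"],
}
centroids = {
    host: centroid([composition(s) for s in seqs])
    for host, seqs in training.items()
}

print(classify("AUAAUAAGAA", centroids))
```

Applying such a rule to each of a strain's segments separately would give a per-segment host label, which is the kind of information a reassortment history draws on.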