Reconstructing genealogies of serial samples under the assumption of a molecular clock using serial-sample UPGMA.

Reconstruction of evolutionary relationships from noncontemporaneous molecular samples provides a new challenge for phylogenetic reconstruction methods. With recent biotechnological advances there has been an increase in molecular sequencing throughput, and the potential to obtain serial samples of sequences from populations, including rapidly evolving pathogens, is fast being realized. A new method called the serial-sample unweighted pair grouping method with arithmetic means (sUPGMA) is presented that reconstructs a genealogy or phylogeny of sequences sampled serially in time using a matrix of pairwise distances. The resulting tree depicts the terminal lineages of each sample ending at a different level consistent with the sample's temporal order. Since sUPGMA is a variant of UPGMA, it will perform best when sequences have evolved at a constant rate (i.e., according to a molecular clock). On simulated data, this new method performs better than standard cluster analysis under a variety of longitudinal sampling strategies. Serial-sample UPGMA is particularly useful for analysis of longitudinal samples of viruses and bacteria, as well as ancient DNA samples, with the minimal requirement that samples of sequences be ordered in time.

[1]  E. Holmes,et al.  Convergent and divergent sequence evolution in the surface envelope glycoprotein of human immunodeficiency virus type 1 within a single infected patient. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[2]  T. Jukes CHAPTER 24 – Evolution of Protein Molecules , 1969 .

[3]  J. Margolick,et al.  Consistent Viral Evolutionary Changes Associated with the Progression of Human Immunodeficiency Virus Type 1 Infection , 1999, Journal of Virology.

[4]  T. Leitner,et al.  The molecular clock of HIV-1 unveiled through analysis of a known transmission history. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[5]  O. Pybus,et al.  An integrated framework for the inference of viral population history from reconstructed genealogies. , 2000, Genetics.

[6]  Steven M. Wolinsky,et al.  Adaptive Evolution of Human Immunodeficiency Virus-Type 1 During the Natural Course of Infection , 1996, Science.

[7]  F. Tajima Evolutionary relationship of DNA sequences in finite populations. , 1983, Genetics.

[8]  E. Holmes,et al.  Inferring population history from molecular phylogenies. , 1995, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[9]  E. G. Shpaer,et al.  Coalescent estimates of HIV-1 generation time in vivo. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[10]  D. Robinson,et al.  Comparison of phylogenetic trees , 1981 .

[11]  H. Kishino,et al.  Estimating the rate of evolution of the rate of molecular evolution. , 1998, Molecular biology and evolution.

[12]  T Gojobori,et al.  Molecular clock of viral evolution, and the neutral theory. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[13]  J. Felsenstein,et al.  Estimating effective population size from samples of sequences: inefficiency of pairwise and segregating sites as compared to phylogenetic estimates. , 1992, Genetical research.

[14]  B. Epperson Gene genealogies in geographically structured populations. , 1999, Genetics.

[15]  Y. Fu,et al.  A phylogenetic estimator of effective population size or mutation rate. , 1994, Genetics.

[16]  J. Huelsenbeck,et al.  A compound poisson process for relaxing the molecular clock. , 2000, Genetics.

[17]  K. Crandall The evolution of HIV , 1999 .

[18]  J. Kingman On the genealogy of large populations , 1982, Journal of Applied Probability.

[19]  A. J. Brown,et al.  Analysis of HIV-1 env gene sequences reveals evidence for a low effective number in the viral population. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[20]  A. Perelson,et al.  HIV-1 Dynamics in Vivo: Virion Clearance Rate, Infected Cell Life-Span, and Viral Generation Time , 1996, Science.