The Effect of Recombination on the Accuracy of Phylogeny Estimation

Phylogenetic studies based on DNA sequences typically ignore the potential occurrence of recombination, which may produce different alignment regions with different evolutionary histories. Traditional phylogenetic methods assume that a single history underlies the data. If recombination is present, can we expect the inferred phylogeny to represent any of the underlying evolutionary histories? We examined this question by applying traditional phylogenetic reconstruction methods to simulated recombinant sequence alignments. The effect of recombination on phylogeny estimation depended on the relatedness of the sequences involved in the recombinational event and on the extent of the different regions with different phylogenetic histories. Given the topologies examined here, when the recombinational event was ancient, or when recombination occurred between closely related taxa, one of the two phylogenies underlying the data was generally inferred. In this scenario, the evolutionary history corresponding to the majority of the positions in the alignment was generally recovered. Very different results were obtained when recombination occurred recently among divergent taxa. In this case, when the recombinational breakpoint divided the alignment in two regions of similar length, a phylogeny that was different from any of the true phylogenies underlying the data was inferred.

[1]  T. Jukes CHAPTER 24 – Evolution of Protein Molecules , 1969 .

[2]  P. H. A. Sneath,et al.  Detecting Evolutionary Incompatibilities From Protein Sequences , 1975 .

[3]  P. H. A. Sneath,et al.  Cladistic Representation of Reticulate Evolution , 1975 .

[4]  R. Hudson Properties of a neutral allele model with intragenic recombination. , 1983, Theoretical population biology.

[5]  S. Tavaré Some probabilistic and statistical problems in the analysis of DNA sequences , 1986 .

[6]  M. Sanderson,et al.  RECONSTRUCTION OF ORGANISMAL AND GENE PHYLOGENIES FROM DATA ON MULTIGENE FAMILIES: CONCERTED EVOLUTION, HOMOPLASY, AND CONFIDENCE , 1992 .

[7]  P. Sharp,et al.  Recombination in HIV-1 , 1995, Nature.

[8]  W. Maddison Gene Trees in Species Trees , 1997 .

[9]  E. Holmes,et al.  A likelihood method for the detection of selection and recombination using nucleotide sequences. , 1997, Molecular biology and evolution.

[10]  B. Spratt,et al.  Interspecies recombination, and phylogenetic distortions, within the glutamine synthetase and shikimate dehydrogenase genes of Neisseria meningitidis and commensal Neisseria species , 1997, Molecular microbiology.

[11]  G. McGuire,et al.  A graphical method for detecting recombination in phylogenetic data sets. , 1997, Molecular biology and evolution.

[12]  J. Wiens Combining data sets with different phylogenetic histories. , 1998, Systematic biology.

[13]  E. Holmes,et al.  Widespread intra-serotype recombination in natural populations of dengue virus. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[14]  E. Holmes,et al.  The influence of recombination on the population structure and evolution of the human pathogen Neisseria meningitidis. , 1999, Molecular biology and evolution.

[15]  G. Drouin,et al.  Detecting and characterizing gene conversions between multigene family members. , 1999, Molecular biology and evolution.

[16]  Bin Ma,et al.  From Gene Trees to Species Trees , 2000, SIAM J. Comput..

[17]  J. Hein,et al.  Consequences of recombination on traditional phylogenetic analysis. , 2000, Genetics.

[18]  M. Worobey,et al.  A novel approach to detecting and measuring recombination: new insights into evolution in viruses, bacteria, and mitochondria. , 2001, Molecular biology and evolution.

[19]  J. Hein,et al.  A simulation study of the reliability of recombination detection methods. , 2001, Molecular biology and evolution.

[20]  K. Crandall,et al.  Intraspecific gene genealogies: trees grafting into networks. , 2001, Trends in ecology & evolution.

[21]  K. Crandall,et al.  Evaluation of methods for detecting recombination from DNA sequences: Computer simulations , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[22]  C. Brown,et al.  The power to detect recombination using the coalescent. , 2001, Molecular biology and evolution.