Invariable sites models and their use in phylogeny reconstruction.

Phylogenetic inference is well known to be problematic if both long and short branches occur together in the underlying tree. With biological data, correcting for this problem may require simultaneous consideration for both substitution biases and rate heterogeneity between lineages and across sequence positions. A particular form of the latter is the presence of invariable sites, which are well known to mislead estimation of genetic divergences. Here we describe a capture-recapture method to estimate the proportion of invariable sites in an alignment of amino acids or nucleotides. We use it to investigate phylogenetic signals in 18S ribosomal DNA sequences from Holometabolus insects. Our results suggest that, as taxa diverged, their 18S rDNA sequences have altered in both their distribution of sites that can vary as well as in their base compositions.

[1]  Daniel H. Huson,et al.  SplitsTree-a program for analyzing and visualizing evolutionary data , 1997 .

[2]  J. Lake,et al.  Reconstructing evolutionary trees from DNA and protein sequences: paralinear distances. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[3]  N. Saitou,et al.  Relative Efficiencies of the Fitch-Margoliash, Maximum-Parsimony, Maximum-Likelihood, Minimum-Evolution, and Neighbor-joining Methods of Phylogenetic Tree Construction in Obtaining the Correct Tree , 1989 .

[4]  M. Miyamoto,et al.  Testing the covarion hypothesis of molecular evolution. , 1995, Molecular biology and evolution.

[5]  D Penny,et al.  Hadamard conjugations and modeling sequence evolution with unequal rates across sites. , 1997, Molecular phylogenetics and evolution.

[6]  J. Huelsenbeck Systematic bias in phylogenetic analysis: is the Strepsiptera problem solved? , 1998, Systematic biology.

[7]  H. Philippe,et al.  How good are deep phylogenetic trees? , 1998, Current opinion in genetics & development.

[8]  Michael D. Hendy,et al.  A Framework for the Quantitative Study of Evolutionary Trees , 1989 .

[9]  W C Wheeler,et al.  The Strepsiptera problem: phylogeny of the holometabolous insect orders inferred from 18S and 28S ribosomal DNA sequences and morphology. , 1997, Systematic biology.

[10]  A. Dress,et al.  Split decomposition: a new and useful approach to phylogenetic analysis of distance data. , 1992, Molecular phylogenetics and evolution.

[11]  Z. Yang,et al.  Among-site rate variation and its impact on phylogenetic analyses. , 1996, Trends in ecology & evolution.

[12]  D. Tautz,et al.  An episodic change of rDNA nucleotide substitution rate has occurred during the emergence of the insect order Diptera. , 1997, Molecular biology and evolution.

[13]  M. A. Steel,et al.  Confidence in evolutionary trees from biological sequence data , 1993, Nature.

[14]  M. Steel,et al.  A covariotide model explains apparent phylogenetic structure of oxygenic photosynthetic lineages. , 1998, Molecular biology and evolution.

[15]  Daniel H. Huson,et al.  SplitsTree: analyzing and visualizing evolutionary data , 1998, Bioinform..

[16]  M. Steel Recovering a tree from the leaf colourations it generates under a Markov model , 1994 .

[17]  J. Huelsenbeck Is the Felsenstein zone a fly trap? , 1997, Systematic biology.

[18]  G A Churchill,et al.  Sample size for a phylogenetic inference. , 1992, Molecular biology and evolution.

[19]  D Penny,et al.  Evolution of chlorophyll and bacteriochlorophyll: the problem of invariant sites in sequence analysis. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[20]  M. Steel,et al.  Recovering evolutionary trees under a more realistic model of sequence evolution. , 1994, Molecular biology and evolution.

[21]  M. Steel,et al.  Corrected Parsimony, Minimum Evolution, and Hadamard Conjugations , 1996 .

[22]  Junhyong Kim,et al.  GENERAL INCONSISTENCY CONDITIONS FOR MAXIMUM PARSIMONY: EFFECTS OF BRANCH LENGTHS AND INCREASING NUMBERS OF TAXA , 1996 .

[23]  J. Whitfield,et al.  Hierarchical analysis of variation in the mitochondrial 16S rRNA gene among Hymenoptera. , 1998, Molecular biology and evolution.

[24]  R. Crozier,et al.  Unbiased Estimation of Symmetrical Directional Mutation Pressure from Protein-Coding DNA , 1996, Journal of Molecular Evolution.

[25]  Peter J. Waddell,et al.  Statistical methods of phylogenetic analysis : including Hadamard conjugations, LogDet transforms and maximum likelihood : a thesis presented in partial fulfilment of the requirements for the degree of Ph.D. in Biology at Massey University , 1995 .

[26]  Masami Hasegawa,et al.  Ribosomal RNA trees misleading? , 1993, Nature.

[27]  W. Fitch,et al.  Evidence from nuclear sequences that invariable sites should be considered when sequence divergence is calculated. , 1989, Molecular biology and evolution.

[28]  P J Waddell,et al.  Using novel phylogenetic methods to evaluate mammalian mtDNA, including amino acid-invariant sites-LogDet plus site stripping, to detect internal conflicts in the data, with special reference to the positions of hedgehog, armadillo, and elephant. , 1999, Systematic biology.

[29]  J. Felsenstein Cases in which Parsimony or Compatibility Methods will be Positively Misleading , 1978 .