On the use of nucleic acid sequences to infer early branchings in the tree of life.

Simplifying assumptions made in various tree reconstruction methods--notably rate constancy among nucleotide sites, homogeneity, and stationarity of the substitutional processes--are clearly violated when nucleotide sequences are used to infer distant relationships. Use of tree reconstruction methods based on such oversimplified assumptions can lead to misleading results, as pointed out by previous authors. In this paper, we made use of a (discretized) gamma distribution to account for variable rates of substitution among sites and built models that allowed for unequal base frequencies in different sequences. The models were nonhomogeneous Markov-process models, assuming different patterns of substitution in different parts of the tree. Data of the small-subunit rRNAs from four species were analyzed, where base frequencies were quite different among sequences and rates of substitution were highly variable at sites. Parameters in the models were estimated by maximum likelihood, and models were compared by the likelihood-ratio test. The nonhomogeneous models provided significantly better fit to the data than homogeneous models despite their involvement of many parameters. They also appeared to produce reasonable estimation of the phylogenetic tree; in particular, they seemed able to identify the root of the tree.

[1]  W F Loomis,et al.  Molecular phylogeny of Dictyostelium discoideum by protein sequence comparison. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Z. Yang,et al.  Maximum-likelihood estimation of phylogeny from DNA sequences when substitution rates differ over sites. , 1993, Molecular biology and evolution.

[3]  Mitchell L Sogin Early evolution and the origin of eukaryotes , 1992, Current Biology.

[4]  L. Jin,et al.  Limitations of the evolutionary parsimony method of phylogenetic analysis. , 1990, Molecular biology and evolution.

[5]  M. Gouy,et al.  Molecular phylogeny of Rodentia, Lagomorpha, Primates, Artiodactyla, and Carnivora and molecular clocks. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Masasuke Yoshida,et al.  Evolution of the vacuolar H+-ATPase: implications for the origin of eukaryotes. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[7]  R. Overbeek,et al.  The winds of (evolutionary) change: breathing new life into microbiology. , 1996, Journal of bacteriology.

[8]  J. Hartigan,et al.  Statistical Analysis of Hominoid Molecular Evolution , 1987 .

[9]  J. Wakeley,et al.  Substitution-rate variation among sites and the estimation of transition bias. , 1994, Molecular biology and evolution.

[10]  M. Jackson,et al.  Origin of the Metazoa , 1979, Nature.

[11]  P. Forterre,et al.  The nature of the last universal ancestor and the root of the tree of life, still open questions. , 1992, Bio Systems.

[12]  N. Goldman,et al.  Comparison of models for nucleotide substitution used in maximum-likelihood phylogenetic estimation. , 1994, Molecular biology and evolution.

[13]  T. Cavalier-smith,et al.  Kingdom protozoa and its 18 phyla. , 1993, Microbiological reviews.

[14]  O. Kandler,et al.  Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[15]  M. Sogin,et al.  Phylogenetic meaning of the kingdom concept: an unusual ribosomal RNA from Giardia lamblia. , 1989, Science.

[16]  S. Stickel,et al.  Monophyletic origins of the metazoa: an evolutionary link with fungi , 1993, Science.

[17]  M. Clegg,et al.  Evolutionary Analysis of Plant DNA Sequences , 1987, The American Naturalist.

[18]  M. A. Steel,et al.  Confidence in evolutionary trees from biological sequence data , 1993, Nature.

[19]  C. Woese,et al.  Bacterial evolution , 1987, Microbiological reviews.

[20]  J. Gillespie,et al.  RATES OF MOLECULAR EVOLUTION , 1986 .

[21]  R. H. Thomas,et al.  Reduced thermophilic bias in the 16S rDNA sequence from Thermus ruber provides further support for a relationship between Thermus and Deinococcus , 1993 .

[22]  M. Nei,et al.  Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. , 1993, Molecular biology and evolution.

[23]  R. Raff,et al.  Molecular phylogeny of the animal kingdom. , 1988, Science.

[24]  S. Osawa,et al.  Evolutionary relationship of archaebacteria, eubacteria, and eukaryotes inferred from phylogenetic trees of duplicated genes. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[25]  Masami Hasegawa,et al.  Ribosomal RNA trees misleading? , 1993, Nature.

[26]  J A Lake,et al.  Evidence that eukaryotes and eocyte prokaryotes are immediate relatives. , 1992, Science.

[27]  Takahata Naoyuki Overdispersed molecular clock at the major histocompatibility complex loci , 1991, Proceedings of the Royal Society of London. Series B: Biological Sciences.