More taxa are not necessarily better for the reconstruction of ancestral character states.

Ancestral state reconstruction is an important approach to understanding the origins and evolution of key features of different living organisms (Liberles, 2007). For example, ancestral proteins and genomic sequences have been reconstructed for investigating the origins of genes and proteins (Hillis et al., 1994; Jermann et al., 1995; Zhang and Rosenberg, 2002; Gaucher et al., 2003; Thornton et al., 2003; Blanchette et al., 2004; Cai et al., 2004; Felsenstein, 2004; Taubenberger et al., 2005). A variety of reconstruction methods, including parsimony and maximum likelihood, exist for biomolecular sequencing (Yang et al., 1995; Koshi and Goldstein, 1996; Elias and Tuller, 2007), multistate discrete data (Schultz et al., 1996; Mooers and Schluter, 1999; Pagel, 1999), and continuous data (Martins, 1999). These different reconstruction methods have been assessed by both theoretical analyses (Maddison, 1995; Yang et al., 1995) and computer simulation (Schultz et al., 1996; Zhang and Nei, 1997; Salisbury and Kim, 2001; Blanchette et al., 2004; Mooers, 2004; Williams et al., 2006). One important observation in these investigations is that the topology of the phylogenetic tree relating the extant taxa to the target ancestor has a significant influence on reconstruction accuracy. For instance, a star-like phylogeny allows the ancestral character states to be inferred more accurately than other topologies given the same number of terminal taxa under the two-state symmetric model (Schultz et al., 1996; Evans et al., 2000). For more complex models (e.g., on four-states such as DNA), the influence of topology on reconstruction accuracy is more complicated (Lucena and Haussler, 2005).

[1]  Elchanan Mossel Reconstruction on Trees: Beating the Second Eigenvalue , 2001 .

[2]  N. Grishin,et al.  Reconstruction of ancestral protein sequences and its applications , 2004, BMC Evolutionary Biology.

[3]  Mike Steel,et al.  Five surprising properties of parsimoniously colored trees , 1995 .

[4]  Tamir Tuller,et al.  Reconstruction of Ancestral Genomic Sequences Using Likelihood , 2007, J. Comput. Biol..

[5]  M Steel,et al.  Links between maximum likelihood and maximum parsimony under a simple model of site substitution. , 1997, Bulletin of mathematical biology.

[6]  A. Mooers Effects of tree shape on the accuracy of maximum likelihood-based ancestor reconstructions. , 2004, Systematic biology.

[7]  J. Taubenberger,et al.  Characterization of the 1918 influenza virus hemagglutinin and neuraminidase genes , 2001 .

[8]  S. Benner,et al.  Inferring the palaeoenvironment of ancient bacteria on the basis of resurrected proteins , 2003, Nature.

[9]  D. Schluter,et al.  LIKELIHOOD OF ANCESTOR STATES IN ADAPTIVE RADIATION , 1997, Evolution; international journal of organic evolution.

[10]  G. Churchill,et al.  THE RECONSTRUCTION OF ANCESTRAL CHARACTER STATES , 1996, Evolution; international journal of organic evolution.

[11]  M. Crisp,et al.  Do early branching lineages signify ancestral traits? , 2005, Trends in ecology & evolution.

[12]  D. Aldous Stochastic models and descriptive statistics for phylogenetic trees, from Yule to today , 2001 .

[13]  M. Nei,et al.  Accuracies of ancestral amino acid sequences inferred by the parsimony, likelihood, and distance methods , 2009, Journal of Molecular Evolution.

[14]  László A. Székely,et al.  Inverting random functions , 1999 .

[15]  Lifford,et al.  Some Limitations of Ancestral Character-State Reconstruction When Testing Evolutionary Hypotheses , 2002 .

[16]  Y. Peres,et al.  Broadcasting on trees and the Ising model , 2000 .

[17]  J. Huelsenbeck,et al.  Application and accuracy of molecular phylogenies. , 1994, Science.

[18]  D. Liberles Ancestral sequence reconstruction , 2007 .

[19]  olph,et al.  Reconstructing Ancestor States with Maximum Likelihood : Support for One-and Two-Rate Models , 2002 .

[20]  Todd H. Oakley,et al.  Comparative methods for the analysis of gene-expression evolution: an example using yeast functional genomic data. , 2005, Molecular biology and evolution.

[21]  Wayne P. Maddison,et al.  Calculating the Probability Distributions of Ancestral States Reconstructed by Parsimony on Phylogenetic Trees , 1995 .

[22]  Jeffery K. Taubenberger,et al.  Characterization of the 1918 influenza virus polymerase genes , 2005, Nature.

[23]  James O. Berger Statistical Decision Theory , 1980 .

[24]  E. Martins Estimation of ancestral states of continuous characters: A computer simulation study , 1999 .

[25]  M. Pagel The Maximum Likelihood Approach to Reconstructing Ancestral Character States of Discrete Characters on Phylogenies , 1999 .

[26]  D. Haussler,et al.  Reconstructing large regions of an ancestral mammalian genome in silico. , 2004, Genome research.

[27]  Arne Ø. Mooers,et al.  CoMET: A Mesquite package for comparing models of continuous character evolution on phylogenies , 2006, Evolutionary bioinformatics online.

[28]  M. Nei,et al.  A new method of inference of ancestral nucleotide and amino acid sequences. , 1995, Genetics.

[29]  James O. Berger,et al.  Statistical Decision Theory and Bayesian Analysis, Second Edition , 1985 .

[30]  David Crews,et al.  Resurrecting the Ancestral Steroid Receptor: Ancient Origin of Estrogen Signaling , 2003, Science.

[31]  Steven A. Benner,et al.  Reconstructing the evolutionary history of the artiodactyl ribonuclease superfamily , 1995, Nature.

[32]  D. Haussler,et al.  Counterexample to a claim about the reconstruction of ancestral character states. , 2005, Systematic Biology.

[33]  B. Salisbury,et al.  Ancestral state estimation and taxon sampling density. , 2001, Systematic biology.

[34]  W. Fitch Toward Defining the Course of Evolution: Minimum Change for a Specific Tree Topology , 1971 .

[35]  J. Berger Statistical Decision Theory and Bayesian Analysis , 1988 .

[36]  Jianzhi Zhang,et al.  Complementary advantageous substitutions in the evolution of an antiviral RNase of higher primates , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[37]  Richard A. Goldstein,et al.  Probabilistic reconstruction of ancestral protein sequences , 1996, Journal of Molecular Evolution.

[38]  Paul D. Williams,et al.  Assessing the Accuracy of Ancestral Protein Reconstruction Methods , 2006, PLoS Comput. Biol..

[39]  J. J. Flynn,et al.  Ancestral state reconstruction of body size in the Caniformia (Carnivora, Mammalia): the effects of incorporating data from the fossil record. , 2006, Systematic biology.

[40]  Olivier François,et al.  Which random processes describe the tree of life? A large-scale study of phylogenetic tree imbalance. , 2006, Systematic biology.