Properties of supertree methods in the consensus setting.

Supertree methods (SMs) are techniques for inferring (super)trees from sets of (input) trees. Classical consensus methods are SMs that were designed for the special case where input trees have identical leaf sets. The need for methods that can also combine information from input trees with nonidentical leaf sets has led to many alternative SMs. Some of these SMs are generalizations from conservative consensus methods (strict and semistrict) that do not resolve input tree conflicts (e.g., Gordon, 1986; Goloboff and Pol, 2002). Our focus here is on more liberal SMs, those capable of resolving conflicts among input trees. Liberal SMs comprise the majority of described methods and have been the most used in practice by biologists seeking well-resolved phylogenies. However, today's practitioners are confronted with choosing among a potentially bewildering array of liberal SM(s). Wilkinson et al. (2004) argued that nonarbitrary, rational choice among liberal SMs would best be guided by knowledge of the comparative accuracy of alternative methods. However, there have been few comparisons of accuracy using simulations (Bininda-Emonds and Sanderson, 2001; Chen et al., 2003; Eulenstein et al., 2004; Lapointe and Levasseur, 2004; Ross and Rodrigo, 2004) over a restricted range of conditions. Thus, Wilkinson et al. (2004) also discussed a number of properties that they suggested provide surrogates for accuracy and might therefore be expected of any SM. One of these, that input tree conflicts should be resolved independently of input tree shape, was investigated by Wilkinson et al. (2005a), who used a simple example (Fig. 1) and simulations to demonstrate input tree shape effects with 8 of the 14 methods they investigated, including the widely used matrix representation with parsimony (MRP) of Baum (1992) and Ragan (1992). Here we introduce a class of sub-Pareto properties that we argue constitute particularly weak expectations of how accurate SMs should handle consensus problems. We then use the same example to substantiate and extend results reported in Wilkinson et al. (2004) and to demonstrate that seven of the liberal SMs that are sensitive to input tree shape also lack some seemingly reasonable consensus properties. Lastly, we consider the relevance of these properties to choice and PRELIMINARIES

[1]  Oliver Eulenstein,et al.  The shape of supertrees to come: tree shape related properties of fourteen supertree methods. , 2005, Systematic biology.

[2]  Allen G. Rodrigo,et al.  An Assessment of Matrix Representation with Compatibility in Supertree Construction , 2004 .

[3]  Mark D. Wilkinson,et al.  A view of supertree methods , 2001, Bioconsensus.

[4]  David Fernández-Baca,et al.  Rainbow: a toolbox for phylogenetic supertree construction and analysis , 2004, Bioinform..

[5]  Mark Wilkinson,et al.  Measuring support and finding unsupported relationships in supertrees. , 2005, Systematic biology.

[6]  David Bryant,et al.  A classification of consensus methods for phylogenetics , 2001, Bioconsensus.

[7]  David Fernández-Baca,et al.  Performance of flip supertree construction with a heuristic algorithm. , 2004, Systematic biology.

[8]  M Steel,et al.  Simple but fundamental limitations on supertree and consensus tree methods. , 2000, Systematic biology.

[9]  Joseph L. Thorley,et al.  RadCon: phylogenetic tree comparison and consensus , 2000, Bioinform..

[10]  Roderic D. M. Page,et al.  Modified Mincut Supertrees , 2002, WABI.

[11]  E. N. Adams,et al.  N-trees as nestings: Complexity, similarity, and consensus , 1986 .

[12]  Roderic D. M. Page,et al.  Tangled Tales from Multiple Markers , 2004 .

[13]  M. Ragan Phylogenetic inference based on matrix representation of trees. , 1992, Molecular phylogenetics and evolution.

[14]  Fred R. McMorris,et al.  Axiomatic Consensus Theory in Group Choice and Biomathematics , 2003 .

[15]  James O. McInerney,et al.  Some Desiderata for Liberal Supertrees , 2004 .

[16]  A. D. Gordon Consensus supertrees: The synthesis of rooted trees containing overlapping sets of labeled leaves , 1986 .

[17]  Mark Wilkinson,et al.  Matrix representation with parsimony, taxonomic congruence, and total evidence. , 2002, Systematic biology.

[18]  François-Joseph Lapointe,et al.  Everything You always wanted to Know about the Average Consensus, and More , 2004 .

[19]  M. Wilkinson Common Cladistic Information and its Consensus Representation: Reduced Adams and Reduced Cladistic Consensus Trees and Profiles , 1994 .

[20]  Charles Semple,et al.  A supertree method for rooted trees , 2000, Discret. Appl. Math..

[21]  David Fernández-Baca,et al.  Flipping: A supertree construction method , 2001, Bioconsensus.

[22]  M J Sanderson,et al.  Assessment of the accuracy of matrix representation with parsimony analysis supertree construction. , 2001, Systematic biology.

[23]  Andy Purvis,et al.  A species-level phylogenetic supertree of marsupials , 2004 .

[24]  O. Bininda-Emonds,et al.  Novel versus unsupported clades: assessing the qualitative support for clades in MRP supertrees. , 2003, Systematic biology.

[25]  B. Baum Combining trees as a way of combining data sets for phylogenetic inference, and the desirability of combining gene trees , 1992 .

[26]  Diego Pol,et al.  Semi‐strict supertrees , 2002, Cladistics : the international journal of the Willi Hennig Society.

[27]  O. Bininda-Emonds Phylogenetic Supertrees: Combining Information To Reveal The Tree Of Life , 2004 .

[28]  James O. McInerney,et al.  Clann: investigating phylogenetic information through supertree analyses , 2005, Bioinform..

[29]  Pablo A. Goloboff,et al.  Minority rule supertrees? MRP, Compatibility, and Minimum Flip may display the least frequent groups , 2005 .

[30]  Fred R. McMorris,et al.  Axiomatics in group choice and bioconsensus , 2001, Bioconsensus.