It is common to represent taxonomic hierarchies of related objects (such as similar plant or animal species or languages of the same family) by rooted trees with labelled terminal vertices which represent the objects. The multivariate data comparing numerous characteristics of the objects is first reduced to indices of similarity (or more often of dissimilarity) between each pair of objects. These are used to classify the objects into groups which are then depicted on a tree. This paper shows that an unrooted tree with labelled terminal vertices may provide a better representation of the relationships between the objects because the similarity indices are required to conform to fewer restrictions. Also for a given number of terminal vertices, there are fewer unrooted than rooted trees so that studies using probability distributions of trees or seeking the most suitable tree to represent the data are more practicable. NUMERICAL TAXONOMY; GRAPH THEORY; SIMILARITY; FAMILY TREES; REPRESENTATION; NUMBERS OF TREES; PROBABILITIES OF TREES 1. Terminology We will mainly be concerned with trees with at least three vertices such that each vertex has degree at most three and at most one vertex (the root) has degree two. A vertex of degree one is a terminal vertex and all others are inner vertices. An inner edge is one connecting two inner vertices.
[1]
Frank Harary,et al.
The number of homeomorphically irreducible trees, and other species
,
1959
.
[2]
J. Hartigan.
REPRESENTATION OF SIMILARITY MATRICES BY TREES
,
1967
.
[3]
S. C. Johnson.
Hierarchical clustering schemes
,
1967,
Psychometrika.
[4]
L. Cavalli-Sforza,et al.
PHYLOGENETIC ANALYSIS: MODELS AND ESTIMATION PROCEDURES
,
1967,
Evolution; international journal of organic evolution.
[5]
J. Farris.
Methods for Computing Wagner Trees
,
1970
.
[6]
R. M. Cormack,et al.
A Review of Classification
,
1971
.
[7]
E. Harding.
The probabilities of rooted tree-shapes generated by random bifurcation
,
1971,
Advances in Applied Probability.