Automated Construction of Classifications: Conceptual Clustering Versus Numerical Taxonomy

A method for automated construction of classifications called conceptual clustering is described and compared to methods used in numerical taxonomy. This method arranges objects into classes representing certain descriptive concepts, rather than into classes defined solely by a similarity metric in some a priori defined attribute space. A specific form of the method is conjunctive conceptual clustering, in which descriptive concepts are conjunctive statements involving relations on selected object attributes and optimized according to an assumed global criterion of clustering quality. The method, implemented in program CLUSTER/2, is tested together with 18 numerical taxonomy methods on two exemplary problems: 1) a construction of a classification of popular microcomputers and 2) the reconstruction of a classification of selected plant disease categories. In both experiments, the majority of numerical taxonomy methods (14 out of 18) produced results which were difficult to interpret and seemed to be arbitrary. In contrast to this, the conceptual clustering method produced results that had a simple interpretation and corresponded well to solutions preferred by people.

[1]  H. Ross Principles of Numerical Taxonomy , 1964 .

[2]  渡辺 慧,et al.  Knowing and guessing : a quantitative study of inference and information , 1969 .

[3]  Michael R. Anderberg,et al.  Cluster Analysis for Applications , 1973 .

[4]  R. Michalski Variable-Valued Logic: System VL1 , 1974 .

[5]  Ryszard S. Michalski,et al.  Variable-Valued Logic and Its Applications to Pattern Recognition and Machine Learning , 1975 .

[6]  Ryszard S. Michalski,et al.  Selection of Most Representative Training Examples and Incremental Generation of VL1 Hypotheses: The Underlying Methodology and the Description of Programs ESEL and AQ11 , 1978 .

[7]  Ryszard S. Michalski,et al.  Pattern Recognition as Rule-Guided Inductive Inference , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Gérard Govaert,et al.  Clustering in Pattern Recognition , 1981 .

[9]  Ryszard S. Michalski,et al.  An Application of AI Techniques to Structuring Objects into an Optimal Conceptual Hierarchy , 1981, IJCAI.

[10]  Edwin Diday,et al.  A Recent Advance in Data Analysis: Clustering Objects into Classes Characterized by Conjunctive Concepts , 1981 .

[11]  Nils J. Nilsson,et al.  Principles of Artificial Intelligence , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Ryszard S. Michalski,et al.  Revealing Conceptual Structure in Data by Inductive Inference , 1982 .

[13]  Ryszard S. Michalski,et al.  A Theory and Methodology of Inductive Learning , 1983, Artificial Intelligence.