Paper information - On evaluating clustering procedures for use in classification

On evaluating clustering procedures for use in classification

The problem of evaluating clustering algorithms and their respective computer programs for use in a preprocessing step for classification is addressed. In clustering for classification the probability of correct classification is suggested as the ultimate measure of accuracy on training data. A means of implementing this criterion and a measure of cluster purity are discussed. Examples are given. A procedure for cluster labeling that is based on cluster purity and sample size is presented.

S. S. Yao | W. G. Eppler | M. D. Pore | T. E. Moritz | D. T. Register
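
The abstract names three ingredients: a cluster purity measure, a labeling procedure driven by purity and sample size, and the probability of correct classification on training data. The sketch below is one plausible reading of those ideas, not the authors' procedure. It assumes purity is the fraction of a cluster's training samples that belong to its majority class, and that a cluster is labeled only when it passes both thresholds. The function names and the `min_purity` and `min_size` thresholds are hypothetical, and the inputs are assumed to be non-empty.

```python
from collections import Counter


def cluster_purity(labels):
    """Return (majority_label, purity, size) for one cluster's training labels.

    Purity is taken here to be the majority-class fraction (an assumption).
    """
    counts = Counter(labels)
    majority, n_majority = counts.most_common(1)[0]
    return majority, n_majority / len(labels), len(labels)


def label_clusters(cluster_ids, class_labels, min_purity=0.8, min_size=5):
    """Assign each cluster its majority class, or None if it is too impure or too small.

    min_purity and min_size are hypothetical thresholds, not values from the paper.
    """
    members = {}
    for cid, y in zip(cluster_ids, class_labels):
        members.setdefault(cid, []).append(y)

    assignment = {}
    for cid, ys in members.items():
        majority, purity, size = cluster_purity(ys)
        if purity >= min_purity and size >= min_size:
            assignment[cid] = majority
        else:
            assignment[cid] = None  # cluster left unlabeled
    return assignment


def proportion_correct(cluster_ids, class_labels, assignment):
    """Empirical estimate of the probability of correct classification on training data.

    Samples in unlabeled clusters count as incorrect.
    """
    correct = 0
    for cid, y in zip(cluster_ids, class_labels):
        predicted = assignment.get(cid)
        if predicted is not None and predicted == y:
            correct += 1
    return correct / len(class_labels)


if __name__ == "__main__":
    # Toy data: eight training samples spread over three clusters.
    cluster_ids = [0, 0, 0, 0, 1, 1, 1, 2]
    class_labels = ["a", "a", "a", "b", "b", "b", "b", "a"]
    assignment = label_clusters(cluster_ids, class_labels, min_purity=0.7, min_size=3)
    print(assignment)
    print(proportion_correct(cluster_ids, class_labels, assignment))
```

Counting samples in unlabeled clusters as errors makes the estimate conservative. A different convention, such as excluding them from the denominator, would give a different figure.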