Finding natural clusters having minimum description length
暂无分享,去创建一个
A two-step procedure that finds natural clusters in geometric point data is described. The first step computes a hierarchical cluster tree minimizing an entropy objective function. The second step recursively explores the tree for a level clustering having minimum description length. Together, these two steps find natural clusters without requiring a user to specify threshold parameters or so-called magic numbers. In particular, the method automatically determines the number of clusters in the input data. The first step exploits a new hierarchical clustering procedure called numerical iterative hierarchical clustering (NIHC). The output of NIHC is a cluster tree. The second step in the procedure searches the tree for a minimum-description-length (MDL) level clustering. The MDL formulation, equivalent to maximizing the posterior probability, is suited to the clustering problem because it defines a natural prior distribution.<<ETX>>
[1] Arthur C. Sanderson,et al. Attributed image matching using a minimum representation size criterion , 1989, Proceedings, 1989 International Conference on Robotics and Automation.
[2] C. S. Wallace,et al. An Information Measure for Classification , 1968, Comput. J..
[3] C. S. Wallace,et al. An Information Measure for Single Link Classification , 1975, Comput. J..
[4] J. Rissanen. A UNIVERSAL PRIOR FOR INTEGERS AND ESTIMATION BY MINIMUM DESCRIPTION LENGTH , 1983 .