A maximum entropy approach to pairwise data clustering

Partitioning a set of data points which are characterized by their mutual dissimilarities instead of an explicit coordinate representation is a difficult, NP-hard combinatorial optimization problem. The authors formulate this optimization problem of a pairwise clustering cost function in the maximum entropy framework using a variational principle to derive corresponding data partitionings in a d-dimensional Euclidian space. This approximation solves the embedding problem and the grouping of these data into clusters simultaneously and in a selfconsistent fashion.

[1]  Rose,et al.  Statistical mechanics and phase transitions in clustering. , 1990, Physical review letters.

[2]  Anders Krogh,et al.  Introduction to the theory of neural computation , 1994, The advanced book program.

[3]  Y. Tikochinsky,et al.  Alternative approach to maximum-entropy inference , 1984 .

[4]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[5]  Robin Sibson,et al.  What is projection pursuit , 1987 .

[6]  R. Gray,et al.  Vector quantization , 1984, IEEE ASSP Magazine.

[7]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[8]  Joachim M. Buhmann,et al.  Vector quantization with complexity costs , 1993, IEEE Trans. Inf. Theory.