论文信息 - Extending Standard Cluster Algorithms to Allow for Group Constraints

Extending Standard Cluster Algorithms to Allow for Group Constraints

Summary. This paper demonstrates how standard cluster algorithms like K-means or partitioning around medoids can be modified such that the final solution fulfills group constraints, which specify that certain data points must be or may not be in the same cluster. An extensible software implementation for the R statistical computing environment is presented that allows user-specified group constraints for clustering with respect to arbitrary distance measures. Finally we discuss applications of the methodology to market segmentation of household shopping basket panel data and model diagnostics for finite mixture models.

Friedrich Leisch | Bettina Grün | F. Leisch | B. Grün

[1] P. Green,et al. Corrigendum: On Bayesian analysis of mixtures with an unknown number of components , 1997 .

[2] F. Leisch. FlexMix: A general framework for finite mixture models and latent class regression in R , 2004 .

[3] Maurice K. Wong,et al. Algorithm AS136: A k-means clustering algorithm. , 1979 .

[4] Yoshua Bengio,et al. Pattern Recognition and Neural Networks , 1995 .

[5] Bettina Gr,et al. BOOTSTRAPPING FINITE MIXTURE MODELS , 2004 .

[6] Ali S. Hadi,et al. Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[7] M. Stephens. Dealing with label switching in mixture models , 2000 .

[8] Geoffrey J. McLachlan,et al. Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[9] Claire Cardie,et al. Proceedings of the Eighteenth International Conference on Machine Learning, 2001, p. 577–584. Constrained K-means Clustering with Background Knowledge , 2022 .

[10] Kenneth Steiglitz,et al. Combinatorial Optimization: Algorithms and Complexity , 1981 .

[11] J. MacQueen. Some methods for classification and analysis of multivariate observations , 1967 .