Paper Information - Clustering with Size Constraints

Clustering with Size Constraints

We consider the problem of partitioning a data set of n data objects into c homogeneous subsets or clusters (that is, data objects in the same subset should be similar to each other) with constraints on the number of data objects per cluster. The proposed techniques can be used for various purposes. If a set of items, jobs or customers has to be distributed among a limited number of resources and the workload for each resource should be balanced, clusters of approximately the same size are needed. If the resources have different capacities, then clusters of the corresponding sizes need to be found. We also extend our approach to avoid extremely small or large clusters in standard cluster analysis. Another extension offers a measure for comparing different prototype-based clustering results.
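To make the size-constrained setting concrete, the following is a minimal sketch of a greedy assignment step with per-cluster capacities. It is an illustrative heuristic only and not the technique proposed in the paper; the function name `assign_with_capacities`, its parameters, and the use of squared Euclidean distance to fixed cluster centers are all assumptions made for this example.

```python
import numpy as np

def assign_with_capacities(points, centers, capacities):
    """Greedy heuristic (hypothetical helper, not the paper's method):
    give each point the nearest center that still has free capacity."""
    points = np.asarray(points, dtype=float)
    centers = np.asarray(centers, dtype=float)
    n, c = len(points), len(centers)
    if sum(capacities) < n:
        raise ValueError("total capacity is smaller than the number of points")
    # Squared Euclidean distance from every point to every center, shape (n, c).
    dist = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    labels = np.full(n, -1)
    remaining = list(capacities)
    # Visit (point, center) pairs from the closest pair to the farthest.
    for flat in np.argsort(dist, axis=None):
        i, k = divmod(int(flat), c)
        if labels[i] == -1 and remaining[k] > 0:
            labels[i] = k
            remaining[k] -= 1
    return labels

# Four 1-D points, two centers, each cluster limited to two points.
labels = assign_with_capacities([[0], [1], [2], [10]], [[0], [10]], [2, 2])
```

In this toy input the point at 2 is closer to the first center, but that cluster is already full, so the point is assigned to the second cluster. Setting all capacities to roughly n/c corresponds to the balanced-workload case, and unequal capacities correspond to resources of different sizes.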

Frank Klawonn | Frank Höppner
