Optimization Strategies for Two-Mode Partitioning

Two-mode partitioning is a relatively new form of clustering that simultaneously clusters the rows and the columns of a data matrix. In this paper, we consider deterministic two-mode partitioning methods in which a criterion similar to that of k-means is optimized. A variety of optimization methods have been proposed for this type of problem. However, it is still unclear which method should be used, because the methods may converge to non-global optima. This paper reviews and compares several optimization methods for two-mode partitioning. Several known methods are discussed, and a new fuzzy steps method is introduced. The fuzzy steps method is based on the fuzzy c-means algorithm of Bezdek (1981) and the fuzzy steps approach of Heiser and Groenen (1997) and Groenen and Jajuga (2001). The performance of all methods is compared in a large simulation study. In our simulations, a two-mode k-means optimization method most often gives the best results. Finally, an empirical data set is used to give a practical example of two-mode partitioning.
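
To make the k-means-like criterion concrete, the following is a minimal sketch of one common form of two-mode k-means, not the authors' implementation. It assumes the criterion is the sum of squared differences between each matrix entry and the mean of the block formed by its row cluster and column cluster, and it alternates row reassignment, column reassignment, and block-mean updates. The function names (`block_means`, `loss`, `two_mode_kmeans`), the random initialization, and the stopping tolerance are illustrative assumptions.

```python
import random


def block_means(X, rows, cols, K, L):
    """Mean of X within every (row cluster, column cluster) block."""
    sums = [[0.0] * L for _ in range(K)]
    counts = [[0] * L for _ in range(K)]
    for i, x_row in enumerate(X):
        for j, x in enumerate(x_row):
            sums[rows[i]][cols[j]] += x
            counts[rows[i]][cols[j]] += 1
    # Assumption: an empty block gets centre 0.0; it contributes nothing
    # to the loss, so the value chosen here is arbitrary.
    return [[sums[k][l] / counts[k][l] if counts[k][l] else 0.0
             for l in range(L)] for k in range(K)]


def loss(X, rows, cols, V):
    """Sum of squared deviations of each entry from its block centre."""
    return sum((x - V[rows[i]][cols[j]]) ** 2
               for i, x_row in enumerate(X) for j, x in enumerate(x_row))


def two_mode_kmeans(X, K, L, seed=0, max_iter=100, tol=1e-12):
    """Alternate row and column reassignment until the loss stops improving."""
    rng = random.Random(seed)
    n, m = len(X), len(X[0])
    rows = [rng.randrange(K) for _ in range(n)]  # random start (assumption)
    cols = [rng.randrange(L) for _ in range(m)]
    V = block_means(X, rows, cols, K, L)
    best = loss(X, rows, cols, V)

    def row_cost(i, k):
        # Loss contribution of row i if it were placed in row cluster k.
        return sum((X[i][j] - V[k][cols[j]]) ** 2 for j in range(m))

    def col_cost(j, l):
        # Loss contribution of column j if it were placed in column cluster l.
        return sum((X[i][j] - V[rows[i]][l]) ** 2 for i in range(n))

    for _ in range(max_iter):
        rows = [min(range(K), key=lambda k: row_cost(i, k)) for i in range(n)]
        V = block_means(X, rows, cols, K, L)
        cols = [min(range(L), key=lambda l: col_cost(j, l)) for j in range(m)]
        V = block_means(X, rows, cols, K, L)
        current = loss(X, rows, cols, V)
        converged = best - current < tol
        best = current
        if converged:
            break
    return rows, cols, V, best


if __name__ == "__main__":
    # Toy matrix with a visible 2 x 2 block structure (made-up data).
    X = [[1, 1, 5, 5],
         [1, 1, 5, 5],
         [9, 9, 3, 3],
         [9, 9, 3, 3]]
    # Several random starts, keeping the lowest loss found.
    runs = [two_mode_kmeans(X, K=2, L=2, seed=s) for s in range(10)]
    rows, cols, V, value = min(runs, key=lambda r: r[3])
    print(rows, cols, value)
```

Each step of this sketch cannot increase the loss, but the partition it ends in depends on the starting point, which is why the usage example keeps the best of several random starts. That dependence is the local-optimum issue the abstract refers to.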

[1]  Maurizio Vichi.  Double k-means Clustering for Simultaneous Classification of Objects and Variables , 2001 .

[2]  Elliot Noma,et al.  Benchmark for the Blocking of Sociometric Data , 1985 .

[3]  A. Ferligoj,et al.  Generalized Blockmodeling: Preface , 2004 .

[4]  Boris Mirkin,et al.  Clustering for Data Mining: A Data Recovery Approach , 2005 .

[5]  John A. Hartigan,et al.  Clustering Algorithms , 1975 .

[6]  Hans-Hermann Bock,et al.  Data Analysis and Information Systems , 1996 .

[7]  Maurizio Vichi,et al.  Two-mode multi-partitioning , 2008, Comput. Stat. Data Anal..

[8]  Fred W. Glover,et al.  Future paths for integer programming and links to artificial intelligence , 1986, Comput. Oper. Res..

[9]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[10]  Otto Opitz,et al.  Classification and Knowledge Organization , 1997 .

[11]  W. DeSarbo.  Gennclus: New models for general nonhierarchical clustering analysis , 1982 .

[12]  Hans-Hermann Bock,et al.  Two-mode clustering methods: a structured overview , 2004, Statistical methods in medical research.

[13]  Daniel Baier,et al.  Two-Mode Overlapping Clustering With Applications to Simultaneous Benefit Segmentation and Market Structuring , 1997 .

[14]  Hans-Hermann Bock,et al.  Classification, Clustering, and Data Analysis , 2002 .

[15]  J. Trejos,et al.  Simulated Annealing Optimization for Two-mode Partitioning , 2000 .

[16]  Martin Schader,et al.  Advances in Classification and Data Analysis , 2001 .

[17]  Jorge J. Moré,et al.  Benchmarking optimization software with performance profiles , 2001, Math. Program..

[18]  L. Hubert,et al.  Additive two-mode clustering: The error-variance approach revisited , 1995 .

[19]  Vladimir Batagelj,et al.  Generalized blockmodeling of two-mode network data , 2004, Soc. Networks.

[20]  James C. Bezdek,et al.  Fuzzy Kohonen clustering networks , 1994, Pattern Recognit..

[21]  P. Groenen,et al.  Cluster differences scaling with a within-clusters loss component and a fuzzy successive approximation strategy to avoid local minima , 1997 .

[22]  L. Hubert,et al.  Comparing partitions , 1985 .

[23]  J. Hansohm,et al.  Two-mode Clustering with Genetic Algorithms , 2002 .

[24]  Jan Schepers,et al.  Selecting Among Multi-Mode Partitioning Models of Different Complexities: A Comparison of Four Model Selection Criteria , 2008, J. Classif..

[25]  G. W. Milligan,et al.  An examination of the effect of six types of error perturbation on fifteen clustering algorithms , 1980 .

[26]  Emile H. L. Aarts,et al.  Simulated Annealing: Theory and Applications , 1987, Mathematics and Its Applications.

[27]  Martin Schader,et al.  A New Algorithm for Two-Mode Clustering , 1996 .

[28]  Javier Trejos,et al.  Two-mode Partitioning: Review of Methods and Application of Tabu Search , 2002 .

[29]  Krzysztof Jajuga,et al.  Fuzzy clustering with squared Minkowski distances , 2001, Fuzzy Sets Syst..