K-maximin clustering: a maximin correlation approach to partition-based clustering

We propose a new clustering algorithm based upon the maximin correlation analysis (MCA), a learning technique that can minimize the maximum misclassification risk. The proposed algorithm resembles conventional partition clustering algorithms such as k-means in that data objects are partitioned into k disjoint partitions. On the other hand, the proposed approach is unique in that an MCA-based approach is used to decide the location of the representative point for each partition. We test the proposed technique with typography data and show our approach outperforms the popular k-means and k-medoids clustering in terms of retrieving the inherent cluster membership.

[1]  Ian Witten,et al.  Data Mining , 2000 .

[2]  Jan A. Van Mieghem,et al.  Subclass Pattern Recognition: A Maximin Correlation Approach , 1995 .

[3]  Karl Rihaczek,et al.  1. WHAT IS DATA MINING? , 2019, Data Mining for the Social Sciences.

[4]  Hadar I. Avi-Itzhak,et al.  Multiple Subclass Pattern Recognition: A Maximin Correlation Approach , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.