Lambda Consensus Clustering

This paper introduces an extension to consensus clustering that allows a feedback of the results of the consensus to the original clustering processes. The original clustering processes may use this information to update their partitioning of the data. An exponential weighting approach, called lambda consensus, is presented as a method to merged the consensus information into graph based and vector space based clustering algorithms. Successful consensus clustering is highly dependent on the quality and diversity of the partitions in the ensemble. The feedback signal allows the clustering processes to adapt their algorithms to attempt to improve quality and diversity of the set of partitions in the ensemble. Communication requirements are on the same order as consensus clustering as only the consensus labels are returned to the clustering processes. The method is evaluated on real world data sets.

[1]  Joydeep Ghosh,et al.  Cluster Ensembles --- A Knowledge Reuse Framework for Combining Multiple Partitions , 2002, J. Mach. Learn. Res..

[2]  S DhillonInderjit,et al.  Weighted Graph Cuts without Eigenvectors A Multilevel Approach , 2007 .

[3]  Jesse Johnson,et al.  Pinch Ratio Clustering from a Topologically Intrinsic Lexicographic Ordering , 2013, SDM.

[4]  Hui Xiong,et al.  K-Means-Based Consensus Clustering: A Unified View , 2015, IEEE Transactions on Knowledge and Data Engineering.

[5]  M. Cugmas,et al.  On comparing partitions , 2015 .

[6]  George Michailidis,et al.  Critical limitations of consensus clustering in class discovery , 2014, Scientific Reports.

[7]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[8]  Inderjit S. Dhillon,et al.  Weighted Graph Cuts without Eigenvectors A Multilevel Approach , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Inderjit S. Dhillon,et al.  Kernel k-means: spectral clustering and normalized cuts , 2004, KDD.

[10]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[11]  Joydeep Ghosh,et al.  Cluster ensembles , 2011, Data Clustering: Algorithms and Applications.

[12]  Shaina Race,et al.  Determining the Number of Clusters via Iterative Consensus Clustering , 2014, SDM.

[13]  J. Kruskal Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis , 1964 .

[14]  Mohammad Reza Daliri,et al.  Shape and texture clustering: Best estimate for the clusters number , 2009, Image Vis. Comput..

[15]  W. Torgerson Multidimensional scaling: I. Theory and method , 1952 .

[16]  Anil K. Jain Data clustering: 50 years beyond K-means , 2008, Pattern Recognit. Lett..