Hierarchical and Non-Hierarchical Medoid Clustering Using Asymmetric Similarity Measures

Medoid clustering frequently gives better interpretation than the K-means clustering, since a unique object is the representative element of a cluster. Moreover the method of medoids can be applied to non-metric cases such as weighted graphs that arise in analyzing SNS (Social Networking Service) networks. A fundamental problem in clustering is that asymmetric similarity measures are difficult to handle, while relations are asymmetric in SNS user groups. In this paper we consider K-medoids clustering for asymmetric graphs in which a cluster has two different centers with outgoing directions and incoming directions. Moreover two-stage agglomerative hierarchical clustering is studied in which the first stage is a one-pass K-medoids and the second stage uses an agglomerative algorithm. These methods are applied to artificial and real data sets.

[1]  Sadaaki Miyamoto,et al.  A method of two stage clustering using agglomerative hierarchical algorithms with one-pass k-means++ or k-median++ , 2014, 2014 IEEE International Conference on Granular Computing (GrC).

[2]  R. Krishnapuram,et al.  A fuzzy relative of the k-medoids algorithm with application to web document and snippet clustering , 1999, FUZZ-IEEE'99. 1999 IEEE International Fuzzy Systems. Conference Proceedings (Cat. No.99CH36315).

[3]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[4]  宮本 定明 Fuzzy sets in information retrieval and cluster analysis , 1990 .

[5]  Alexander J. Smola,et al.  Learning with kernels , 1998 .

[6]  Sergei Vassilvitskii,et al.  k-means++: the advantages of careful seeding , 2007, SODA '07.

[7]  Matthew A. Russell,et al.  Mining the social web , 2011 .

[8]  Sadaaki Miyamoto,et al.  Algorithms for Fuzzy Clustering - Methods in c-Means Clustering with Applications , 2008, Studies in Fuzziness and Soft Computing.

[9]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[10]  Charu C. Aggarwal,et al.  An Introduction to Cluster Analysis , 2018, Data Clustering: Algorithms and Applications.

[11]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[12]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[13]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[14]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[15]  B. Everitt,et al.  Cluster Analysis: Everitt/Cluster Analysis , 2011 .