A Clustering-Oriented Star Coordinate Translation Method for Reliable Clustering Parameterization

When running a clustering process, users generally want to know whether the result is reliable enough to reflect the actual cluster structure of the data. The number of clusters and the initial cluster centers are two critical parameters that strongly influence the reliability of clustering results. We propose a Clustering-Oriented Star Coordinate Translation (COSCT) method to help users determine these two parameters more confidently. Through COSCT, all objects in a multi-dimensional space are adaptively translated to a 2D star-coordinate plane, so that clustering parameterization can be carried out by observing the cluster structure in that plane. To enhance the cluster-displaying quality of the star-coordinate plane, feature weighting and coordinate arrangement procedures are developed. The effectiveness of the COSCT method is demonstrated through a set of experiments.
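To make the idea of translating objects to a 2D star-coordinate plane concrete, the following is a minimal sketch of a basic star-coordinate projection, not the COSCT procedure itself. It assumes min-max normalized features and axes spread evenly around the origin. The function name `star_coordinates` and its `weights` and `order` parameters are hypothetical stand-ins for the feature weighting and coordinate arrangement steps, whose actual computation the abstract does not specify.

```python
import numpy as np

def star_coordinates(X, weights=None, order=None):
    """Project n x d data onto a 2D plane with a basic star-coordinate layout.

    weights: hypothetical per-feature axis lengths (default: all 1).
    order:   hypothetical axis arrangement; order[k] is the feature placed
             at the k-th angular slot (default: original feature order).
    """
    X = np.asarray(X, dtype=float)
    n, d = X.shape

    # Min-max normalize each feature to [0, 1]; constant features map to 0.
    lo = X.min(axis=0)
    span = X.max(axis=0) - lo
    span[span == 0] = 1.0
    Z = (X - lo) / span

    weights = np.ones(d) if weights is None else np.asarray(weights, dtype=float)
    order = np.arange(d) if order is None else np.asarray(order)

    # Assign each feature an axis direction, evenly spaced around the origin.
    angles = np.empty(d)
    angles[order] = 2 * np.pi * np.arange(d) / d
    axes = np.column_stack([np.cos(angles), np.sin(angles)]) * weights[:, None]

    # Each object's 2D position is the weighted sum of its axis vectors.
    return Z @ axes
```

Plotting the two returned columns as a scatter plot gives a view in which a user could look for visually separated groups. In this sketch, changing `weights` or `order` reshapes the layout, which is the kind of adjustment the feature weighting and coordinate arrangement procedures are meant to choose automatically.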
