Validation of overlapping clustering

As a widely used clustering validation measure, the F-measure has received increased attention in the field of information retrieval. In this paper, we reveal that the F-measure can lead to biased ...
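For readers unfamiliar with the measure, here is a minimal sketch of one common formulation of the clustering F-measure: each true class is matched to the cluster that gives it the highest harmonic mean of precision and recall, and the per-class scores are averaged with weights proportional to class size. This formulation assumes a hard (non-overlapping) partition and is not necessarily the exact variant analyzed in the paper. The function name `clustering_f_measure` and its argument names are illustrative.

```python
from collections import Counter


def clustering_f_measure(classes, clusters):
    """Class-weighted best-match F-measure for a hard partition.

    classes  -- true class label of each object
    clusters -- cluster id assigned to each object (same order)
    Assumption: every object belongs to exactly one class and one cluster.
    """
    if len(classes) != len(clusters):
        raise ValueError("classes and clusters must have the same length")
    if not classes:
        raise ValueError("at least one object is required")

    n = len(classes)
    class_sizes = Counter(classes)
    cluster_sizes = Counter(clusters)
    joint = Counter(zip(classes, clusters))  # objects shared by (class, cluster)

    total = 0.0
    for c, n_c in class_sizes.items():
        best = 0.0
        for k, n_k in cluster_sizes.items():
            n_ck = joint.get((c, k), 0)
            if n_ck == 0:
                continue  # no overlap, so this pair's F is 0
            recall = n_ck / n_c
            precision = n_ck / n_k
            f = 2 * precision * recall / (precision + recall)
            best = max(best, f)
        total += (n_c / n) * best  # weight by class size
    return total


labels = ["a", "a", "b", "b"]
perfect = clustering_f_measure(labels, [0, 0, 1, 1])
single = clustering_f_measure(labels, [0, 0, 0, 0])
```

On this toy input a perfect partition scores 1.0, while placing all four objects in a single cluster still scores 2/3, because each class keeps full recall against that one cluster.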
