Eecient and Eeective Clustering Methods for Spatial Data Mining

Spatial data mining is the discovery of interesting relationships and characteristics that may exist implicitly in spatial databases. In this paper, we explore whether clustering methods have a role to play in spatial data mining. To this end, we develop a new clustering method called CLARANS which is based on randomized search. We also develop two spatial data mining algorithms that use CLARANS. Our analysis and experiments show that with the assistance of CLARANS, these two algorithms are very e ective and can lead to discoveries that are di cult to nd with current spatial data mining algorithms. Furthermore, experiments conducted to compare the performance of CLARANS with that of existing clustering methods show that CLARANS is the most e cient. keywords: spatial data mining, clustering algorithms, randomized search

[1]  Beng Chin Ooi,et al.  Discovery of General Knowledge in Large Spatial Databases , 1993 .

[2]  Fionn Murtagh,et al.  Cluster Dissection and Analysis: Theory, Fortran Programs, Examples. , 1986 .

[3]  Hans-Peter Kriegel,et al.  Supporting data mining of large databases by visual feedback queries , 1994, Proceedings of 1994 IEEE 10th International Conference on Data Engineering.

[4]  Alexander Borgida,et al.  Loading data into description reasoners , 1993, SIGMOD Conference.

[5]  André Hardy,et al.  An examination of procedures for determining the number of clusters in a data set , 1994 .

[6]  Eugene Wong,et al.  Query optimization by simulated annealing , 1987, SIGMOD '87.

[7]  Efficient processing of spatial joins using R-trees , 1993 .

[8]  Derek Thompson,et al.  Fundamentals of spatial information systems , 1992, A.P.I.C. series.

[9]  Yannis E. Ioannidis,et al.  Randomized algorithms for optimizing large join queries , 1990, SIGMOD '90.

[10]  Tomasz Imielinski,et al.  An Interval Classifier for Database Mining Applications , 1992, VLDB.

[11]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[12]  Oliver G Unther Eecient Computation of Spatial Joins , 1993 .

[13]  Hanan Samet,et al.  The Design and Analysis of Spatial Data Structures , 1989 .

[14]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[15]  Jiawei Han,et al.  Knowledge Discovery in Databases: An Attribute-Oriented Approach , 1992, VLDB.