Huge amounts of data have been stored in databases, data warehouses, geographic information systems, and other information repositories , and this data is still growing rapidly 4]. Companies are building data warehouses capable of storing hundreds of terabytes of data related to natural resource exploration; astronomical databases are measured in terabytes in size; and it is expected that the NASA Earth Observing System will transmit about 50 giga-bytes of image data per hour. This huge amount of data has posed great challenges to traditional data analysis methods for information and knowledge extraction. Data mining, or knowledge discovery in databases (KDD), has been emerging as a new research eld and a new technology for discovery of interesting, implicit, and previously unknown knowledge from large databases 4]. Data mining represents the connuence of several research elds, including machine learning, database systems , data visualization, statistics, and information theory. Besides many studies of knowledge discovery in relational and transaction databases, spatial data mining, which refers to the extraction of implicit knowledge, spatial relationships , or other patterns not explicitly stored in spatial databases, has attracted attention in recent research 2, 6, 7, 8, 10, 12]. Spatial data has many features distinguishing it from relational databases. It carries topologi-cal and/or distance information, usually organized by sophisticated, multidimensional spatial indexing structures, accessed by spatial data access methods, and often requiring spatial reasoning, geometric computation, and spatial knowledge representation techniques. Thus, spatial data mining demands an integration of data mining with spatial database technologies. A crucial challenge to spatial data mining is the exploration of eecient spatial data mining techniques due to the huge amount of spatial data and the complexity of spatial data types and spatial access methods. Spatial data mining can be used for browsing spatial databases, understanding spatial data, discovering spatial relationships and relationships between spatial and nonspatial data, reorganizing spatial databases, constructing spatial knowledge-bases, optimizing spatial queries, etc. It is expected to have wide applications in geographic information systems, remote sensing , image database exploration, medical imaging , navigation, and many other areas where spatial data is used. In this article, a short overview is provided to summarize recent studies on spatial data mining , including spatial data mining techniques, their strengths and weaknesses, how and when to apply them, and what are the challenges yet to be faced. Statistics was used as the most common approach for analyzing spatial data. Statistical …
[1]
Hans-Peter Kriegel,et al.
Visualization Techniques for Mining Large Databases: A Comparison
,
1996,
IEEE Trans. Knowl. Data Eng..
[2]
Richard R. Muntz,et al.
Scalable Exploratory Data Mining of Distributed Geoscientific Data
,
1996,
KDD.
[3]
Jiawei Han,et al.
Discovery of Spatial Association Rules in Geographic Information Databases
,
1995,
SSD.
[4]
Tian Zhang,et al.
BIRCH: an efficient data clustering method for very large databases
,
1996,
SIGMOD '96.
[5]
Raymond T. Ng,et al.
Finding Aggregate Proximity Relationships and Commonalities in Spatial Data Mining
,
1996,
IEEE Trans. Knowl. Data Eng..
[6]
Hans-Peter Kriegel,et al.
Knowledge Discovery in Large Spatial Databases: Focusing Techniques for Efficient Class Identification
,
1995,
SSD.
[7]
Beng Chin Ooi,et al.
Discovery of General Knowledge in Large Spatial Databases
,
1993
.
[8]
Robert F. Cromp,et al.
Data mining of multidimensional remotely sensed images
,
1993,
CIKM '93.
[9]
R. Ng,et al.
Eecient and Eeective Clustering Methods for Spatial Data Mining
,
1994
.
[10]
Padhraic Smyth,et al.
Image database exploration: progress and challenges
,
1993
.