A Grid-based Spatial Association Mining Method

The grid is a distributed computing infrastructure that supports the sharing and coordinated use of various resources in virtual organizations. The grid can be used for compute intensive tasks and data intensive applications. Data mining algorithms are intensive compute and data, and spatial data are heterogeneous, multidimensional and stored at various places. Therefore, the grid can provide a computing and data management platform for spatial data. In this paper, a grid-based spatial association mining method, grid-based spatial apriori algorithm (GSAA), is presented to find hidden relations and regularities in the grid framework. The main thoughts of GSAA are described. We adopt GSAA in a traffic information system. It discovers the inherent connections and relative factors, and finds the causes of traffic accidents and the places where the traffic accidents often take place. It proves to improve the traffic conditions of cities in the grid framework effectively.

[1]  Shashi Shekhar,et al.  Spatial Databases - Accomplishments and Research Needs , 1999, IEEE Trans. Knowl. Data Eng..

[2]  Ickjai Lee,et al.  Mining Multivariate Associations within GIS Environments , 2004, IEA/AIE.

[3]  Ian T. Foster,et al.  Grid Services for Distributed System Integration , 2002, Computer.

[4]  Yasuhiko Morimoto,et al.  Mining frequent neighboring class sets in spatial databases , 2001, KDD '01.

[5]  Ian T. Foster,et al.  The anatomy of the grid: enabling scalable virtual organizations , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[6]  Philip K. Chan,et al.  Advances in Distributed and Parallel Knowledge Discovery , 2000 .

[7]  Petr Kuba,et al.  Data structures for spatial data mining , 2001 .

[8]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[9]  Hans-Peter Kriegel,et al.  Algorithms and Applications for Spatial Data Mining , 2001 .

[10]  Domenico Talia Grid-based Distributed Data Mining Systems , Algorithms and Services , 2006 .

[11]  F. Leon,et al.  Mining Association Rules in Geographic Information Systems using Intelligent Agents , 2004 .

[12]  Steven Tuecke,et al.  The Physiology of the Grid An Open Grid Services Architecture for Distributed Systems Integration , 2002 .

[13]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[14]  Le Gruenwald,et al.  A survey of data mining and knowledge discovery software tools , 1999, SKDD.

[15]  Dave Pearson,et al.  Grid Database Access and Integration: Requirements and Functionalities , 2003 .

[16]  Takashi Washio,et al.  An Apriori-Based Algorithm for Mining Frequent Substructures from Graph Data , 2000, PKDD.

[17]  Jiawei Han,et al.  Geographic Data Mining and Knowledge Discovery , 2001 .

[18]  Ian T. Foster,et al.  Data management and transfer in high-performance computational grid environments , 2002, Parallel Comput..

[19]  Ian T. Foster,et al.  The data grid: Towards an architecture for the distributed management and analysis of large scientific datasets , 2000, J. Netw. Comput. Appl..