EasySDM: An integrated and easy to use Spatial Data Mining platform

Spatial Data Mining allows users to extract implicit but valuable knowledge from spatial related data. Two main approaches have been used in the literature. The first one applies simple Data Mining algorithms after a spatial pre-processing step. While the second one consists of developing specific algorithms that considers the spatial relations inside the mining process. In this work, we first present a study of existing Spatial Data Mining tools according to the implemented tasks and specific characteristics. Then, we illustrate a new open source Spatial Data Mining platform (EasySDM) that integrates both approaches (pre-processing and dynamic mining). It proposes a set of algorithms belonging to clustering, classification and association rule mining tasks. Moreover and more importantly, it allows geographic visualization of both the data and the results. Either via an internal map display or using any external Geographic Information System.

[1]  H. Miller Tobler's First Law and Spatial Analysis , 2004 .

[2]  Youngihn Kho,et al.  GeoDa: An Introduction to Spatial Data Analysis , 2006 .

[3]  Hervé Thiriez,et al.  OR software , 1998, European Journal of Operational Research.

[4]  Ian H. Witten,et al.  Data mining - practical machine learning tools and techniques, Second Edition , 2005, The Morgan Kaufmann series in data management systems.

[5]  Jiawei Han,et al.  GeoMiner: a system prototype for spatial data mining , 1997, SIGMOD '97.

[6]  Jiawei Han,et al.  DBMiner: A System for Mining Knowledge in Large Relational Databases , 1996, KDD.

[7]  Vania Bogorny,et al.  Weka-GDPM – Integrating Classical Data Mining Toolkit to Geographic Information Systems , 2006 .

[8]  Le Gruenwald,et al.  A survey of data mining and knowledge discovery software tools , 1999, SKDD.

[9]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[10]  Michael MAY,et al.  An architecture for the SPIN ! spatial data mining platform , 2001 .

[11]  Jeremy Mennis,et al.  Spatial data mining and geographic knowledge discovery - An introduction , 2009, Comput. Environ. Urban Syst..

[12]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[13]  Franco Turini,et al.  Knowledge Discovery from Geographical Data , 2008, Mobility, Data Mining and Privacy.

[14]  Zoran Obradovic,et al.  A software system for spatial data analysis and modeling , 2000, Proceedings of the 33rd Annual Hawaii International Conference on System Sciences.

[15]  Toshinori Munakata,et al.  Knowledge discovery , 1999, Commun. ACM.

[16]  Diansheng Guo,et al.  Regionalization with dynamically constrained agglomerative clustering and partitioning (REDCAP) , 2008, Int. J. Geogr. Inf. Sci..

[17]  Donato Malerba,et al.  An Integrated Platform for Spatial Data Mining within a GIS Environment , 2007, 2007 IEEE 23rd International Conference on Data Engineering Workshop.

[18]  Mamadou Ouattara,et al.  Fouille de données : vers une nouvelle approche intégrant de façon cohérente et transparente la composante spatiale , 2010 .

[19]  Frank Hsu,et al.  Knowledge Discovery , 2014, Encyclopedia of Social Network Analysis and Mining.