Fouille de données spatiales. Approche basée sur la programmation logique inductive

Spatial data mining requires the analysis of the interactions in space. The conventional data mining algorithms do not support well this type of analysis. We present in this paper an approach based on inductive logic programming (ILP). It is based on two ideas. The first one consists in materializing these interactions using distance tables, so that the spatial data mining problem is reduced to relational data mining problem. The second consists in transforming data into first order logic, and then applying the inductive logic programming methods. This paper details this approach, and describes its application to the supervised classification by spatial decision tree. It shows also some experimentation results in the shellfish contamination analysis in Thau lagoon.

[1]  Hans-Peter Kriegel,et al.  Spatial Data Mining: A Database Approach , 1997, SSD.

[2]  Derek Thompson,et al.  Fundamentals of spatial information systems , 1992, A.P.I.C. series.

[3]  Saso Dzeroski,et al.  Inductive Logic Programming: Techniques and Applications , 1993 .

[4]  Shashi Shekhar,et al.  Spatial Databases: A Tour , 2003 .

[5]  Luc De Raedt,et al.  How to Upgrade Propositional Learners to First Order Logic: A Case Study , 2001, Machine Learning and Its Applications.

[6]  Junas Adhikary,et al.  Knowledge Discovery in Spatial Databases Progress and Challenges , 1996 .

[7]  Robert Haining,et al.  Statistics for spatial data: by Noel Cressie, 1991, John Wiley & Sons, New York, 900 p., ISBN 0-471-84336-9, US $89.95 , 1993 .

[8]  Nadjim Chelghoum,et al.  A Decision Tree for Multi-Layered Spatial Data , 2002 .

[9]  Daniel A. Grijfith Statistical Techniques in Geographical Analysis , 1985 .

[10]  Donato Malerba,et al.  An ILP Method for Spatial Association Rule Mining , 2001 .

[11]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[12]  Luc Anselin,et al.  Do spatial effects really matter in regression analysis , 2005 .

[13]  Peter A. Flach,et al.  Propositionalization approaches to relational data mining , 2001 .

[14]  Michelangelo Ceci,et al.  Spatial Associative Classification at Different Levels of Granularity: A Probabilistic Approach , 2004, PKDD.

[15]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[16]  Hendrik Blockeel,et al.  Top-Down Induction of First Order Logical Decision Trees , 1998, AI Commun..

[17]  Jiawei Han,et al.  An Efficient Two-Step Method for Classification of Spatial Data , 1998 .

[18]  Karine Zeitouni,et al.  Join Indices as a Tool for Spatial Data Mining , 2000, TSDM.

[19]  Nadjim Chelghoum,et al.  Spatial Data Mining Implementation: Alternatives and Performances , 2004, GeoInfo.

[20]  Max J. Egenhofer,et al.  Reasoning about Binary Topological Relations , 1991, SSD.

[21]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[22]  L. Anselin What is Special About Spatial Data? Alternative Perspectives on Spatial Data Analysis (89-4) , 1989 .