Multi-relational Data Mining: a perspective

Multi-relational data mining (MRDM) is a form of data mining operating on data stored in multiple database tables. While machine learning and data mining are traditionally concerned with learning from single tables, MRDM is required in domains where the data are highly structured. One approach to MRDM is to use a predicate-logical language like clausal logic or Prolog to represent and reason about structured objects, an approach which came to be known as inductive logic programming (ILP) [18, 19, 15, 16, 13, 17, 2, 5]. In this talk I will review recent developments that have led from ILP to the broader field of MRDM. Briefly, these developments include the following: - the use of other declarative languages, including functional and higher-order languages, to represent data and learned knowledge [9, 6, 1]; - a better understanding of knowledge representation issues, and the importance of data modelling in MRDM tasks [7, 11]; - a better understanding of the relation between MRDM and standard single-table learning, and how to upgrade single-table methods to MRDM or downgrade MRDM tasks to single-table ones (propositionalisation) [3, 12, 10, 14]; - the study of non-classificatory learning tasks, such as subgroup discovery and multi-relational association rule mining [8, 4, 21]; - the incorporation of ROC analysis and cost-sensitive classification [20].

[1]  Saso Dzeroski,et al.  Inductive Logic Programming: Techniques and Applications , 1993 .

[2]  Peter A. Flach Knowledge Representation for Inductive Learning , 1999, ESCQARU.

[3]  De Raedt,et al.  Advances in Inductive Logic Programming , 1996 .

[4]  Peter A. Flach,et al.  Strongly Typed Inductive Concept Learning , 1998, ILP.

[5]  Stefan Wrobel,et al.  Inductive Logic Programming for Knowledge Discovery in Databases , 2001 .

[6]  Luc De Raedt,et al.  Inductive Logic Programming: Theory and Methods , 1994, J. Log. Program..

[7]  Luc Dehaspe,et al.  Discovery of relational association rules , 2001 .

[8]  Ehud Shapiro,et al.  Algorithmic Program Debugging , 1983 .

[9]  Peter A. Flach,et al.  An extended transformation approach to inductive logic programming , 2001, ACM Trans. Comput. Log..

[10]  José Hernández-Orallo,et al.  A Strong Complete Schmema for Inductive Functional Logic Programming , 1999, ILP.

[11]  John W. Lloyd,et al.  Classification of Individuals with Complex Structure , 2000, ICML.

[12]  Luc De Raedt,et al.  How to Upgrade Propositional Learners to First Order Logic: A Case Study , 2001, Machine Learning and Its Applications.

[13]  G. Plotkin Automatic Methods of Inductive Inference , 1972 .

[14]  Luc De Raedt,et al.  Attribute-Value Learning Versus Inductive Logic Programming: The Missing Links (Extended Abstract) , 1998, ILP.

[15]  Peter A. Flach,et al.  Propositionalization approaches to relational data mining , 2001 .