论文信息 - Supporting Relational Knowledge Discovery: Lessons in Architecture and Algorithm Design

Supporting Relational Knowledge Discovery: Lessons in Architecture and Algorithm Design

This paper discusses a few of the lessons we have learned d eveloping a relational knowledge discovery system. The relationships among data instances in relational data provide e xtra information for “mining.” This additional information has the potential to greatly improve the quality of learned models. However, the dependencies among instances in the data a lso introduce new statistical challenges for learning algorithms. Relational data provide a n ideal environment i n which to examine a central challenge of knowledge discovery ‐ its “chicken and egg” character. Data representation can impair the a bility to learn important knowledge, but knowing the “right” data representation often requires just that knowledge. With relational data, representation is often a c hoice; many alternate views of the data provide a bundant fodder for r easoning about transformations. In light of this, we discuss representation and d esign choices that support a co-evolutionary process of knowledge discovery and data transformation in relation data.

Jennifer Neville | David Jensen

[1] Jennifer Neville,et al. Autocorrelation and Linkage Cause Bias in Evaluation of Relational Learners , 2002, ILP.

[2] Peter A. Flach,et al. The role of feature construction in inductive rule learning , 2000 .

[3] David D. Jensen. Statistical challenges to inductive inference in linked data , 1999, AISTATS.

[4] Corinna Cortes,et al. Communities of interest , 2001, Intell. Data Anal..

[5] Tom M. Mitchell,et al. Discovering Test Set Regularities in Relational Domains , 2000, ICML.

[6] Jennifer Neville,et al. Linkage and Autocorrelation Cause Feature Selection Bias in Relational Learning , 2002, ICML.

[7] Carson C. Chow,et al. Small Worlds , 2000 .

[8] D. A. Bell,et al. Applied Statistics , 1953, Nature.

[9] Gesine Reinert,et al. Small worlds , 2001, Random Struct. Algorithms.

[10] Lise Getoor,et al. Learning Probabilistic Relational Models , 1999, IJCAI.

[11] Stanley Wasserman,et al. Social Network Analysis: Methods and Applications , 1994 .

[12] S. Džeroski,et al. Relational Data Mining , 2001, Springer Berlin Heidelberg.

[13] Michael J. Pazzani,et al. Relational Clichés: Constraining Induction During Relational Learning , 1991, ML.

[14] Jennifer Neville,et al. Iterative Classification in Relational Data , 2000 .

[15] David Page,et al. KDD Cup 2001 report , 2002, SKDD.