A Simple Relational Classifier

Abstract: We analyze a Relational Neighbor (RN) classifier, a simple relational predictive model that predicts based only on the class labels of related neighbors, using no learning and no inherent attributes. We show that it performs surprisingly well by comparing it to more complex models, such as Probabilistic Relational Models and Relational Probability Trees, on three data sets from published work. We argue that a simple model such as this should be used as a baseline to assess the performance of relational learners.
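The idea of predicting only from neighbors' class labels can be illustrated with a minimal sketch. The abstract does not give the exact scoring rule, so the weighted vote below is an assumption, and the names `rn_predict`, `neighbors`, `labels`, and `prior`, as well as the toy graph, are hypothetical and not taken from the paper.

```python
from collections import defaultdict


def rn_predict(node, neighbors, labels, prior=None):
    """Score classes for `node` using only the known labels of its neighbors.

    Assumption: each labeled neighbor casts a vote for its own class,
    weighted by the link weight, and the votes are normalized to sum to 1.
    No model is trained and no attributes of `node` itself are used.
    """
    votes = defaultdict(float)
    for other, weight in neighbors.get(node, []):
        label = labels.get(other)
        if label is not None:  # unlabeled neighbors contribute nothing
            votes[label] += weight
    total = sum(votes.values())
    if total == 0:
        # No labeled neighbors: fall back to a supplied prior, if any.
        return dict(prior) if prior else {}
    return {cls: w / total for cls, w in votes.items()}


# Hypothetical toy graph: node -> list of (neighbor, link weight).
neighbors = {
    "p4": [("p1", 1.0), ("p2", 1.0), ("p3", 2.0), ("p5", 1.0)],
}
labels = {"p1": "ml", "p2": "db", "p3": "ml"}  # "p5" has no known label

scores = rn_predict("p4", neighbors, labels)
prediction = max(scores, key=scores.get)
```

In this toy graph the labeled neighbors of `p4` carry a total weight of 4.0, of which 3.0 belongs to class `ml`, so `ml` receives the highest score. A baseline of this kind needs no training step, which is what makes it cheap to run alongside more complex relational learners.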
