Overcoming Relational Learning Biases to Accurately Predict Preferences in Large Scale Networks

Many individuals on social networking sites provide traits about themselves, such as interests or demographics. Social networking sites can use this information to provide better content to match their users' interests, such as recommending scheduled events or various relevant products. These tasks require accurate probability estimates to determine the correct answer to return. Relational machine learning (RML) is an excellent framework for these problems as it jointly models the user labels given their attributes and the relational structure. Further, semi-supervised learning methods could enable RML methods to exploit the large amount of unlabeled data in networks. However, existing RML approaches have limitations that prevent their application in large scale domains. First, semi-supervised methods for RML do not fully utilize all the unlabeled instances in the network. Second, the collective inference procedures necessary to jointly infer the missing labels are generally viewed as too expensive to apply in large scale domains. In this work, we address each of these limitations. We analyze the effect of full semi-supervised RML and find that collective inference methods can introduce considerable bias into predictions. We correct this by implementing a maximum entropy constraint on the inference step, forcing the predictions to have the same distribution as the observed labels. Next, we outline a massively scalable variational inference algorithm for large scale relational network domains. We extend this inference algorithm to incorporate the maximum entropy constraint, proving that it only requires a constant amount of overhead while remaining massively parallel. We demonstrate our method's improvement over a variety of baselines on seven real world datasets, including large scale networks with over five million edges.

[1]  Paul N. Bennett,et al.  Composite Likelihood Data Augmentation for Within-Network Statistical Relational Learning , 2014, 2014 IEEE International Conference on Data Mining.

[2]  Ramana Rao Kompella,et al.  Space-efficient sampling from social activity streams , 2012, BigMine '12.

[3]  Jennifer Neville,et al.  Randomization Tests for Relational Learning , 2003 .

[4]  Jennifer Neville,et al.  Collective Classification with Relational Dependency Networks , 2003 .

[5]  Jennifer Neville,et al.  Modeling relationship strength in online social networks , 2010, WWW '10.

[6]  Ramana Rao Kompella,et al.  Network Sampling Designs for Relational Classification , 2012, ICWSM.

[7]  Tina Eliassi-Rad,et al.  Correcting Bias in Statistical Tests for Network Classifier Evaluation , 2011, ECML/PKDD.

[8]  Ryan A. Rossi,et al.  Role-dynamics: fast mining of large dynamic networks , 2012, WWW.

[9]  Jennifer Neville,et al.  Statistical Relational Learning: Four Claims and a Survey , 2003 .

[10]  Jun'ichi Tsujii,et al.  Training a Naive Bayes Classifier via the EM Algorithm with a Class Distribution Constraint , 2003, CoNLL.

[11]  Jennifer Neville,et al.  Supporting Relational Knowledge Discovery: Lessons in Architecture and Algorithm Design , 2002 .

[12]  Ben Taskar,et al.  Discriminative Probabilistic Models for Relational Data , 2002, UAI.

[13]  Andrew McCallum,et al.  Introduction to Statistical Relational Learning , 2007 .

[14]  Jennifer Neville,et al.  Iterative Classification in Relational Data , 2000 .

[15]  Jennifer Neville,et al.  Dependency networks for relational data , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[16]  Ben Taskar,et al.  Introduction to Statistical Relational Learning (Adaptive Computation and Machine Learning) , 2007 .

[17]  David D. Jensen,et al.  Autocorrelation and Relational Learning: Challenges and Opportunities , 2004 .

[18]  Jennifer Neville,et al.  Correlation and Sampling in Relational Data Mining , 2001 .

[19]  Micah Adler,et al.  Clustering Relational Data Using Attribute and Link Information , 2003 .

[20]  Jennifer Neville,et al.  Fast Generation of Large Scale Social Networks While Incorporating Transitive Closures , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.

[21]  Jure Leskovec,et al.  The dynamics of viral marketing , 2005, EC '06.

[22]  Dna Word Graphs Grace Hopper Celebration of Women in Computing 2007 , 2007 .

[23]  David W. Aha,et al.  Semi-Supervised Collective Classification via Hybrid Label Regularization , 2012, ICML.

[24]  Jennifer Neville,et al.  On the Mismatch Between Learning and Inference for Single Network Domains , 2012 .

[25]  Ramana Rao Kompella,et al.  Network Sampling via Edge-based Node Selection with Graph Induction , 2011 .

[26]  David D. Jensen,et al.  Spectral Clustering with Links and Attributes , 2004 .

[27]  Ryan A. Rossi,et al.  Modeling the evolution of discussion topics and communication to improve relational classification , 2010, SOMA '10.

[28]  Ramana Rao Kompella,et al.  Time-based sampling of social network activity graphs , 2010, MLG '10.

[29]  Bruce G. Lindsay,et al.  Approximate medians and other quantiles in one pass and with limited memory , 1998, SIGMOD '98.

[30]  A. Kuwadekar Combining Semi-supervised Learning and Relational Resampling for Active Learning in Network Domains , 2010 .

[31]  Jennifer Neville,et al.  Tied Kronecker product graph models to capture variance in network populations , 2010, 2010 48th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[32]  Jennifer Neville,et al.  Pseudolikelihood EM for Within-network Relational Learning , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[33]  Matthew Richardson,et al.  Markov logic networks , 2006, Machine Learning.

[34]  Jennifer Neville,et al.  An Investigation of the Distributional Characteristics of Generative Graph Models , 2009 .

[35]  Jennifer Neville,et al.  Exploiting time-varying relationships in statistical relational models , 2007, WebKDD/SNA-KDD '07.

[36]  Jennifer Neville,et al.  Using Latent Communication Styles to Predict Individual Characteristics , 2012 .

[37]  Zoubin Ghahramani,et al.  Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions , 2003, ICML 2003.

[38]  Andre Wibisono,et al.  Streaming Variational Bayes , 2013, NIPS.

[39]  Christos Faloutsos,et al.  Graphs over time: densification laws, shrinking diameters and possible explanations , 2005, KDD '05.

[40]  Jennifer Neville,et al.  Understanding Propagation Error and Its Effect on Collective Classification , 2011, 2011 IEEE 11th International Conference on Data Mining.

[41]  Jennifer Neville,et al.  A Shrinkage Approach for Modeling Non-stationary Relational Autocorrelation , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[42]  Nils J. Nilsson,et al.  Artificial Intelligence , 1974, IFIP Congress.

[43]  David F. Gleich,et al.  Tall and skinny QR factorizations in MapReduce architectures , 2011, MapReduce '11.

[44]  Jennifer Neville,et al.  Modeling the Variance of Network Populations with Mixed Kronecker Product Graph Models , 2010 .

[45]  Jennifer Neville,et al.  A FRAMEWORK FOR EXPLOITING TEMPORAL VARIATIONS IN RELATIONAL DOMAINS , 2008 .

[46]  Sebastian Thrun,et al.  Text Classification from Labeled and Unlabeled Documents using EM , 2000, Machine Learning.

[47]  G. Loukidis,et al.  SIAM International Conference on Data Mining (SDM) , 2015 .

[48]  Jennifer Neville Structure Learning for Statistical Relational Models , 2005, AAAI.

[49]  Foster Provost,et al.  A Simple Relational Classifier , 2003 .

[50]  Jennifer Neville,et al.  Structured Comparative Analysis of Systems Logs to Diagnose Performance Problems , 2012, NSDI.

[51]  Angelika Foerster,et al.  An R And S Plus Companion To Applied Regression , 2016 .

[52]  Karen Spärck Jones A statistical interpretation of term specificity and its application in retrieval , 2021, J. Documentation.

[53]  Jennifer Neville,et al.  Statistical relational learning for single network domains , 2012 .

[54]  Jennifer Neville,et al.  A Resampling Technique for Relational Data Graphs , 2008 .

[55]  David Jensen,et al.  Data Mining in Social Networks , 2002 .