Stream Mining Using Statistical Relational Learning

Stream mining has gained popularity in recent years due to the availability of numerous data streams from sources such as social media and sensor networks. Data mining on such continuous streams possess a variety of challenges including concept drift and unbounded stream length. Traditional data mining approaches to these problems have difficulty incorporating relational domain knowledge and feature relationships, which can be used to improve the accuracy of a classifier. In this work, we model large data streams using statistical relational learning techniques for classification, in particular, we use a Markov Logic Network to capture relational features in structured data and show that this approach performs better for supervised learning than current state-of-the-art approaches. Additionally, we evaluate our approach with semi-supervised learning scenarios, where class labels are only partially available during training.

[1]  Pedro M. Domingos,et al.  Efficient Weight Learning for Markov Logic Networks , 2007, PKDD.

[2]  Philip S. Yu,et al.  Mining concept-drifting data streams using ensemble classifiers , 2003, KDD '03.

[3]  Matthew Richardson,et al.  Markov logic networks , 2006, Machine Learning.

[4]  spacercece,et al.  Evaluating Markov Logic Networks for Collective Classification , 2011 .

[5]  Matthew Richardson,et al.  The Alchemy System for Statistical Relational AI: User Manual , 2007 .

[6]  Pedro M. Domingos,et al.  Sound and Efficient Inference with Probabilistic and Deterministic Dependencies , 2006, AAAI.

[7]  Geoff Holmes,et al.  MOA: Massive Online Analysis , 2010, J. Mach. Learn. Res..

[8]  Jerzy Stefanowski,et al.  Reacting to Different Types of Concept Drift: The Accuracy Updated Ensemble Algorithm , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[9]  Dan Roth,et al.  On the Hardness of Approximate Reasoning , 1993, IJCAI.

[10]  Latifur Khan,et al.  Novel Class Detection and Feature via a Tiered Ensemble Approach for Stream Mining , 2012, 2012 IEEE 24th International Conference on Tools with Artificial Intelligence.

[11]  Lilyana Mihalkova,et al.  Structure Selection from Streaming Relational Data , 2011, ArXiv.

[12]  Pedro M. Domingos,et al.  Hybrid Markov Logic Networks , 2008, AAAI.

[13]  Gerhard Widmer,et al.  Effective Learning in Dynamic Environments by Explicit Context Tracking , 1993, ECML.

[14]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.