Causal Prediction of Top-k Event Types Over Real-Time Event Streams

This paper addresses the problem of causally predicting the top-k most likely next events over real-time event streams. Existing approaches have limitations—(i) they model causality in an acyclic causal network structure and search it to find the top-k next events, which does not work with real world event streams as they frequently manifest cyclic causality, and (ii) they prune out possible non-causal links from a causal network too aggressively and end up omitting many less frequent yet important causal links. We overcome these limitations using a novel event precedence model (EPM) and a run-time causal inference mechanism. The EPM constructs a Markov chain incrementally over event streams, where an edge between two events signifies a temporal precedence relationship between them, which is a necessary condition for causality. Then, the run-time causal inference mechanism performs causality tests on the EPM during query processing, and temporal precedence relationships that fail the causality test in the presence of other events are removed. Two query processing algorithms are presented. One performs exhaustive search on the model and the other performs more efficient reduced search with early termination. Experiments using two real data sets (cascading blackouts in power systems and web page views) verify efficacy and efficiency of the proposed probabilistic top-k prediction algorithms.

[1]  W. Wong,et al.  Learning Causal Bayesian Network Structures From Experimental Data , 2008 .

[2]  S. Stanley Young,et al.  Deming, data and observational studies , 2011 .

[3]  T. J. Breen,et al.  Biostatistical Analysis (2nd ed.). , 1986 .

[4]  E. Frías-Martínez A Prediction Model for User Access Sequences , 2002 .

[5]  Michael I. Jordan Learning in Graphical Models , 1999, NATO ASI Series.

[6]  Sergey I. Nikolenko,et al.  Directed Cycles in Bayesian Belief Networks: Probabilistic Semantics and Consistency Checking Complexity , 2005, MICAI.

[7]  Jim E. Greer,et al.  A Comparison of Association Rule Discovery and Bayesian Network Causal Inference Algorithms to Discover Relationships in Discrete Data , 2000, Canadian Conference on AI.

[8]  Isabelle Guyon,et al.  Design and Analysis of the Causation and Prediction Challenge , 2008, WCCI Causation and Prediction Challenge.

[9]  Tze-Yun Leong,et al.  Active Learning for Causal Bayesian Network Structure with Non-symmetrical Entropy , 2009, PAKDD.

[10]  David A. Bell,et al.  Learning Bayesian networks from data: An information-theory based approach , 2002, Artif. Intell..

[11]  Byung Suk Lee,et al.  Fast Causal Network Inference over Event Streams , 2013, DaWaK.

[12]  Toyoaki Nishida,et al.  Mining Causal Relationships in Multidimensional Time Series , 2010, Smart Information and Knowledge Management.

[13]  Rajeev Motwani,et al.  Scalable Techniques for Mining Causal Structures , 1998, Data Mining and Knowledge Discovery.

[14]  Catarina Sismeiro,et al.  A Model of Web Site Browsing Behavior Estimated on Clickstream Data , 2003 .

[15]  Le Gruenwald,et al.  Microarray gene expression data association rules mining based on BSC-tree and FIS-tree , 2005, Data Knowl. Eng..

[16]  I. Dobson,et al.  Risk Assessment of Cascading Outages: Methodologies and Challenges , 2012, IEEE Transactions on Power Systems.

[17]  Gerhard Weikum,et al.  Efficient top-k querying over social-tagging networks , 2008, SIGIR '08.

[18]  S. Kullback,et al.  Information Theory and Statistics , 1959 .

[19]  Stephen E. Fienberg,et al.  Discrete Multivariate Analysis: Theory and Practice , 1976 .

[20]  David Heckerman,et al.  A Bayesian Approach to Learning Causal Networks , 1995, UAI.

[21]  Zina M. Ibrahim,et al.  Biological Knowledge Discovery Handbook: Preprocessing, Mining and Postprocessing of Biological Data , 2013 .

[22]  Ross D. Shachter Evidence Absorption and Propagation through Evidence Reversals , 2013, UAI.

[23]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[24]  Nevin L. Zhang,et al.  A simple approach to Bayesian network computations , 1994 .

[25]  Luis M. de Campos,et al.  A Scoring Function for Learning Bayesian Networks based on Mutual Information and Conditional Independence Tests , 2006, J. Mach. Learn. Res..

[26]  Karl Aberer,et al.  The gist of everything new: personalized top-k processing over web 2.0 streams , 2010, CIKM.

[27]  M. A. Kłopotek,et al.  Cyclic Bayesian Network : Markov Process Approach , 2006 .

[28]  Benjamin M. Rottman,et al.  Reasoning about causal relationships: Inferences on causal networks. , 2014, Psychological bulletin.

[29]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[30]  Lin Lu,et al.  Mining Significant Usage Patterns from Clickstream Data , 2005, WEBKDD.

[31]  David J. C. Mackay,et al.  Introduction to Monte Carlo Methods , 1998, Learning in Graphical Models.

[32]  Cynthia Rudin,et al.  Sequential Event Prediction with Association Rules , 2011, COLT.

[33]  Paul Hines,et al.  A “Random Chemistry” Algorithm for Identifying Collections of Multiple Contingencies That Initiate Cascading Failure , 2012, IEEE Transactions on Power Systems.

[34]  Michael A. Rodriguez,et al.  Clickstream Data Yields High-Resolution Maps of Science , 2009, PloS one.

[35]  C. Glymour Learning, prediction and causal Bayes nets , 2003, Trends in Cognitive Sciences.

[36]  J. Pearl Causal diagrams for empirical research , 1995 .

[37]  P. Spirtes,et al.  From probability to causality , 1991 .

[38]  M. Tamer Özsu,et al.  A Web page prediction model based on click-stream tree representation of user behavior , 2003, KDD '03.

[39]  Wagner Meira,et al.  Learning to rank at query-time using association rules , 2008, SIGIR '08.

[40]  Eli Upfal,et al.  Database-support for continuous prediction queries over streaming data , 2010, Proc. VLDB Endow..

[41]  Lawrence J. Mazlack MINING CAUSALITY FROM IMPERFECT DATA , 2004 .

[42]  Bernard Manderick,et al.  Learning Causal Bayesian Networks from Observations and Experiments: A Decision Theoretic Approach , 2006, MDAI.

[43]  David Maxwell Chickering,et al.  Learning Equivalence Classes of Bayesian Network Structures , 1996, UAI.

[44]  Jian Cheng,et al.  Confidence Inference in Bayesian Networks , 2001, UAI.

[45]  Jian Cheng,et al.  AIS-BN: An Adaptive Importance Sampling Algorithm for Evidential Reasoning in Large Bayesian Networks , 2000, J. Artif. Intell. Res..

[46]  Nevin Lianwen Zhang,et al.  Exploiting Causal Independence in Bayesian Network Inference , 1996, J. Artif. Intell. Res..

[47]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[48]  Max Henrion,et al.  Search-Based Methods to Bound Diagnostic Probabilities in Very Large Belief Nets , 1991, UAI.

[49]  J. Pearl Graphs, Causality, and Structural Equation Models , 1998 .

[50]  Tsau Young Lin,et al.  Data Mining: Foundations and Practice , 2008, Data Mining: Foundations and Practice.

[51]  Richard Scheines,et al.  Causation, Prediction, and Search, Second Edition , 2000, Adaptive computation and machine learning.

[52]  Arnaud Doucet,et al.  An adaptive sequential Monte Carlo method for approximate Bayesian computation , 2011, Statistics and Computing.

[53]  Solomon Kullback,et al.  Information Theory and Statistics , 1970, The Mathematical Gazette.

[54]  Ngoc Thanh Nguyen,et al.  Smart Information and Knowledge Management: Advances, Challenges, and Critical Issues , 2010, Smart Information and Knowledge Management.

[55]  Kannan Srinivasan,et al.  Modeling Online Browsing and Path Analysis Using Clickstream Data , 2004 .