A Frequent Pattern Conjunction Heuristic for Rule Generation in Data Streams

This paper introduces a new and expressive algorithm for inducing descriptive rule-sets from streaming data in real-time in order to describe frequent patterns explicitly encoded in the stream. Data Stream Mining (DSM) is concerned with the automatic analysis of data streams in real-time. Rapid flows of data challenge the state-of-the art processing and communication infrastructure, hence the motivation for research and innovation into real-time algorithms that analyse data streams on-the-fly and can automatically adapt to concept drifts. To date, DSM techniques have largely focused on predictive data mining applications that aim to forecast the value of a particular target feature of unseen data instances, answering questions such as whether a credit card transaction is fraudulent or not. A real-time, expressive and descriptive Data Mining technique for streaming data has not been previously established as part of the DSM toolkit. This has motivated the work reported in this paper, which has resulted in developing and validating a Generalised Rule Induction (GRI) tool, thus producing expressive rules as explanations that can be easily understood by human analysts. The expressiveness of decision models in data streams serves the objectives of transparency, underpinning the vision of ‘explainable AI’ and yet is an area of research that has attracted less attention despite being of high practical importance. The algorithm introduced and described in this paper is termed Fast Generalised Rule Induction (FGRI). FGRI is able to induce descriptive rules incrementally for raw data from both categorical and numerical features. FGRI is able to adapt rule-sets to changes of the pattern encoded in the data stream (concept drift) on the fly as new data arrives and can thus be applied continuously in real-time. The paper also provides a theoretical, qualitative and empirical evaluation of FGRI.

[1]  William Nick Street,et al.  A streaming ensemble algorithm (SEA) for large-scale classification , 2001, KDD '01.

[2]  S. Muthukrishnan,et al.  Data streams: algorithms and applications , 2005, SODA '03.

[3]  Johannes Fürnkranz,et al.  Foundations of Rule Learning , 2012, Cognitive Technologies.

[4]  Bartosz Krawczyk,et al.  Evolving rule-based classifiers with genetic programming on GPUs for drifting data streams , 2019, Pattern Recognit..

[5]  Leandro L. Minku,et al.  A heterogeneous online learning ensemble for non-stationary environments , 2020, Knowl. Based Syst..

[6]  E. Mohanraj,et al.  A novel fuzzy association rule for efficient data mining of ubiquitous real-time data , 2020, J. Ambient Intell. Humaniz. Comput..

[7]  Fernando Gomide,et al.  Optimal Rule-Based Granular Systems From Data Streams , 2020, IEEE Transactions on Fuzzy Systems.

[8]  David Haussler,et al.  Decision Theoretic Generalizations of the PAC Model for Neural Net and Other Learning Applications , 1992, Inf. Comput..

[9]  Sean Louis Alan Floyd Semi-Supervised Hybrid Windowing Ensembles for Learning from Evolving Streams , 2019 .

[10]  Aoying Zhou,et al.  Density-Based Clustering over an Evolving Data Stream with Noise , 2006, SDM.

[11]  Marko Bohanec,et al.  DEX: An Expert System Shell for Decision Support • , 1990 .

[12]  João Gama,et al.  Learning Decision Rules from Data Streams , 2011, IJCAI.

[13]  Mohamed Medhat Gaber,et al.  EACD: evolutionary adaptation to concept drifts in data streams , 2019, Data Mining and Knowledge Discovery.

[14]  Cynthia Rudin,et al.  Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead , 2018, Nature Machine Intelligence.

[15]  Heikki Mannila,et al.  Fast Discovery of Association Rules , 1996, Advances in Knowledge Discovery and Data Mining.

[16]  A. Bifet,et al.  Early Drift Detection Method , 2005 .

[17]  Ira Assent,et al.  The ClusTree: indexing micro-clusters for anytime stream mining , 2011, Knowledge and Information Systems.

[18]  Mohamed Medhat Gaber,et al.  eRules: A Modular Adaptive Classification Rule Learning Algorithm for Data Streams , 2012, SGAI Conf..

[19]  Mohamed Medhat Gaber,et al.  A rule dynamics approach to event detection in Twitter with its application to sports and politics , 2016, Expert Syst. Appl..

[20]  Latifur Khan,et al.  IoT Big Data Stream Mining , 2016, KDD.

[21]  Andrew W. Moore,et al.  Hoeffding Races: Accelerating Model Selection Search for Classification and Function Approximation , 1993, NIPS.

[22]  Unil Yun,et al.  Advanced approach of sliding window based erasable pattern mining with list structure of industrial fields , 2019, Inf. Sci..

[23]  Wynne Hsu,et al.  Integrating Classification and Association Rule Mining , 1998, KDD.

[24]  Geoff Hulten,et al.  Mining high-speed data streams , 2000, KDD '00.

[25]  Unil Yun,et al.  Efficient High Utility Pattern Mining for Establishing Manufacturing Plans With Sliding Window Control , 2017, IEEE Transactions on Industrial Electronics.

[26]  Katarzyna Musial,et al.  Next challenges for adaptive learning systems , 2012, SKDD.

[27]  Stefan Schliebs,et al.  Evolving spiking neural network—a survey , 2013, Evolving Systems.

[28]  Giuseppe Di Fatta,et al.  On expressiveness and uncertainty awareness in rule-based classification for data streams , 2017, Neurocomputing.

[29]  Ludmila I. Kuncheva,et al.  A framework for generating data to simulate changing environments , 2007, Artificial Intelligence and Applications.

[30]  Jean Paul Barddal,et al.  SFNClassifier: a scale-free social network method to handle concept drift , 2014, SAC.

[31]  Saso Dzeroski,et al.  Learning model trees from evolving data streams , 2010, Data Mining and Knowledge Discovery.

[32]  Unil Yun,et al.  Efficient Approach for Damped Window-Based High Utility Pattern Mining With List Structure , 2020, IEEE Access.

[33]  Max Bramer,et al.  An Information-Theoretic Approach to the Pre-pruning of Classification Rules , 2002, Intelligent Information Processing.

[34]  João Gama,et al.  Learning with Drift Detection , 2004, SBIA.

[35]  Rodrigo Fernandes de Mello,et al.  On learning guarantees to unsupervised concept drift detection on data streams , 2019, Expert Syst. Appl..

[36]  W. Hoeffding Probability Inequalities for sums of Bounded Random Variables , 1963 .

[37]  D. Altman,et al.  Statistics notes: The normal distribution , 1995, BMJ.

[38]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[39]  D. Haussler,et al.  Boolean Feature Discovery in Empirical Learning , 1990, Machine Learning.

[40]  João Gama,et al.  On evaluating stream learning algorithms , 2012, Machine Learning.

[41]  M. Olave,et al.  Chapter 10 : An application for admission in public school systems ∗ , 2006 .

[42]  Christian Borgelt,et al.  Frequent item set mining , 2012, WIREs Data Mining Knowl. Discov..

[43]  Ruoming Jin,et al.  An algorithm for in-core frequent itemset mining on streaming data , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[44]  Detlef Nauck,et al.  A Rule Induction Approach to Forecasting Critical Alarms in a Telecommunication Network , 2019, 2019 International Conference on Data Mining Workshops (ICDMW).

[45]  Jie Chen,et al.  Explainable Neural Networks based on Additive Index Models , 2018, ArXiv.

[46]  Geoff Holmes,et al.  New ensemble methods for evolving data streams , 2009, KDD.

[47]  Alan C. Elliott,et al.  Statistical Analysis Quick Reference Guidebook: With SPSS Examples , 2006 .

[48]  Hamido Fujita,et al.  Damped window based high average utility pattern mining over data streams , 2017, Knowl. Based Syst..

[49]  Junming Shao,et al.  Exploiting evolving micro-clusters for data stream classification with emerging class detection , 2020, Inf. Sci..

[50]  Ryszard S. Michalski,et al.  On the Quasi-Minimal Solution of the General Covering Problem , 1969 .

[51]  Javier Del Ser,et al.  Evolving Spiking Neural Networks for online learning over drifting data streams , 2018, Neural Networks.

[52]  Matthias Carnein,et al.  evoStream - Evolutionary Stream Clustering Utilizing Idle Times , 2018, Big Data Res..

[53]  Ricard Gavaldà,et al.  Learning from Time-Changing Data with Adaptive Windowing , 2007, SDM.

[54]  Gerhard Widmer,et al.  Learning in the Presence of Concept Drift and Hidden Contexts , 1996, Machine Learning.

[55]  Yun Sing Koh,et al.  Adaptive Self-Sufficient Itemset Miner for Transactional Data Streams , 2019, PRICAI.

[56]  Xiaofeng Zhou,et al.  An Intrusion Detection Method Based on Damped Window of Data Stream Clustering , 2017, 2017 9th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC).

[57]  Philip S. Yu,et al.  A Framework for Clustering Evolving Data Streams , 2003, VLDB.

[58]  Padraig Cunningham,et al.  A Comparison of Ensemble and Case-Base Maintenance Techniques for Handling Concept Drift in Spam Filtering , 2006, FLAIRS.

[59]  Atta Badii,et al.  Real-time feature selection technique with concept drift detection using adaptive micro-clusters for data stream mining , 2018, Knowl. Based Syst..

[60]  Ludmila I. Kuncheva,et al.  Classifier Ensembles for Changing Environments , 2004, Multiple Classifier Systems.

[61]  Geoff Hulten,et al.  Mining time-changing data streams , 2001, KDD '01.

[62]  Tzung-Pei Hong,et al.  Generalizing Concept-Drift Patterns for Fuzzy Association Rules , 2018, J. Netw. Intell..

[63]  Charu C. Aggarwal,et al.  Data Streams - Models and Algorithms , 2014, Advances in Database Systems.

[64]  Yue Tan,et al.  Synchronization-based clustering on evolving data stream , 2019, Inf. Sci..

[65]  Junho Shim,et al.  Efficient Data Stream Clustering With Sliding Windows Based on Locality-Sensitive Hashing , 2018, IEEE Access.

[66]  Plamen Angelov,et al.  Towards Explainable Deep Neural Networks (xDNN) , 2019, Neural Networks.

[67]  Thomas Seidl,et al.  MOA: Massive Online Analysis, a Framework for Stream Classification and Clustering , 2010, WAPA.

[68]  Li Su,et al.  A New Classification Algorithm for Data Stream , 2011 .

[69]  Jennifer Widom,et al.  Models and issues in data stream systems , 2002, PODS.

[70]  A. Plastino,et al.  Unravelling the size distribution of social groups with information theory in complex networks , 2009, 0905.3704.

[71]  Johannes Fürnkranz,et al.  ROC ‘n’ Rule Learning—Towards a Better Understanding of Covering Algorithms , 2005, Machine Learning.