Enabling user-driven rule management in event data analysis

Event data analysis is becoming increasingly of interest to academic researchers looking for patterns in the data. Unlike domain experts working in large companies who have access to IT staff and expensive software infrastructures, researchers find it harder to efficiently manage their event data analysis by themselves. Particularly, user-driven rule management is a challenge especially when analysis rules increase in size and complexity over time. In this paper, we propose an event data analysis platform called EP-RDR intended for non-IT experts that facilitates the evolution of event processing rules according to changing requirements. This platform integrates a rule learning framework called Ripple-Down Rules (RDR) operating in conjunction with an event pattern detection component invoked as a service (EPDaaS). We have built a prototype to demonstrate this solution on real-life scenario involving financial data analysis.

[1]  Richard Murch Project Management: Best Practices for IT Professionals , 2000 .

[2]  Shailender Kumar A Study of Temporal Database Research , 2013 .

[3]  Debbie Richards,et al.  Two decades of Ripple Down Rules research , 2009, The Knowledge Engineering Review.

[4]  Klaus R. Dittrich,et al.  The active database management system manifesto: a rulebase of ADBMS features , 1995, SGMD.

[5]  D. Luckham Event Processing for Business: Organizing the Real-Time Enterprise , 2011 .

[6]  Alessandro Margara,et al.  TESLA: a formally defined event specification language , 2010, DEBS '10.

[7]  Susanne Boll,et al.  MediÆther: an event space for context-aware multimedia experiences , 2003, ETP '03.

[8]  Fethi A. Rabhi,et al.  ADAGE: a framework for supporting user-driven ad-hoc data analysis processes , 2012, Computing.

[9]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[10]  Byeong Ho Kang,et al.  Linked Production Rules: Controlling Inference with Knowledge , 2014, PKAW.

[11]  C. Metz Basic principles of ROC analysis. , 1978, Seminars in nuclear medicine.

[12]  John J. Binder The Event Study Methodology Since 1969 , 1997 .

[13]  Andrew Berry,et al.  Real-Time Analytics for Legacy Data Streams in Health: Monitoring Health Data Quality , 2013, 2013 17th IEEE International Enterprise Distributed Object Computing Conference.

[14]  Tova Milo,et al.  Using Schema Matching to Simplify Heterogeneous Data Translation , 1998, VLDB.

[15]  Andreas Bitterer Magic Quadrant for Data Quality Tools , 2011 .

[16]  Adrian Paschke,et al.  Rule-Based Event Processing and Reaction Rules , 2009, RuleML.

[17]  Jennifer Widom,et al.  The CQL continuous query language: semantic foundations and query execution , 2006, The VLDB Journal.

[18]  Byeong Ho Kang,et al.  Multiple Classification Ripple Down Rules : Evaluation and Possibilities , 2000 .

[19]  Weisi Chen E-research event data quality , 2013, 2013 IEEE 29th International Conference on Data Engineering Workshops (ICDEW).

[20]  K. Mani Chandy,et al.  Event Processing - Designing IT Systems for Agile Companies , 2009 .

[21]  Szabolcs Rozsnyai,et al.  Event-driven rules for sensing and responding to business situations , 2007, DEBS '07.

[22]  Christine Collet,et al.  Composite Events in NAOS , 1996, DEXA.

[23]  J. Cole,et al.  On design and implementation of a contract monitoring facility , 2004, Proceedings. First IEEE International Workshop on Electronic Contracting, 2004..

[24]  Jean Bacon,et al.  Generic Support for Distributed Applications , 2000, Computer.

[25]  Michael Eckert Complex Event Processing with XChangeEQ , 2008 .

[26]  Paul Compton,et al.  Experience with Ripple-Down Rules , 2005, SGAI Conf..

[27]  Yanlei Diao,et al.  SASE: Complex Event Processing over Streams , 2006, ArXiv.

[28]  ChenWeisi,et al.  Enabling user-driven rule management in event data analysis , 2016 .

[29]  Giuseppe Psaila,et al.  Composite Events in Chimera , 1996, EDBT.

[30]  Paul Compton,et al.  Experience with long-term knowledge acquisition , 2011, K-CAP '11.

[31]  Miriam A. M. Capretz,et al.  Data management in cloud environments: NoSQL and NewSQL data stores , 2013, Journal of Cloud Computing: Advances, Systems and Applications.

[32]  David Luckham,et al.  The power of events - an introduction to complex event processing in distributed enterprise systems , 2002, RuleML.

[33]  David J. DeWitt,et al.  NiagaraCQ: a scalable continuous query system for Internet databases , 2000, SIGMOD '00.

[34]  Opher Etzion,et al.  Event Processing in Action , 2010 .

[35]  Opher Etzion,et al.  The Situation Manager Component of Amit - Active Middleware Technology , 2002, NGITS.

[36]  Jennifer Widom,et al.  Towards a streaming SQL standard , 2008, Proc. VLDB Endow..

[37]  Peter Kepplinger,et al.  User-oriented rule management for event-based applications , 2011, DEBS '11.

[38]  Paul Compton,et al.  EMMA: an e-mail management assistant , 2003, IEEE/WIC International Conference on Intelligent Agent Technology, 2003. IAT 2003..

[39]  Fethi A. Rabhi,et al.  An open architecture for event-based analytics , 2016, International Journal of Data Science and Analytics.

[40]  Perfecto Mariño Espiñeira,et al.  An event driven software architecture for enterprise-wide data source integration , 2000, Proceedings International Conference on Information Technology: Coding and Computing (Cat. No.PR00540).

[41]  Fethi A. Rabhi,et al.  Validating an Incremental Rule Management Approach for Financial Market Data Pre-processing , 2014, FinanceCom.

[42]  Fethi A. Rabhi,et al.  An RDR-Based Approach for Event Data Analysis , 2013, ASSRI.

[43]  Klaus R. Dittrich,et al.  Events in an Active Object-Oriented Database System , 1993, Rules in Database Systems.

[44]  P. Compton,et al.  A philosophical basis for knowledge acquisition , 1990 .

[45]  Nenad Stojanovic,et al.  GRUVe: A Methodology for Complex Event Pattern Life Cycle Management , 2010, CAiSE.

[46]  Ivan Bratko,et al.  Prolog Programming for Artificial Intelligence , 1986 .

[47]  G. Vining,et al.  Data Analysis: A Model-Comparison Approach , 1989 .

[48]  Paolo Mancarella,et al.  Production Systems with Negation as Failure , 2002, IEEE Trans. Knowl. Data Eng..

[49]  Calton Pu,et al.  Continual Queries for Internet Scale Event-Driven Information Delivery , 1999, IEEE Trans. Knowl. Data Eng..

[50]  Victor Jupp,et al.  Data Collection and Analysis , 2012, Lean Six Sigma for the Office.

[51]  Paul Compton,et al.  A Simulation Framework for Knowledge Acquisition Evaluation , 2005, ACSC.

[52]  Goodluck Ifijeh,et al.  Information Explosion and University Libraries: Current Trends and Strategies for Intervention , 2010 .

[53]  Timos K. Sellis,et al.  Window Specification over Data Streams , 2006, EDBT Workshops.

[54]  Robert Kajic Evaluation of the Stream Query Language CQL , 2010 .

[55]  Umeshwar Dayal,et al.  The HiPAC project: combining active databases and timing constraints , 1988, SGMD.

[56]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[57]  Bill Appelbe,et al.  eResearch - Paradigm Shift or Propaganda , 2007, J. Res. Pract. Inf. Technol..

[58]  Rachel Schutt,et al.  Doing Data Science , 2013 .

[59]  L. Venkata Subramaniam,et al.  A Knowledge Acquisition Method for Improving Data Quality in Services Engagements , 2010, 2010 IEEE International Conference on Services Computing.

[60]  Sharma Chakravarthy,et al.  Snoop: An Expressive Event Specification Language for Active Databases , 1994, Data Knowl. Eng..

[61]  Julien Vayssière,et al.  An Incremental Knowledge Acquisition Method for Improving Duplicate Invoices Detection , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[62]  Chung-Sheng Li Real-time event driven architecture for activity monitoring and early warning , 2005, Conference, Emerging Information Technology 2005..

[63]  Yanlei Diao,et al.  High-performance complex event processing over streams , 2006, SIGMOD Conference.

[64]  Fethi A. Rabhi,et al.  A data model for processing financial market and news data , 2009 .

[65]  Beatriz de la Iglesia,et al.  Clustering Rules: A Comparison of Partitioning and Hierarchical Clustering Algorithms , 2006, J. Math. Model. Algorithms.

[66]  Paul Compton,et al.  Knowledge in Context: A Strategy for Expert System Maintenance , 1990, Australian Joint Conference on Artificial Intelligence.

[67]  Roger S. Barga,et al.  Event Correlation and Pattern Detection in CEDR , 2006, EDBT Workshops.

[68]  Narain H. Gehani,et al.  Event specification in an active object-oriented database , 1992, SIGMOD '92.

[69]  Tony Hey,et al.  The Fourth Paradigm: Data-Intensive Scientific Discovery , 2009 .

[70]  E. F. CODD,et al.  A relational model of data for large shared data banks , 1970, CACM.

[71]  L. Venkata Subramaniam,et al.  Data Cleansing Techniques for Large Enterprise Datasets , 2011, 2011 Annual SRII Global Conference.

[72]  P McFedries The coming data deluge [Technically Speaking] , 2011 .

[73]  Gang Chen,et al.  An event and service interacting model and event detection based on the broker/service model , 2001, Proceedings of the Sixth International Conference on Computer Supported Cooperative Work in Design (IEEE Cat. No.01EX472).

[74]  Annika Hinze,et al.  Event-based applications and enabling technologies , 2009, DEBS '09.