Meta-association rules for mining interesting associations in multiple datasets

Graphical abstractProcess flow: from original datasets to final meta-association rules. The process starts from a set of databases {D1, , Dk} which share some of their content, i.e. they have attributes in common. After applying a rule extraction procedure, we obtain k sets of association rules represented by Ri. We are interested in searching associations between the already extracted rules in the sets Ri. For achieving this, we create a meta-database D collecting the information. We propose two different ways, by considering only presence-absence of rules (crisp meta-database, D) or taking into account their reliability represented by a degree in the unit interval (fuzzy meta-database, D). We can also introduce additional information into the process by adding new attributes about the original datasets Di. After this the so-called meta-association rules are mined. Examples of meta-association rules are depicted in the right part of the figure relating the primary rules and the added attributes. The paper explains and compares both proposals (crisp and fuzzy), proposes a level-based mining algorithm using the RL-theory for the representation of fuzziness and makes some experimentation with synthetic and real data. Display Omitted HighlightsMeta-association rules are extracted from regular rules and contextual information.Meta-rules discover which associations are more frequent in multiple datasets.Fuzzy and non-fuzzy approaches are described.The algorithm proposed avoids information loss and supports data imprecision.Fuzzy algorithm allows parameters fine-tuning with acceptable execution time. Association rules have been widely used in many application areas to extract new and useful information expressed in a comprehensive way for decision makers from raw data. However, raw data may not always be available, it can be distributed in multiple datasets and therefore there resulting number of association rules to be inspected is overwhelming. In the light of these observations, we propose meta-association rules, a new framework for mining association rules over previously discovered rules in multiple databases. Meta-association rules are a new tool that convey new information from the patterns extracted from multiple datasets and give a summarized representation about most frequent patterns. We propose and compare two different algorithms based respectively on crisp rules and fuzzy rules, concluding that fuzzy meta-association rules are suitable to incorporate to the meta-mining procedure the obtained quality assessment provided by the rules in the first step of the process, although it consumes more time than the crisp approach. In addition, fuzzy meta-rules give a more manageable set of rules for its posterior analysis and they allow the use of fuzzy items to express additional knowledge about the original databases. The proposed framework is illustrated with real-life data about crime incidents in the city of Chicago. Issues such as the difference with traditional approaches are discussed using synthetic data.

[1]  Daniel Sánchez,et al.  A New Framework to Assess Association Rules , 2001, IDA.

[2]  Ken Kaneiwa,et al.  A rough set approach to multiple dataset analysis , 2011, Appl. Soft Comput..

[3]  Kilian Stoffel,et al.  From Temporal Rules to Temporal Meta-rules , 2004, DaWaK.

[4]  Eyke Hüllermeier Fuzzy Association Rules: Semantic Issues and Quality Measures , 2001, Fuzzy Days.

[5]  Man Hon Wong,et al.  Mining fuzzy association rules in databases , 1998, SGMD.

[6]  Attila Gyenesei,et al.  A Fuzzy Approach for Mining Quantitative Association Rules , 2000, Acta Cybern..

[7]  Daniel Sánchez,et al.  Measuring the accuracy and interest of association rules: A new framework , 2002, Intell. Data Anal..

[8]  John F. Roddick,et al.  Higher order mining , 2008, SKDD.

[9]  José L. Balcázar,et al.  Evaluation of Association Rule Quality Measures through Feature Extraction , 2013, IDA.

[10]  Wu Jian,et al.  An Efficient Association Rule Mining Algorithm In Distributed Databases , 2008, First International Workshop on Knowledge Discovery and Data Mining (WKDD 2008).

[11]  Daniel Sánchez,et al.  Meta-association rules for fusing regular association rules from different databases , 2014, 17th International Conference on Information Fusion (FUSION).

[12]  Yannis Manolopoulos,et al.  Mining association rules in very large clustered domains , 2007, Inf. Syst..

[13]  Edward H. Shortliffe,et al.  A model of inexact reasoning in medicine , 1990 .

[14]  Thomas Sudkamp,et al.  Examples, counterexamples, and measuring fuzzy associations , 2005, Fuzzy Sets Syst..

[15]  Alexandre Villeminot,et al.  Combined use of association rules mining and clustering methods to find relevant links between binary rare attributes in a large data set , 2007, Comput. Stat. Data Anal..

[16]  Daniel Sánchez,et al.  Fuzzy association rules: general model and applications , 2003, IEEE Trans. Fuzzy Syst..

[17]  Chris Cornelis,et al.  Elicitation of fuzzy association rules from positive and negative examples , 2005, Fuzzy Sets Syst..

[18]  Vikram Pudi,et al.  FAR-miner: a fast and efficient algorithm for fuzzy association rule mining , 2012, Int. J. Bus. Intell. Data Min..

[19]  Eyke Hüllermeier,et al.  A systematic approach to the assessment of fuzzy association rules , 2006, Data Mining and Knowledge Discovery.

[20]  John Wang,et al.  Data Warehousing and Mining: Concepts, Methodologies, Tools, and Applications , 2008 .

[21]  Alvis Cheuk M. Fong,et al.  Discovering diverse association rules from multidimensional schema , 2013, Expert Syst. Appl..

[22]  Wai-Ho Au,et al.  FARM: a data mining system for discovering fuzzy association rules , 1999, FUZZ-IEEE'99. 1999 IEEE International Fuzzy Systems. Conference Proceedings (Cat. No.99CH36315).

[23]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[24]  Jaideep Srivastava,et al.  Selecting the right interestingness measure for association patterns , 2002, KDD.

[25]  M. Amparo Vila,et al.  Fuzzy Meta-Association Rules , 2015, IFSA-EUSFLAT.

[26]  Manoj Kumar Shukla,et al.  Removal of Duplicate Rules for Association Rule Mining from Multilevel Dataset , 2015 .

[27]  David Wai-Lok Cheung,et al.  Efficient Mining of Association Rules in Distributed Databases , 1996, IEEE Trans. Knowl. Data Eng..

[28]  Xindong Wu,et al.  Discovering Relational Patterns across Multiple Databases , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[29]  Daniel Sánchez,et al.  Studying Interest Measures for Association Rules through a Logical Model , 2010, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[30]  Ruoming Jin,et al.  Shared memory parallelization of data mining algorithms: techniques, programming interface, and performance , 2005, IEEE Transactions on Knowledge and Data Engineering.

[31]  Guoqing Chen,et al.  Fuzzy association rules and the extended mining algorithms , 2002, Inf. Sci..

[32]  Reza Sheibani,et al.  Two Efficient Algorithms for Mining Fuzzy AssociationRules , 2011 .

[33]  M. Dolores Ruiz,et al.  New Approaches for Discovering Exception and Anomalous Rules , 2011, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[34]  Ruoming Jin,et al.  Systematic Approach for Optimizing Complex Mining Tasks on Multiple Databases , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[35]  Pradeep Yadav,et al.  An Efficient Approach of Association Rule Mining on Distributed Database Algorithm , 2013 .

[36]  W. Marsden I and J , 2012 .

[37]  Siu Cheung Hui,et al.  An Effective Clustering-based Approach for Conceptual Association Rules Mining , 2009, 2009 IEEE-RIVF International Conference on Computing and Communication Technologies.

[38]  Daniel Sánchez,et al.  On a non-nested level-based representation of fuzziness , 2012, Fuzzy Sets Syst..

[39]  Keith C. C. Chan,et al.  Mining changes in association rules: a fuzzy approach , 2005, Fuzzy Sets Syst..

[40]  Daniel Sánchez,et al.  A formal model for mining fuzzy rules using the RL representation theory , 2011, Inf. Sci..

[41]  Jiawei Han,et al.  Meta-Rule-Guided Mining of Association Rules in Relational Databases , 1995, KDOOD/TDOOD.

[42]  Nicolás Marín,et al.  A Complexity Guided Algorithm for Association Rule Extraction on Fuzzy DataCubes , 2008, IEEE Transactions on Fuzzy Systems.

[43]  Chien-Hua Wang,et al.  Mining association rules uses fuzzy weighted FP-growth , 2012, The 6th International Conference on Soft Computing and Intelligent Systems, and The 13th International Symposium on Advanced Intelligence Systems.