Category-Driven Association Rule Mining

The quality of rules generated by ontology-driven association rule mining algorithms is constrained by the algorithm’s effectiveness in exploiting the usually large ontology in the mining process. We present a framework built around superimposing a hierarchical graph structure on a given ontology to divide the rule mining problem into disjoint subproblems whose solutions can be iteratively joined to find global associations. We present a new metric for evaluating the interestingness of generated rules based on where their constructs fall within the ontology. Our metric is anti-monotonic on subsets, making it usable in an Apriori-like algorithm which we present here. The algorithm categorises the ontology into disjoint subsets utilising the hierarchical graph structure and uses the metric to find associations in each, joining the results using the guidance of anti-monotonicity. The algorithm optionally embeds built-in definitions of user-specified filters to reflect user preferences. We evaluate the resulting model using a large collection of patient health records.

[1]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[2]  Nicolette de Keizer,et al.  Forty years of SNOMED: a literature review , 2008, BMC Medical Informatics Decis. Mak..

[3]  Ramakrishnan Srikant,et al.  Mining Association Rules with Item Constraints , 1997, KDD.

[4]  K. Raghava Rao,et al.  Interactive Post Mining Association Rules using Cost Complexity Pruning and Ontologies KDD , 2013 .

[5]  Pawel Garbacz,et al.  A metaontology for applied ontology , 2013, Appl. Ontology.

[6]  K. Baclawski,et al.  The Open Ontology Repository Initiative : Requirements and Research Challenges , 2009 .

[7]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[8]  Amedeo Napoli,et al.  First Elements on Knowledge Discovery Guided by Domain Knowledge (KDDK) , 2006, CLA.

[9]  Shamkant B. Navathe,et al.  An Efficient Algorithm for Mining Association Rules in Large Databases , 1995, VLDB.

[10]  Bin Chen,et al.  An Algorithm for Constrained Association Rule Mining in Semi-structured Data , 1999, PAKDD.

[11]  Hong-Gee Kim,et al.  Query-based association rule mining supporting user perspective , 2011, Computing.

[12]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[13]  Javier González,et al.  Hierarchical graph search for mobile robot path planning , 1998, Proceedings. 1998 IEEE International Conference on Robotics and Automation (Cat. No.98CH36146).

[14]  Amedeo Napoli,et al.  Knowledge-Based Selection of Association Rules for Text Mining , 2004, ECAI.

[15]  Heinrich Herre,et al.  A Meta-ontological Architecture for Foundational Ontologies , 2005, OTM Conferences.

[16]  Amedeo Napoli,et al.  Towards a text mining methodology using association rule extraction , 2006, Soft Comput..

[17]  Fabrice Guillet,et al.  Knowledge-Based Interactive Postmining of Association Rules Using Ontologies , 2010, IEEE Transactions on Knowledge and Data Engineering.