A Restriction-Based Approach to Generalizations

Generalizations, also known as contrast patterns, are at the core of many learning systems. A key component in finding generalizations automatically is the predicate used to select the most important ones. These predicates are usually formed from restrictions that every generalization must fulfill. Previous studies have mainly focused on the types of generalizations, each one associated with a particular predicate. In this paper, we shift the focus from predicates to restrictions. Restrictions are analyzed based on the set of intuitions that they materialize. Additionally, an analysis of the restrictions used in a large collection of existing generalizations suggests interesting conclusions.
