Revisiting Numerical Pattern Mining with Formal Concept Analysis

We investigate the problem of mining numerical data with Formal Concept Analysis. The usual way is to use a scaling procedure -transforming numerical attributes into binary ones- leading either to a loss of information or of efficiency, in particular w.r.t. the volume of extracted patterns. By contrast, we propose to directlywork on numerical data in a more precise and efficient way. For that, the notions of closed patterns, generators and equivalent classes are revisited in the numerical context. Moreover, two algorithms are proposed and tested in an evaluation involving real-world data, showing the quality of the present approach.

[1]  Vladimir Gurvich,et al.  An Intersection Inequality for Discrete Distributions and Related Generation Problems , 2003, ICALP.

[2]  Jinyan Li,et al.  Positive Borders or Negative Borders: How to Make Lossless Generator Based Representations Concise , 2006, SDM.

[3]  Bernhard Ganter,et al.  Pattern Structures and Their Projections , 2001, ICCS.

[4]  Bernhard Ganter,et al.  Formal Concept Analysis , 2013 .

[5]  L. Beran,et al.  [Formal concept analysis]. , 1996, Casopis lekaru ceskych.

[6]  Sergei O. Kuznetsov,et al.  Comparing performance of algorithms for generating concept lattices , 2002, J. Exp. Theor. Artif. Intell..

[7]  AgrawalRakesh,et al.  Mining quantitative association rules in large relational tables , 1996 .

[8]  Gerd Stumme,et al.  Computing iceberg concept lattices with T , 2002, Data Knowl. Eng..

[9]  Jian Pei,et al.  Minimum Description Length Principle: Generators Are Preferable to Closed Patterns , 2006, AAAI.

[10]  Ramakrishnan Srikant,et al.  Mining quantitative association rules in large relational tables , 1996, SIGMOD '96.

[11]  Hiroki Arimura,et al.  LCM ver. 2: Efficient Mining Algorithms for Frequent/Closed/Maximal Itemsets , 2004, FIMI.

[12]  Toon Calders,et al.  Depth-First Non-Derivable Itemset Mining , 2005, SDM.

[13]  Amedeo Napoli,et al.  Pattern Mining in Numerical Data: Extracting Closed Patterns and their Generators , 2010 .

[14]  Gerd Stumme,et al.  Mining frequent patterns with counting inference , 2000, SKDD.

[15]  StummeGerd,et al.  Computing iceberg concept lattices with TITANIC , 2002 .

[16]  Amedeo Napoli,et al.  Mining gene expression data with pattern structures in formal concept analysis , 2011, Inf. Sci..