Listing closed sets of strongly accessible set systems with applications to data

We study the problem of listing all closed sets of a closure operator @s that is a partial function on the power set of some finite ground set E, i.e., @s:F->F with [email protected]?P(E). A very simple divide-and-conquer algorithm is analyzed that correctly solves this problem if and only if the domain of the closure operator is a strongly accessible set system. Strong accessibility is a strict relaxation of greedoids as well as of independence systems. This algorithm turns out to have delay O(|E|(T"F+T"@s+|E|)) and space O(|E|+S"F+S"@s), where T"F, S"F, T"@s, and S"@s are the time and space complexities of checking membership in F and computing @s, respectively. In contrast, we show that the problem becomes intractable for accessible set systems. We relate our results to the data mining problem of listing all support-closed patterns of a dataset and show that there is a corresponding closure operator for all datasets if and only if the set system satisfies a certain confluence property.

[1]  Mihalis Yannakakis,et al.  On Generating All Maximal Independent Sets , 1988, Inf. Process. Lett..

[2]  Hiroki Arimura,et al.  An Efficient Algorithm for Enumerating Closed Patterns in Transaction Databases , 2004, Discovery Science.

[3]  Alex Ferguson,et al.  Diverse Product Recommendations Using an Expressive Language for Case Retrieval , 2002, ECCBR.

[4]  Dino Pedreschi,et al.  Spatiotemporal Data Mining , 2008, Mobility, Data Mining and Privacy.

[5]  Dino Pedreschi,et al.  Mobility, Data Mining and Privacy - Geographic Knowledge Discovery , 2008, Mobility, Data Mining and Privacy.

[6]  László Lovász,et al.  Relations between subclasses of greedoids , 1985, Z. Oper. Research.

[7]  Bernhard Ganter,et al.  Formal Concept Analysis: Mathematical Foundations , 1998 .

[8]  Stefan Wrobel,et al.  Efficient Closed Pattern Mining in Strongly Accessible Set Systems , 2007, MLG.

[9]  Shin-Ichi Nakano,et al.  Constant Time Generation of Trees with Specified Diameter , 2004, WG.

[10]  Eugene L. Lawler,et al.  Generating all Maximal Independent Sets: NP-Hardness and Polynomial-Time Algorithms , 1980, SIAM J. Comput..

[11]  Alain Gély A Generic Algorithm for Generating Closed Sets of a Binary Relation , 2005, ICFCA.

[12]  Nicolas Pasquier,et al.  Efficient Mining of Association Rules Using Closed Itemset Lattices , 1999, Inf. Syst..

[13]  Hiroki Arimura,et al.  Polynomial-Delay and Polynomial-Space Algorithms for Mining Closed Sequences, Graphs, and Pictures in Accessible Set Systems , 2009, SDM.

[14]  B. Ganter,et al.  Finding all closed sets: A general approach , 1991 .