Closed and Maximal Tree Mining Using Natural Representations

Closure-based mining on purely relational data, that is, itemset mining, is, by now, well-established, and there are interesting algorithmic developments. Sharing some of the attractive features of frequency-based summarization of subsets, it offers an alternative view with both downsides and advantages; among the latter, there are the facts that, first, by imposing closure, the number of frequent sets is heavily reduced and, second, the possibility appears of developing a mathematical foundation that connects closure-based mining with lattice-theoretic approaches like Formal Concept Analysis.

[1]  Yun Chi,et al.  Mining Closed and Maximal Frequent Subtrees from Databases of Labeled Rooted Trees , 2005, IEEE Trans. Knowl. Data Eng..

[2]  Gabriel Valiente,et al.  Algorithms on Trees and Graphs , 2002, Springer Berlin Heidelberg.

[3]  José L. Balcázar,et al.  Subtree Testing and Closed Tree Mining Through Natural Representations , 2007, 18th International Workshop on Database and Expert Systems Applications (DEXA 2007).

[4]  José L. Balcázar,et al.  Intersection Algorithms and a Closure Operator on Unordered Trees , 2006 .

[5]  Yun Chi,et al.  Mining closed and maximal frequent subtrees from databases of labeled rooted trees , 2005, IEEE Transactions on Knowledge and Data Engineering.

[6]  Jiawei Han,et al.  gSpan: graph-based substructure pattern mining , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[7]  José L. Balcázar,et al.  Mining frequent closed rooted trees , 2009, Machine Learning.

[8]  Yun Chi,et al.  Frequent Subtree Mining - An Overview , 2004, Fundam. Informaticae.

[9]  Carla E. Brodley,et al.  KDD-Cup 2000 organizers' report: peeling the onion , 2000, SKDD.

[10]  Gemma C. Garriga,et al.  Horn axiomatizations for sequential data , 2007, Theor. Comput. Sci..

[11]  Jiawei Han,et al.  CloseGraph: mining closed frequent graph patterns , 2003, KDD '03.

[12]  L. Beran,et al.  [Formal concept analysis]. , 1996, Casopis lekaru ceskych.

[13]  Xifeng Yan,et al.  CloSpan: Mining Closed Sequential Patterns in Large Datasets , 2003, SDM.