Closed and Maximal Subgraph Mining in Internally and Externally Weighted Graph Databases

We formalize a problem of closed and maximal pattern discovery in internally and externally weighted graph databases. We introduce two weights, internal weights and external weights, which represent utility and significance of each edge in the graph, and importance and reliability of the graph itself, respectively. In our formulation, graphs with the two sets of weights describe the target data to be mined precisely. As an extension of traditional sub graph miners, we develop a mining algorithm called "wgMiner" for discovering all closed and maximal patterns in the weighted graph databases. With wgMiner, experiments demonstrate the effectiveness of our formulation in pattern mining from communication networks.

[1]  Jiawei Han,et al.  gSpan: graph-based substructure pattern mining , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[2]  Jun Zhang,et al.  FOGGER: an algorithm for graph generator discovery , 2009, EDBT '09.

[3]  Jing Wang,et al.  Pushing Frequency Constraint to Utility Mining Model , 2007, International Conference on Computational Science.

[4]  Jiawei Han,et al.  CloseGraph: mining closed frequent graph patterns , 2003, KDD '03.

[5]  Yiming Yang,et al.  The Enron Corpus: A New Dataset for Email Classi(cid:12)cation Research , 2004 .

[6]  Nada Lavrač,et al.  A Fast Algorithm for Mining Utility-Frequent Itemsets , 2007 .