A COMPARATIVE STUDY OF FREQUENT SUBGRAPH MINING ALGORITHMS

Data mining algorithms are facing the challenge to deal with an increasing number of complex objects. Graph is a natural data structure used for modeling complex objects. Frequent subgraph mining is another active research topic in data mining . A graph is a general model to represent data and has been used in many domains like cheminformatics and bioinformatics. Mining patterns from graph databases is challenging since graph related operations, such as subgraph testing, generally have higher time complexity than the corresponding operations on itemsets, sequences, and trees. Many frequent subgraph Mining algorithms have been proposed. SPIN, SUBDUE, g_Span, FFSM, GREW are a few to mention. In this paper we present a detailed survey on frequent subgraph mining algorithms, which are used for knowledge discovery in complex objects and also propose a frame work for classification of these algorithms. The purpose is to help user to apply the techniques in a task specific manner in various application domains and to pave wave for further research.

[1]  Cheng-Te Li,et al.  Mining Temporal Subgraph Patterns in Heterogeneous Information Networks , 2010, 2010 IEEE Second International Conference on Social Computing.

[2]  Mario Vento,et al.  An Improved Algorithm for Matching Large Graphs , 2001 .

[3]  Lawrence B. Holder,et al.  Graph-Based Data Mining in Dynamic Networks: Empirical Comparison of Compression-Based and Frequency-Based Subgraph Mining , 2008, 2008 IEEE International Conference on Data Mining Workshops.

[4]  Jiawei Han,et al.  gSpan: graph-based substructure pattern mining , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[5]  Joost N. Kok,et al.  A quickstart in frequent structure mining can make a difference , 2004, KDD.

[6]  Thorsten Meinl,et al.  Hybrid fragment mining with MoFa and FSG , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[7]  Christian Böhm,et al.  Frequent subgraph discovery in dynamic networks , 2010, MLG '10.

[8]  Xing Wu,et al.  Mining Partially Labeled Subgraphs Using Support Constraints , 2009, 2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery.

[9]  George Karypis,et al.  GREW - a scalable frequent subgraph discovery algorithm , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[10]  Takashi Washio,et al.  An Apriori-Based Algorithm for Mining Frequent Substructures from Graph Data , 2000, PKDD.

[11]  Yuhua Li,et al.  A Directed Labeled Graph Frequent Pattern Mining Algorithm Based on Minimum Code , 2009, 2009 Third International Conference on Multimedia and Ubiquitous Engineering.

[12]  Li Jian Mining Frequent Jump Patterns from Graph Databases , 2010 .

[13]  Jeffrey Xu Yu,et al.  Efficient Discovery of Frequent Correlated Subgraph Pairs , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[14]  Hannu Toivonen,et al.  Finding Frequent Substructures in Chemical Compounds , 1998, KDD.

[15]  Jianzhong Li,et al.  Efficient Algorithms for Summarizing Graph Patterns , 2011, IEEE Transactions on Knowledge and Data Engineering.

[16]  Scott Fortin The Graph Isomorphism Problem , 1996 .

[17]  Ambuj K. Singh,et al.  GraphSig: A Scalable Approach to Mining Significant Subgraphs in Large Graph Databases , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[18]  George Karypis,et al.  Frequent substructure-based approaches for classifying chemical compounds , 2003, IEEE Transactions on Knowledge and Data Engineering.

[19]  Lawrence B. Holder,et al.  Substructure Discovery Using Minimum Description Length and Background Knowledge , 1993, J. Artif. Intell. Res..

[20]  Jiong Yang,et al.  SPIN: mining maximal frequent subgraphs from graph databases , 2004, KDD.

[21]  Christian Borgelt,et al.  Mining molecular fragments: finding relevant substructures of molecules , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[22]  George Karypis,et al.  Frequent subgraph discovery , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[23]  Joost N. Kok,et al.  Faster Association Rules for Multiple Relations , 2001, IJCAI.

[24]  Kamalakar Karlapalem,et al.  MARGIN: Maximal Frequent Subgraph Mining , 2006, ICDM.

[25]  George Karypis,et al.  Finding Frequent Patterns in a Large Sparse Graph* , 2005, Data Mining and Knowledge Discovery.

[26]  Jianzhong Li,et al.  JPMiner: Mining Frequent Jump Patterns from Graph Databases , 2009, 2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery.

[27]  Shijie Zhang,et al.  RING: An Integrated Method for Frequent Representative Subgraph Mining , 2009, 2009 Ninth IEEE International Conference on Data Mining.