Graph mining: A survey of graph mining techniques

Data mining is comprised of many data analysis techniques. Its basic objective is to discover the hidden and useful data pattern from very large set of data. Graph mining, which has gained much attention in the last few decades, is one of the novel approaches for mining the dataset represented by graph structure. Graph mining finds its applications in various problem domains, including: bioinformatics, chemical reactions, Program flow structures, computer networks, social networks etc. Different data mining approaches are used for mining the graph-based data and performing useful analysis on these mined data. In literature various graph mining approaches have been proposed. Each of these approaches is based on either classification; clustering or decision trees data mining techniques. In this study, we present a comprehensive review of various graph mining techniques. These different graph mining techniques have been critically evaluated in this study. This evaluation is based on different parameters. In our future work, we will provide our own classification based graph mining technique which will efficiently and accurately perform mining on the graph structured data.

[1]  Mark E. J. Newman A measure of betweenness centrality based on random walks , 2005, Soc. Networks.

[2]  Lawrence B. Holder,et al.  Empirical comparison of graph classification algorithms , 2009, 2009 IEEE Symposium on Computational Intelligence and Data Mining.

[3]  Hisashi Kashima,et al.  Kernels for graph classification , 2002 .

[4]  Vipin Kumar,et al.  A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs , 1998, SIAM J. Sci. Comput..

[5]  Zbigniew Walczak,et al.  Classification of Web Documents Using a Graph-Based Model and Structural Patterns , 2007, PKDD.

[6]  Heiner Stuckenschmidt,et al.  Ontology-Based Integration of Information - A Survey of Existing Approaches , 2001, OIS@IJCAI.

[7]  Giuseppe Di Fatta,et al.  High Performance Subgraph Mining in Molecular Compounds , 2005, HPCC.

[8]  Yun Chi,et al.  Indexing and mining free trees , 2003, Third IEEE International Conference on Data Mining.

[9]  Günther Palm,et al.  Semi‐Supervised Clustering in Functional Genomics , 2009 .

[10]  Yiling Chen,et al.  A Bipartite Graph Co-Clustering Approach to Ontology Mapping , 2003 .

[11]  Luiz S. Ochi,et al.  Efficient evolutionary algorithms for the clustering problem in directed graphs , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[12]  Jeffrey Xu Yu,et al.  Mining Closed Frequent Free Trees in Graph Databases , 2007, DASFAA.

[13]  Hiroshi Motoda What Can We Do with Graph-Structured Data? - A Data Mining Perspective , 2006, Australian Conference on Artificial Intelligence.

[14]  Luc De Raedt,et al.  Feature Construction with Version Spaces for Biochemical Applications , 2001, ICML.

[15]  Inderjit S. Dhillon,et al.  A fast kernel-based multilevel algorithm for graph clustering , 2005, KDD '05.

[16]  Foster J. Provost,et al.  Classification in Networked Data: a Toolkit and a Univariate Case Study , 2007, J. Mach. Learn. Res..

[17]  Casimir A. Kulikowski,et al.  Coring method for clustering a graph , 2008, 2008 19th International Conference on Pattern Recognition.

[18]  Abraham Kandel,et al.  Classification of Web documents using a graph model , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[19]  Kevin Françoisse,et al.  Semi-supervised Classification from Discriminative Random Walks , 2008, ECML/PKDD.

[20]  Satu Elisa Schaeffer,et al.  Graph Clustering , 2017, Encyclopedia of Machine Learning and Data Mining.

[21]  M. Sewell,et al.  Kernel Methods , 2017, Encyclopedia of Machine Learning and Data Mining.

[22]  Charu C. Aggarwal,et al.  Graph Clustering , 2010, Encyclopedia of Machine Learning and Data Mining.

[23]  Günther Palm,et al.  On the robustness of semi-supervised hierarchical graph clustering in functional genomics , 2007 .