Optimized and Frequent Subgraphs: How Are They Related?

Frequent subgraph mining (FSM) is one of the most challenging tasks in graph mining. FSM consists of applying the data mining algorithms to extract interesting, unexpected, and useful graph patterns from the graphs. It also aspires to offer a richer apprehension of the given graph data. FSM has been applied to many domains, such as graphical data management and knowledge discovery, social network analysis, Bioinformatics, and security. In this context, a large number of techniques have been suggested to deal with the graph data, with the objective to extract the frequently occurring graph patterns. Such patterns are called frequent subgraph patterns (FSPs). FSPs are extracted using the traditional support threshold parameter. However, there exists no specialized scheme to decide the discovered FSPs are optimized as well. Thus, the aim of this paper is to suggest an optimization strategy to uncover the association between the frequent and the optimized subgraph patterns. For exploring the existence of the potential association between the FSPs and the optimized subgraph, a Particle Swarm Optimization technique is suggested. This relationship will be very handy to reduce the FSPs, by choosing those FSPs which were also discovered as optimized FSPs. Different experiments are performed using benchmark graph data sets to validate the existence of the aforementioned relationship between the optimized and the frequent FSPs.

[1]  Tri Nguyen,et al.  Topic Discovery Using Frequent Subgraph Mining Approach , 2017 .

[2]  Mong-Li Lee,et al.  A Partition-Based Approach to Graph Mining , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[3]  Chowdhury Farhan Ahmed,et al.  An Efficient Approach for Mining Frequent Subgraphs , 2017, PReMI.

[4]  Takashi Washio,et al.  An Apriori-Based Algorithm for Mining Frequent Substructures from Graph Data , 2000, PKDD.

[5]  Kai Zhao,et al.  Solving the traveling salesman problem based on an adaptive simulated annealing algorithm with greedy search , 2011, Appl. Soft Comput..

[6]  Julius Beneoluchi Odili,et al.  A GLOBAL AFRICAN BUFFALO OPTIMIZATION , 2017 .

[7]  Russell C. Eberhart,et al.  Parameter Selection in Particle Swarm Optimization , 1998, Evolutionary Programming.

[8]  Chunming Hu,et al.  Big graph search: challenges and techniques , 2014, Frontiers of Computer Science.

[9]  George Karypis,et al.  Frequent subgraph discovery , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[10]  Joost N. Kok,et al.  A quickstart in frequent structure mining can make a difference , 2004, KDD.

[11]  Changhe Li,et al.  A survey of swarm intelligence for dynamic optimization: Algorithms and applications , 2017, Swarm Evol. Comput..

[12]  George Karypis,et al.  Frequent substructure-based approaches for classifying chemical compounds , 2003, IEEE Transactions on Knowledge and Data Engineering.

[13]  Wei Wang,et al.  Mining protein family specific residue packing patterns from protein structure graphs , 2004, RECOMB.

[14]  Shijie Zhang,et al.  Monkey: Approximate Graph Mining Based on Spanning Trees , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[15]  Hannu Toivonen,et al.  Finding Frequent Substructures in Chemical Compounds , 1998, KDD.

[16]  Oscar Castillo,et al.  Fuzzy Logic for Combining Particle Swarm Optimization and Genetic Algorithms: Preliminary Results , 2009, MICAI.

[17]  Jyothisha J. Nair,et al.  A survey on extracting frequent subgraphs , 2016, 2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI).

[18]  Philip S. Yu,et al.  Mining significant graph patterns by leap search , 2008, SIGMOD Conference.

[19]  Huilian Fan,et al.  Discrete Particle Swarm Optimization for TSP based on Neighborhood , 2010 .

[20]  Izzat Alsmadi,et al.  Clustering and classification of email contents , 2015, J. King Saud Univ. Comput. Inf. Sci..

[21]  Baseem Khan,et al.  Selecting a Meta-Heuristic Technique for Smart Micro-Grid Optimization Problem: A Comprehensive Analysis , 2017, IEEE Access.

[22]  Philip S. Yu,et al.  Searching Substructures with Superimposed Distance , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[23]  Joong-Rin Shin,et al.  A particle swarm optimization for economic dispatch with nonsmooth cost functions , 2005, IEEE Transactions on Power Systems.

[24]  N.R. Malik,et al.  Graph theory with applications to engineering and computer science , 1975, Proceedings of the IEEE.

[25]  Julius Beneoluchi Odili,et al.  A comparative evaluation of swarm intelligence techniques for solving combinatorial optimization problems , 2017 .

[26]  Panos Kalnis,et al.  Incremental Frequent Subgraph Mining on Large Evolving Graphs , 2017, IEEE Transactions on Knowledge and Data Engineering.

[27]  Hisao Ishibuchi,et al.  Guest Editorial Evolutionary Many-Objective Optimization , 2018, IEEE Trans. Evol. Comput..

[28]  Jiawei Han,et al.  gSpan: graph-based substructure pattern mining , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[29]  Charu C. Aggarwal,et al.  Graph Data Management and Mining: A Survey of Algorithms and Applications , 2010, Managing and Mining Graph Data.

[30]  Jiawei Han,et al.  CloseGraph: mining closed frequent graph patterns , 2003, KDD '03.

[31]  Georgios Dounias,et al.  Honey bees mating optimization algorithm for the Euclidean traveling salesman problem , 2011, Inf. Sci..

[32]  Satvir Singh,et al.  The Firefly Optimization Algorithm: Convergence Analysis and Parameter Selection , 2013 .

[33]  Kamalakar Karlapalem,et al.  MARGIN: Maximal Frequent Subgraph Mining , 2006, ICDM.

[34]  Benhard Sitohang,et al.  A Review of Big Graph Mining Research , 2017 .

[35]  Jiawei Han,et al.  On effective presentation of graph patterns: a structural representative approach , 2008, CIKM '08.

[36]  Saeed Jalili,et al.  High-performance parallel frequent subgraph discovery , 2015, The Journal of Supercomputing.

[37]  A. John MINING GRAPH DATA , 2022 .

[38]  Marco Dorigo,et al.  AntNet: Distributed Stigmergetic Control for Communications Networks , 1998, J. Artif. Intell. Res..

[39]  Christian Borgelt,et al.  Mining molecular fragments: finding relevant substructures of molecules , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[40]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[41]  Aristides Gionis,et al.  Mining Frequent Patterns in Evolving Graphs , 2018, CIKM.

[42]  Yuji Matsumoto,et al.  An Application of Boosting to Graph Classification , 2004, NIPS.

[43]  Lawrence B. Holder,et al.  Substucture Discovery in the SUBDUE System , 1994, KDD Workshop.

[44]  Thorsten Meinl,et al.  A Quantitative Comparison of the Subgraph Miners MoFa, gSpan, FFSM, and Gaston , 2005, PKDD.

[45]  Hui Wang,et al.  A Parallel Approach for Frequent Subgraph Mining in a Single Large Graph Using Spark , 2018 .

[46]  Philip S. Yu,et al.  Direct mining of discriminative and essential frequent patterns via model-based search tree , 2008, KDD.

[47]  J. A. Rodríguez-Velázquez,et al.  Subgraph centrality in complex networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[48]  Arvinder Kaur,et al.  A Comprehensive Comparison of Ant Colony and Hybrid Particle Swarm Optimization Algorithms Through Test Case Selection , 2018 .

[49]  Todd P. Newman Tracking the release of IPCC AR5 on Twitter: Users, comments, and sources following the release of the Working Group I Summary for Policymakers , 2017, Public understanding of science.

[50]  Can Berk Kalayci,et al.  A survey of swarm intelligence for portfolio optimization: Algorithms and applications , 2018, Swarm Evol. Comput..

[51]  Aarzoo Dhiman,et al.  Frequent subgraph mining algorithms for single large graphs — A brief survey , 2016, 2016 International Conference on Advances in Computing, Communication, & Automation (ICACCA) (Spring).

[52]  George Karypis,et al.  An efficient algorithm for discovering frequent subgraphs , 2004, IEEE Transactions on Knowledge and Data Engineering.

[53]  Jong-Bae Park,et al.  An Improved Particle Swarm Optimization for Nonconvex Economic Dispatch Problems , 2010, IEEE Transactions on Power Systems.

[54]  R. Eberhart,et al.  Empirical study of particle swarm optimization , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[55]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[56]  Jianzhong Li,et al.  Mining Frequent Subgraph Patterns from Uncertain Graph Data , 2010, IEEE Transactions on Knowledge and Data Engineering.

[57]  Peter Sanders,et al.  Think Locally, Act Globally: Highly Balanced Graph Partitioning , 2013, SEA.

[58]  Satyanarayana R. Valluri,et al.  Maximal Frequent Subgraph Mining , 2022 .

[59]  Andreas Brandstädt,et al.  Efficient Domination and Efficient Edge Domination: A Brief Survey , 2018, CALDAM.

[60]  Jonathan L. Gross,et al.  Handbook of graph theory , 2007, Discrete mathematics and its applications.

[61]  Taku Kudo,et al.  Clustering graphs by weighted substructure mining , 2006, ICML.

[62]  Jiong Yang,et al.  SPIN: mining maximal frequent subgraphs from graph databases , 2004, KDD.

[63]  Xiang-Yang Li,et al.  Ranking of Closeness Centrality for Large-Scale Social Networks , 2008, FAW.

[64]  Yang Yu,et al.  FSP: Frequent Substructure Pattern mining , 2007, 2007 6th International Conference on Information, Communications & Signal Processing.

[65]  Cherif Chiraz Latiri,et al.  LC-mine: a framework for frequent subgraph mining with local consistency techniques , 2014, Knowledge and Information Systems.

[66]  Enzo Baccarelli,et al.  FLAPS: bandwidth and delay-efficient distributed data searching in Fog-supported P2P content delivery networks , 2017, The Journal of Supercomputing.

[67]  Russell C. Eberhart,et al.  A new optimizer using particle swarm theory , 1995, MHS'95. Proceedings of the Sixth International Symposium on Micro Machine and Human Science.

[68]  Mohammad Al Hasan,et al.  FS3: A sampling based method for top-k frequent subgraph mining , 2014, 2014 IEEE International Conference on Big Data (Big Data).

[69]  Philip S. Yu,et al.  Towards Graph Containment Search and Indexing , 2007, VLDB.

[70]  Philip S. Yu,et al.  Feature-based similarity search in graph structures , 2006, TODS.

[71]  Xin-She Yang,et al.  An improved discrete bat algorithm for symmetric and asymmetric Traveling Salesman Problems , 2016, Eng. Appl. Artif. Intell..

[72]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[73]  Frans Coenen,et al.  A survey of frequent subgraph mining algorithms , 2012, The Knowledge Engineering Review.

[74]  Frans Coenen,et al.  Frequent Sub-graph Mining on Edge Weighted Graphs , 2010, DaWak.

[75]  Xueliang Fu,et al.  An Ant System-Assisted Genetic Algorithm For Solving The Traveling Salesman Problem , 2012 .

[76]  James A. McHugh,et al.  Algorithmic Graph Theory , 1986 .