Mining significant trend sequences in dynamic attributed graphs

Abstract Discovering patterns in graphs has many applications such as social network, biological and chemistry data analysis. Although many algorithms were proposed to identify interesting patterns in graphs, most of them consider simple types of graphs such as static graphs or graphs having a single attribute per vertex. Recently, studies have considered discovering frequent patterns in dynamic attributed graphs. These graphs can represent not only relationships between entities but also how they evolve over time, and describe entities with multiple attributes. Algorithms for mining frequent patterns in dynamic attributed graphs select patterns based on their occurrence frequency. But a major drawback of this approach is that many frequent patterns may contain entities that are weakly correlated. Thus, numerous frequent but spurious patterns may be shown to the user. To allows discovering strongly correlated patterns in dynamic attributed graphs, this paper proposes a novel significance measure named Sequence Virtual Growth Rate. It allows evaluating if a pattern represents entities that are correlated in terms of their proximity in a graph over time. Based on this measure a novel type of graph patterns is defined called Significant Trend Sequence. To efficiently mine these patterns, two algorithms named TSeqMiner d f s − b f s and TSeqMiner d f s − d f s are proposed. They rely on a novel upper bound and pruning strategy to reduce the search space. Experimental results show that the proposed algorithms are efficient and can identify interesting patterns in real-world social network and flight data.

[1]  Yun Sing Koh,et al.  Mining local and peak high utility itemsets , 2019, Inf. Sci..

[2]  Lawrence B. Holder,et al.  Substructure Discovery Using Minimum Description Length and Background Knowledge , 1993, J. Artif. Intell. Res..

[4]  Hamido Fujita,et al.  An efficient method for mining high utility closed itemsets , 2019, Inf. Sci..

[5]  Philippe Fournier-Viger,et al.  A Survey of High Utility Sequential Pattern Mining , 2019, Studies in Big Data.

[6]  Jerry Chun-Wei Lin,et al.  A Survey of High Utility Itemset Mining , 2019, Studies in Big Data.

[7]  Yan Huang,et al.  A Framework for Mining Sequential Patterns from Spatio-Temporal Event Data Sets , 2008, IEEE Transactions on Knowledge and Data Engineering.

[8]  Ruoming Jin,et al.  Trend Motif: A Graph Mining Approach for Analysis of Dynamic Complex Networks , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[9]  Yi Yang,et al.  Diversified Temporal Subgraph Pattern Mining , 2016, KDD.

[10]  Ying Wang,et al.  Modeling and Density Estimation of an Urban Freeway Network Based on Dynamic Graph Hybrid Automata , 2017, Sensors.

[11]  Deyuan Meng,et al.  Network-to-Network Control Over Heterogeneous Topologies: A Dynamic Graph Approach , 2020, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[12]  Xiangliang Zhang,et al.  An up-to-date comparison of state-of-the-art classification algorithms , 2017, Expert Syst. Appl..

[13]  Mohammed J. Zaki Scalable Algorithms for Association Mining , 2000, IEEE Trans. Knowl. Data Eng..

[14]  Danai Koutra,et al.  TimeCrunch: Interpretable Dynamic Graph Summarization , 2015, KDD.

[15]  Wen-Chih Peng,et al.  Mining of Location-Based Social Networks for Spatio-Temporal Social Influence , 2017, PAKDD.

[16]  Danna Zhou,et al.  d. , 1934, Microbial pathogenesis.

[17]  Mete Celik Partial spatio-temporal co-occurrence pattern mining , 2014, Knowledge and Information Systems.

[18]  Jean-François Boulicaut,et al.  Trend Mining in Dynamic Attributed Graphs , 2013, ECML/PKDD.

[19]  Satoshi Matsuoka,et al.  Towards a Distributed Large-Scale Dynamic Graph Data Store , 2016, 2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW).

[20]  Aaron Striegel,et al.  On the Interplay Between Individuals’ Evolving Interaction Patterns and Traits in Dynamic Multiplex Social Networks , 2016, IEEE Transactions on Network Science and Engineering.

[21]  Hans-Peter Kriegel,et al.  Pattern Mining in Frequent Dynamic Subgraphs , 2006, Sixth International Conference on Data Mining (ICDM'06).

[22]  Philippe Fournier-Viger,et al.  A survey of itemset mining , 2017, WIREs Data Mining Knowl. Discov..

[23]  Rasmus Pagh,et al.  Triangle Counting in Dynamic Graph Streams , 2014, Algorithmica.

[24]  Jean-François Boulicaut,et al.  Weighted Path as a Condensed Pattern in a Single Attributed DAG , 2013, IJCAI.

[25]  Chang-Dong Wang,et al.  Community Detection in Graph Streams by Pruning Zombie Nodes , 2017, PAKDD.

[26]  Jiawei Han,et al.  gSpan: graph-based substructure pattern mining , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[27]  Takashi Washio,et al.  Complete Mining of Frequent Patterns from Graphs: Mining Graph Data , 2003, Machine Learning.

[28]  Sihem Amer-Yahia,et al.  Scalable Interactive Dynamic Graph Clustering on Multicore CPUs , 2019, IEEE Transactions on Knowledge and Data Engineering.

[29]  Michael Burch,et al.  Visualizing the Evolution of Ontologies: A Dynamic Graph Perspective , 2015, VOILA@ISWC.

[30]  Geoffrey I. Webb,et al.  Supervised Descriptive Rule Discovery: A Unifying Survey of Contrast Set, Emerging Pattern and Subgroup Mining , 2009, J. Mach. Learn. Res..

[31]  James Bailey,et al.  Node Re-Ordering as a Means of Anomaly Detection in Time-Evolving Graphs , 2016, ECML/PKDD.

[32]  Mehdi Kaytoue-Uberall,et al.  Triggering patterns of topology changes in dynamic graphs , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[33]  George Karypis,et al.  Algorithms for Mining the Evolution of Conserved Relational States in Dynamic Networks , 2011, ICDM.

[34]  Charu C. Aggarwal,et al.  Managing and Mining Graph Data , 2010, Managing and Mining Graph Data.

[35]  Claude Pasquier,et al.  Attributed graph mining in the presence of automorphism , 2016, Knowledge and Information Systems.

[36]  Jean-François Boulicaut,et al.  Cohesive Co-evolution Patterns in Dynamic Attributed Graphs , 2012, Discovery Science.

[37]  Benjamin A. Miller,et al.  Analytical Models and Methods for Anomaly Detection in Dynamic, Attributed Graphs , 2016 .

[38]  Yun Sing Koh,et al.  A Survey of Sequential Pattern Mining , 2017 .

[39]  Pong C. Yuen,et al.  Dynamic Label Graph Matching for Unsupervised Video Re-identification , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[40]  Ambuj K. Singh,et al.  Mining discriminative subgraphs from global-state networks , 2013, KDD.

[41]  Marc Plantevit,et al.  Mining Graph Topological Patterns: Finding Covariations among Vertex Descriptors , 2013, IEEE Transactions on Knowledge and Data Engineering.

[42]  Hamido Fujita,et al.  Damped window based high average utility pattern mining over data streams , 2017, Knowl. Based Syst..

[43]  Christophe Rigotti,et al.  Finding Collections of k-Clique Percolated Components in Attributed Graphs , 2012, PAKDD.

[44]  Danielle S. Bassett,et al.  Dynamic graph metrics: Tutorial, toolbox, and tale , 2017, NeuroImage.

[45]  Hamido Fujita,et al.  Extracting Non-redundant Correlated Purchase Behaviors by Utility Measure , 2017, DaWaK.

[46]  Nazha Selmaoui-Folcher,et al.  Frequent pattern mining in attributed trees: algorithms and applications , 2016, Knowledge and Information Systems.

[47]  Karthik Ganesan Pillai,et al.  Spatio-temporal Co-occurrence Pattern Mining in Data Sets with Evolving Regions , 2012, 2012 IEEE 12th International Conference on Data Mining Workshops.

[48]  Jinyan Li,et al.  Efficient mining of emerging patterns: discovering trends and differences , 1999, KDD '99.

[49]  Li Li,et al.  A Pattern Tree Based Method for Mining Conditional Contrast Patterns of Multi-source Data , 2017, 2017 IEEE International Conference on Data Mining Workshops (ICDMW).

[50]  Hadi Fanaee-T,et al.  Dynamic graph summarization: a tensor decomposition approach , 2018, Data Mining and Knowledge Discovery.

[51]  Lawrence B. Holder,et al.  Learning patterns in the dynamics of biological networks , 2009, KDD.

[52]  Wei Cao,et al.  An effective contrast sequential pattern mining approach to taxpayer behavior analysis , 2015, World Wide Web.

[53]  Tiejun Lv,et al.  Space-Time Hierarchical-Graph Based Cooperative Localization in Wireless Sensor Networks , 2016, IEEE Transactions on Signal Processing.

[54]  George Karypis,et al.  Finding Frequent Patterns in a Large Sparse Graph* , 2005, Data Mining and Knowledge Discovery.

[55]  Fabio Fassetti,et al.  Discovering discriminative graph patterns from gene expression data , 2016, SAC.

[56]  Haitao Zhao,et al.  Dynamic graph embedding for fault detection , 2018, Comput. Chem. Eng..

[57]  Karthik Ganesan Pillai,et al.  Spatiotemporal Co-occurrence Rules , 2013, ADBIS.

[58]  Takashi Washio,et al.  A Fast Method to Mine Frequent Subsequences from Graph Sequence Data , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[59]  Philippe Fournier-Viger,et al.  Mining Correlated High-Utility Itemsets Using the Bond Measure , 2016, HAIS.

[60]  Young-Koo Lee,et al.  Supergraph based periodic pattern mining in dynamic social networks , 2017, Expert Syst. Appl..

[61]  Jian Lu,et al.  A Pattern Mining Approach to Sensor-Based Human Activity Recognition , 2011, IEEE Transactions on Knowledge and Data Engineering.

[62]  Shashi Shekhar,et al.  Cascading Spatio-Temporal Pattern Discovery , 2012, IEEE Transactions on Knowledge and Data Engineering.

[63]  Petra Perner Mining Frequent Subgraph Pattern Over a Collection of Attributed-Graphs and Construction of a Relation Hierarchy for Result Reporting , 2017, ICDM.

[64]  Michael Burch,et al.  A Taxonomy and Survey of Dynamic Graph Visualization , 2017, Comput. Graph. Forum.

[65]  Zhi Cheng,et al.  Mining Recurrent Patterns in a Dynamic Attributed Graph , 2017, PAKDD.

[66]  George Karypis,et al.  Algorithms for Mining the Coevolving Relational Motifs in Dynamic Networks , 2015, ACM Trans. Knowl. Discov. Data.