Exploring graph mining approaches for dynamic heterogeneous networks

As we become a more ’connected’ society, a greater need exists to understand complex network structures. While many in the field of data mining analyze network data, most models of networks are straightforward focusing on many connections of a single type. In order to better understand relationships between different types of entities and extract meaningful structure from heterogeneous data, data mining algorithms need to be developed for new models of complex graphs. In this extended abstract, we describe some existing graph models and propose directions for more robust models that can serve as a backbone for analysis and mining of heterogeneous network data sets. We also identify possible metrics that attempt to capture the relationship of heterogeneous components of the network and serve to give insight into the topological relationships that exist among different node and edge types. The development of these different metrics will be important for designing meaningful clustering and graph mining algorithms for these data. Finally, we consider dynamic versions of these networks and present issues for the next generation of mining algorithms related to community identification, pruning and large graph approximations, privacy preservation of complex graph data.

[1]  Cynthia Dwork,et al.  Wherefore art thou r3579x?: anonymized social networks, hidden patterns, and structural steganography , 2007, WWW '07.

[2]  Takashi Washio,et al.  State of the art of graph-based data mining , 2003, SKDD.

[3]  Lise Getoor,et al.  Link-Based Classification , 2003, Encyclopedia of Machine Learning and Data Mining.

[4]  R. Pastor-Satorras,et al.  Epidemic spreading in correlated complex networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[5]  Tanya Y. Berger-Wolf,et al.  A framework for analysis of dynamic social networks , 2006, KDD '06.

[6]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[7]  Lise Getoor,et al.  Link-Based Classification , 2003, Encyclopedia of Machine Learning and Data Mining.

[8]  M. Newman Clustering and preferential attachment in growing networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[9]  Tanya Y. Berger-Wolf,et al.  A framework for community identification in dynamic social networks , 2007, KDD '07.

[10]  Jiawei Han,et al.  Mining hidden community in heterogeneous social networks , 2005, LinkKDD '05.

[11]  Jon M. Kleinberg,et al.  Group formation in large social networks: membership, growth, and evolution , 2006, KDD '06.

[12]  Lisa Singh,et al.  Visual Mining of Multi-Modal Social Networks at Different Abstraction Levels , 2007, 2007 11th International Conference Information Visualization (IV '07).

[13]  Lisa Singh,et al.  C-GROUP: A Visual Analytic Tool for Pairwise Analysis of Dynamic Group Membership , 2007, 2007 IEEE Symposium on Visual Analytics Science and Technology.

[14]  Lisa Singh,et al.  Pruning social networks using structural properties and descriptive attributes , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).