With the ubiquity of information networks and their broad applications, numerous studies have addressed the construction, online analytical processing, and mining of information networks across multiple disciplines, including social network analysis, the World Wide Web, database systems, data mining, machine learning, and networked communication and information systems. In this tutorial, we present an organized picture of scalable OLAP (online analytical processing) and mining of information networks, covering the following topics: (1) an introduction to information networks and information network analysis, (2) general statistical behavior of information networks, (3) mining frequent subgraphs in large graphs and networks, (4) data integration, data cleaning, and data validation in information networks, (5) clustering graphs and information networks, (6) classification of graphs and information networks, (7) summarization and simplification of graphs and information networks, (8) OLAP and multidimensional analysis of information networks, (9) evolution of dynamic information networks, and (10) research challenges in OLAP and mining of information networks.