Graph stream classification builds learning models from continuously growing graph data. An essential step is to explore subgraph features that represent each graph for effective learning and classification. When representing a graph with subgraph features, all existing methods employ a coarse-grained feature representation, which considers only whether or not a subgraph feature appears in the graph. In this paper, we propose a fine-grained graph factorization approach for Fast Graph Stream Classification (FGSC). Our main idea is to find a set of cliques as a feature base and to represent each graph as a linear combination of the base cliques. To achieve this goal, we decompose each graph into a number of cliques and select discriminative cliques to generate a transfer matrix called the Clique Set Matrix (M). Using M as the base for formulating graph factorization, we represent each graph in a vector space in which each element denotes the degree to which the corresponding subgraph feature is related to the graph, so existing supervised learning algorithms can be applied to derive learning models for graph classification.
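The idea of representing a graph as a linear combination of base cliques can be illustrated with a minimal sketch. Everything here is an assumption made for illustration, not the method of the paper: graphs share a fixed node set `0..n-1`, each clique becomes one column of M by flattening its upper-triangular adjacency, the helper names (`clique_adjacency`, `edge_vector`, `build_clique_set_matrix`, `factorize`) are hypothetical, and ordinary least squares stands in for the factorization objective, which the abstract does not specify. The discriminative clique selection step is omitted.

```python
import numpy as np


def clique_adjacency(nodes, n):
    """Adjacency matrix (n x n) of a clique over the given node ids."""
    adj = np.zeros((n, n))
    for i in nodes:
        for j in nodes:
            if i != j:
                adj[i, j] = 1.0
    return adj


def edge_vector(adj):
    """Flatten the strict upper triangle of an adjacency matrix into a vector."""
    upper = np.triu_indices(adj.shape[0], k=1)
    return adj[upper]


def build_clique_set_matrix(cliques, n):
    """Stack one edge vector per base clique as the columns of M (assumed layout)."""
    return np.column_stack([edge_vector(clique_adjacency(c, n)) for c in cliques])


def factorize(adj, M):
    """Least-squares coefficients x such that M @ x approximates the graph.

    Plain least squares is a stand-in; the source does not state the objective.
    """
    coeffs, *_ = np.linalg.lstsq(M, edge_vector(adj), rcond=None)
    return coeffs


n = 4
base_cliques = [(0, 1, 2), (2, 3)]          # hypothetical feature base
M = build_clique_set_matrix(base_cliques, n)

# A graph containing both base cliques, and a graph with only the edge 0-1.
full_graph = clique_adjacency((0, 1, 2), n) + clique_adjacency((2, 3), n)
partial_graph = clique_adjacency((0, 1), n)

x_full = factorize(full_graph, M)
x_partial = factorize(partial_graph, M)
print(x_full, x_partial)
```

In this toy setting the graph holding only the edge 0-1 receives a fractional weight on the triangle feature, whereas a coarse-grained presence/absence encoding would mark that feature as absent. The resulting coefficient vectors could then be passed to any standard supervised learner.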