Existing studies on graph mining focus on exact graphs that are precise and complete. However, graph data tends to be uncertain in practice due to noise, incompleteness and inaccuracy. This paper investigates the problem of finding top-k maximal cliques in an uncertain graph. A new model of uncertain graphs is presented, and an intuitive measure is introduced to evaluate the significance of vertex sets. An optimized branch-and-bound algorithm is developed to find top-k maximal cliques, which adopts efficient pruning rules, a new searching strategy and effective preprocessing methods. The extensive experimental results show that the proposed algorithm is very efficient on real uncertain graphs, and the top-k maximal cliques are very useful for real applications, e.g. protein complex prediction.
[1]
Gary D. Bader,et al.
An automated method for finding molecular complexes in large protein interaction networks
,
2003,
BMC Bioinformatics.
[2]
Jianzhong Li,et al.
Mining Frequent Subgraph Patterns from Uncertain Graph Data
,
2010,
IEEE Transactions on Knowledge and Data Engineering.
[3]
Kazuhisa Makino,et al.
New Algorithms for Enumerating All Maximal Cliques
,
2004,
SWAT.
[4]
Roded Sharan,et al.
BMC Bioinformatics BioMed Central
,
2006
.