Neighborhood-based uncertainty generation in social networks

Imprecision, incompleteness, and dynamics arise in a wide range of network applications. Determining uncertain relationships among nodes is difficult because traditional models are not meaningful in uncertain networks, and problems involving uncertainty are often computationally intractable. In this paper, we study how to capture uncertainty in networks by transforming a series of snapshots of a network into an uncertain graph. We also propose a novel sampling scheme that enables the development of efficient algorithms for measuring uncertainty in networks. Considering the practical role of neighborhood relationships in real networks, we introduce a framework that transforms an uncertain network into a deterministic weighted network in which edge weights are measured by a Jaccard-like index. A comprehensive experimental evaluation on real data demonstrates the effectiveness and efficiency of our algorithms.
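
The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the abstract does not specify how edge probabilities, sampling, or the Jaccard-like index are defined, so the sketch assumes that an edge's probability is the fraction of snapshots containing it, that possible worlds are drawn by independent per-edge coin flips (plain Monte Carlo), and that the weight is the average Jaccard index of closed neighborhoods. All function names are hypothetical.

```python
import random


def edge_key(u, v):
    """Canonical key for an undirected edge."""
    return (u, v) if u <= v else (v, u)


def snapshots_to_uncertain_graph(snapshots):
    """Assumption: P(edge) = fraction of snapshots in which the edge appears."""
    counts = {}
    for snapshot in snapshots:
        # Deduplicate within a snapshot so (a, b) and (b, a) count once.
        for e in {edge_key(u, v) for u, v in snapshot}:
            counts[e] = counts.get(e, 0) + 1
    return {e: c / len(snapshots) for e, c in counts.items()}


def sample_world(uncertain, rng):
    """Draw one deterministic graph by keeping each edge independently."""
    adj = {}
    for (u, v), p in uncertain.items():
        adj.setdefault(u, set())
        adj.setdefault(v, set())
        if rng.random() < p:
            adj[u].add(v)
            adj[v].add(u)
    return adj


def jaccard(adj, u, v):
    """Jaccard index of closed neighborhoods (each node includes itself)."""
    nu = adj[u] | {u}
    nv = adj[v] | {v}
    return len(nu & nv) / len(nu | nv)


def expected_jaccard_weights(uncertain, n_samples=1000, seed=0):
    """Estimate a weight per edge as the mean Jaccard index over samples."""
    rng = random.Random(seed)
    totals = {e: 0.0 for e in uncertain}
    for _ in range(n_samples):
        adj = sample_world(uncertain, rng)
        for u, v in uncertain:
            totals[(u, v)] += jaccard(adj, u, v)
    return {e: t / n_samples for e, t in totals.items()}


if __name__ == "__main__":
    # Three toy snapshots of the same small network (made-up data).
    snapshots = [
        [("a", "b"), ("b", "c")],
        [("a", "b")],
        [("b", "a"), ("a", "c")],
    ]
    uncertain = snapshots_to_uncertain_graph(snapshots)
    for edge, weight in sorted(expected_jaccard_weights(uncertain).items()):
        print(edge, round(weight, 3))
```

The result is a deterministic weighted graph over the same edge set, with each weight in [0, 1]. The number of samples needed for a given accuracy is left as a parameter here; a concentration bound could be used to choose it, but that choice is outside this sketch.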