Potential Theory for Directed Networks

Uncovering the factors that underlie network formation is a long-standing challenge in data mining and network analysis. In particular, the microscopic organizing principles of directed networks are less well understood than those of undirected networks. This article proposes a hypothesis named potential theory, which assumes that every directed link corresponds to a decrease of one unit of potential, and that subgraphs in which a potential value can be defined for every node are preferred. Combining the potential theory with the clustering and homophily mechanisms, we deduce that the Bi-fan structure, consisting of 4 nodes and 4 directed links, is the most favored local structure in directed networks. The hypothesis is strongly supported by extensive experiments on 15 directed networks drawn from disparate fields: within the link prediction framework, the Bi-fan predictor gives the most accurate and robust performance. In summary, our main contribution is twofold: (i) we propose a new mechanism for the local organization of directed networks; (ii) we design the corresponding link prediction algorithm, which not only tests our hypothesis but also has direct applications in missing link prediction and friendship recommendation.
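
To make the definability condition concrete, the following is a minimal sketch under stated assumptions, not the article's algorithm. It reads "definable potential" as: integer potentials can be assigned so that every link u -> v satisfies potential(v) = potential(u) - 1. The function name `assign_potentials` and the example subgraphs are illustrative choices, and the Bi-fan predictor itself is not reproduced here.

```python
from collections import defaultdict, deque


def assign_potentials(edges):
    """Return {node: potential} if every link u -> v can satisfy
    potential[v] == potential[u] - 1; return None otherwise.

    Hypothetical helper: one reading of the "definable potential"
    condition stated in the abstract.
    """
    # Treat each link as traversable both ways, remembering the
    # potential change that each direction implies.
    neighbors = defaultdict(list)
    for u, v in edges:
        neighbors[u].append((v, -1))  # along the link: potential drops by 1
        neighbors[v].append((u, +1))  # against the link: potential rises by 1

    potential = {}
    for start in list(neighbors):
        if start in potential:
            continue
        potential[start] = 0  # arbitrary reference level per component
        queue = deque([start])
        while queue:
            node = queue.popleft()
            for other, delta in neighbors[node]:
                expected = potential[node] + delta
                if other not in potential:
                    potential[other] = expected
                    queue.append(other)
                elif potential[other] != expected:
                    return None  # two paths disagree: no consistent potential
    return potential


# Bi-fan: two upper nodes each pointing to the same two lower nodes.
bi_fan = [("a", "c"), ("a", "d"), ("b", "c"), ("b", "d")]
# Feed-forward loop and directed 3-cycle, for comparison.
feed_forward = [("a", "b"), ("b", "c"), ("a", "c")]
cycle3 = [("a", "b"), ("b", "c"), ("c", "a")]

print(assign_potentials(bi_fan))
print(assign_potentials(feed_forward))
print(assign_potentials(cycle3))
```

Under this reading, the Bi-fan admits a consistent assignment (both upper nodes one unit above both lower nodes), whereas the feed-forward loop and the directed 3-cycle do not, because two paths between the same pair of nodes imply different potential differences.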
