Algorithms for estimating relative importance in networks

Large and complex graphs representing relationships among sets of entities are an increasingly common focus of interest in data analysis---examples include social networks, Web graphs, telecommunication networks, and biological networks. In interactive analysis of such data a natural query is "which entities are most important in the network relative to a particular individual or set of individuals?" We investigate the problem of answering such queries in this paper, focusing in particular on defining and computing the importance of nodes in a graph relative to one or more root nodes. We define a general framework and a number of different algorithms, building on ideas from social networks, graph theory, Markov models, and Web graph analysis. We experimentally evaluate the different properties of these algorithms on toy graphs and demonstrate how our approach can be used to study relative importance in real-world networks including a network of interactions among September 11th terrorists, a network of collaborative research in biotechnology among companies and universities, and a network of co-authorship relationships among computer science researchers.

[1]  Leo Katz,et al.  A new status index derived from sociometric analysis , 1953 .

[2]  John G. Kemeny,et al.  Finite Markov chains , 1960 .

[3]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[4]  M. Zelen,et al.  Rethinking centrality: Methods and examples☆ , 1989 .

[5]  Vladimir Batagelj,et al.  Centrality in Social Networks , 1993 .

[6]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[7]  M. KleinbergJon Authoritative sources in a hyperlinked environment , 1999 .

[8]  C. Lee Giles,et al.  Digital Libraries and Autonomous Citation Indexing , 1999, Computer.

[9]  David A. Cohn,et al.  Creating customized authority lists , 1999, ICML 1999.

[10]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[11]  Andrew McCallum,et al.  Learning to Create Customized Authority Lists , 2000, ICML.

[12]  Shlomo Moran,et al.  The stochastic approach for link-structure analysis (SALSA) and the TKC effect , 2000, Comput. Networks.

[13]  Allan Borodin,et al.  Finding authorities and hubs from link structures on the World Wide Web , 2001, WWW '01.

[14]  Valdis E. Krebs,et al.  Mapping Networks of Terrorist Cells , 2001 .

[15]  Peter D. Hoff,et al.  Latent Space Approaches to Social Network Analysis , 2002 .

[16]  Taher H. Haveliwala Topic-sensitive PageRank , 2002, IEEE Trans. Knowl. Data Eng..

[17]  Jennifer Widom,et al.  Scaling personalized web search , 2003, WWW '03.

[18]  Ronald Fagin,et al.  Comparing top k lists , 2003, SODA '03.

[19]  W. Powell,et al.  Network Dynamics and Field Evolution: The Growth of Interorganizational Collaboration in the Life Sciences1 , 2005, American Journal of Sociology.