Axiomatic ranking of network role similarity

A key task in analyzing social networks and other complex networks is role analysis: describing and categorizing nodes by how they interact with other nodes. Two nodes have the same role if they interact with equivalent sets of neighbors. The most fundamental role equivalence is automorphic equivalence. Unfortunately, the fastest algorithm known for graph automorphism is nonpolynomial. Moreover, since exact equivalence is rare, a more meaningful task is to measure the role similarity between any two nodes. This task is closely related to the link-based similarity problem that SimRank addresses. However, SimRank and other existing similarity measures are not sufficient, because they are not guaranteed to recognize automorphically or structurally equivalent nodes. This paper makes two contributions. First, we present and justify several axiomatic properties necessary for a role similarity measure or metric. Second, we present RoleSim, a role similarity metric that satisfies these axioms and can be computed with a simple iterative algorithm. We rigorously prove that RoleSim satisfies all the axiomatic properties and demonstrate its superior interpretative power on both synthetic and real datasets.
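The abstract does not state the update rule, so the following is only a minimal sketch of what an iterative, neighbor-matching role similarity computation could look like, not the paper's definitive algorithm. It assumes that the similarity of two nodes is a decayed average of the best one-to-one matching between their neighbors' similarities, starting from all-ones scores. The function name `role_similarity`, the parameters `beta` and `iters`, the brute-force matching, and the handling of isolated nodes are all hypothetical choices made for illustration.

```python
from itertools import permutations


def role_similarity(adj, beta=0.15, iters=10):
    """Iteratively score node pairs by how well their neighborhoods match.

    adj   : dict mapping each node to a list of its neighbors (undirected).
    beta  : assumed decay factor in (0, 1); the floor for every score.
    iters : assumed fixed number of refinement rounds.
    """
    nodes = list(adj)
    # Start by treating every pair as fully similar, then refine downward.
    sim = {(u, v): 1.0 for u in nodes for v in nodes}

    for _ in range(iters):
        new = {}
        for u in nodes:
            for v in nodes:
                nu, nv = adj[u], adj[v]
                if not nu or not nv:
                    # Assumption of this sketch: two isolated nodes are
                    # fully similar; otherwise only the floor beta applies.
                    new[(u, v)] = 1.0 if (not nu and not nv) else beta
                    continue
                small, large = (nu, nv) if len(nu) <= len(nv) else (nv, nu)
                # Brute-force maximum-weight matching of the smaller
                # neighborhood into the larger one (tiny graphs only).
                best = max(
                    sum(sim[(x, y)] for x, y in zip(small, perm))
                    for perm in permutations(large, len(small))
                )
                new[(u, v)] = (1 - beta) * best / len(large) + beta
        sim = new
    return sim


# Path graph 0 - 1 - 2: the two end nodes are automorphically equivalent.
path = {0: [1], 1: [0, 2], 2: [1]}
scores = role_similarity(path)
print(scores[(0, 2)], scores[(0, 1)])
```

On the three-node path, the two end nodes can be swapped by an automorphism, so under these assumptions their score stays at the maximum while the score between an end node and the middle node drops below it. The brute-force matching is exponential in neighborhood size; a real implementation would need a proper weighted-matching routine or a heuristic.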
