Estimating node similarity from co-citation in a spatial graph model

Co-citation (number of nodes linking to both of a given pair of nodes) is often used heuristically to judge similarity between nodes in a complex network. We investigate the relation between node similarity and co-citation in the context of the Spatial Preferred Attachment (SPA) model. The SPA model is a spatial model, where nodes live in a metric space, and nodes that are close together in space are considered similar, and are more likely to link to one another. Theoretical analysis of the SPA model leads to a measure to estimate spatial distance from the link information, based on co-citation as well as the degrees of both nodes. Simulation results show this measure to be highly accurate in predicting the actual spatial distance.

[1]  Alan M. Frieze,et al.  A Geometric Preferential Attachment Model of Networks II , 2007, Internet Math..

[2]  Monika Henzinger,et al.  Finding Related Pages in the World Wide Web , 1999, Comput. Networks.

[3]  Amin Vahdat,et al.  On curvature and temperature of complex networks , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  Anthony Bonato,et al.  A Spatial Web Graph Model with Local Influence Regions , 2007, WAW.

[5]  N. Konno,et al.  Geographical threshold graphs with small-world and scale-free properties. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[6]  Henry G. Small,et al.  Co-citation in the scientific literature: A new measure of the relationship between two documents , 1973, J. Am. Soc. Inf. Sci..

[7]  Desmond J. Higham,et al.  Fitting a geometric graph to a protein-protein interaction network , 2008, Bioinform..

[8]  Filippo Menczer,et al.  Lexical and semantic clustering by Web links , 2004, J. Assoc. Inf. Sci. Technol..

[9]  Alan M. Frieze,et al.  A Geometric Preferential Attachment Model of Networks , 2004, WAW.

[10]  Kuei-Kuei Lai,et al.  Using the patent co-citation approach to establish a new patent classification system , 2005, Inf. Process. Manag..

[11]  Julie Bichteler,et al.  The combined use of bibliographic coupling and cocitation for document retrieval , 1980, J. Am. Soc. Inf. Sci..

[12]  Aric A. Hagberg,et al.  Giant Component and Connectivity in Geographical Threshold Graphs , 2007, WAW.

[13]  M. M. Kessler Bibliographic coupling between scientific papers , 1963 .