Influence Neighbourhoods in CiteSeer: A Case Study

We are interested in using purely network-based techniques to assist in matching instances across databases that can be represented as complex networks. In particular we are interested in the individuality of the influence neighbourhood of a node (the sub graph induced by its in-neighbours) in a directed network. We derive a paper citation network from the archived Cite Seer database and use this network as a case study of influence neighbourhoods. We show that papers can reliably be distinguished by their influence neighbourhoods.

[1]  Linyuan Lü,et al.  Predicting missing links via local information , 2009, 0901.0553.

[2]  Edward M. Reingold,et al.  Graph drawing by force‐directed placement , 1991, Softw. Pract. Exp..

[3]  M. Newman Coauthorship networks and patterns of scientific collaboration , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Adi Shamir,et al.  A method for obtaining digital signatures and public-key cryptosystems , 1978, CACM.

[5]  Carter T. Butts,et al.  network: A Package for Managing Relational Data in R , 2008 .

[6]  Silvana Castano,et al.  Ontology and Instance Matching , 2011, Knowledge-Driven Multimedia Information Extraction and Ontology Evolution.

[7]  Horst Bunke,et al.  A Graph-Theoretic Approach to Network Dynamics , 2007 .

[8]  Sharma Chakravarthy,et al.  Snoop: An Expressive Event Specification Language for Active Databases , 1994, Data Knowl. Eng..

[9]  Lada A. Adamic,et al.  Friends and neighbors on the Web , 2003, Soc. Networks.

[10]  Gjergji Kasneci,et al.  SIGMa: simple greedy matching for aligning large knowledge bases , 2012, KDD.

[11]  John K. Ousterhout,et al.  Tcl and the Tk Toolkit , 1994 .