Arnetminer: expertise oriented search using social networks

Expertise Oriented Search (EOS) aims to provide comprehensive expertise analysis on data from distributed sources. It is useful in many application domains, for example, finding experts on a given topic, detecting conflicts of interest between researchers, and assigning reviewers to proposals. In this paper, we present the design and implementation of our expertise oriented search system, Arnetminer (http://www.arnetminer.net). Arnetminer has gathered and integrated information about half a million computer science researchers from the Web, including their profiles and publications. Moreover, Arnetminer constructs a social network among these researchers from their co-authorship, and uses this network information together with the individual profiles to support expertise oriented search tasks. In particular, the co-authorship information is used both in ranking the expertise of individual researchers for a given topic and in searching for associations between researchers. We have conducted initial experiments on Arnetminer. Our results demonstrate that the proposed relevancy propagation method for expert finding outperforms a method that uses only a person's local information, and that the proposed two-stage association search on a large-scale social network is an order of magnitude faster than the baseline method.
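
To make the idea of relevancy propagation over a co-authorship network concrete, the following is a minimal sketch in Python. It is not the formulation used in Arnetminer, which the abstract does not specify. It assumes that each researcher already has a local relevance score for a topic, and that a researcher's final score is a blend of that local score and the average score of co-authors, weighted by the number of shared papers. The function names, the `damping` parameter, and the iteration count are all hypothetical choices made for illustration.

```python
from collections import defaultdict


def build_coauthor_graph(papers):
    """Build an undirected co-authorship graph.

    `papers` is a list of author lists. The result maps each author to a
    dict of co-authors and the number of papers they share.
    """
    graph = defaultdict(lambda: defaultdict(int))
    for authors in papers:
        for a in authors:
            for b in authors:
                if a != b:
                    graph[a][b] += 1
    return graph


def propagate_relevance(local_scores, graph, damping=0.5, iterations=10):
    """Blend local topic scores with scores of co-authors (illustrative only).

    Assumed update rule, not taken from the paper:
        score(p) = (1 - damping) * local(p)
                   + damping * weighted mean of co-author scores
    """
    people = set(local_scores) | set(graph)
    scores = {p: local_scores.get(p, 0.0) for p in people}
    for _ in range(iterations):
        updated = {}
        for person in people:
            neighbors = graph.get(person, {})
            total_weight = sum(neighbors.values())
            if total_weight:
                neighbor_part = sum(
                    weight * scores[n] for n, weight in neighbors.items()
                ) / total_weight
            else:
                neighbor_part = 0.0
            updated[person] = (
                (1 - damping) * local_scores.get(person, 0.0)
                + damping * neighbor_part
            )
        scores = updated
    return scores


if __name__ == "__main__":
    # Toy data: alice and bob share two papers, bob and carol share one.
    papers = [["alice", "bob"], ["bob", "carol"], ["alice", "bob"]]
    graph = build_coauthor_graph(papers)
    # Only alice matches the topic on her own profile and publications.
    local = {"alice": 1.0, "bob": 0.0, "carol": 0.0}
    for name, score in sorted(propagate_relevance(local, graph).items()):
        print(f"{name}: {score:.3f}")
```

In this toy setting, a researcher with no local evidence for the topic still receives a nonzero score when closely connected to a strong expert, which is the intuition behind using network information in addition to a person's local information.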
