Being Omnipresent To Be Almighty: The Importance of The Global Web Evidence for Organizational Expert Finding

Modern expert nding algorithms are developed under the assumption that all possible expertise evidence for a person is concentrated in a company that currently employs the person. The evidence that can be acquired outside of an enterprise is traditionally unnoticed. At the same time, the Web is full of personal information which is sufficiently detailed to judge about a person's skills and knowledge. In this work, we review various sources of expertise evidence out-side of an organization and experiment with rankings built on the data acquired from six dierent sources, accessible through APIs of two major web search engines. We show that these rankings and their combinations are often more realistic and of higher quality than rankings built on organizational data only.

[1]  Wouter Weerkamp,et al.  Bloggers as experts , 2008 .

[2]  Ying Li,et al.  Personal name classification in web queries , 2008, WSDM '08.

[3]  W. Bruce Croft,et al.  Proximity-based document representation for named entity retrieval , 2007, CIKM '07.

[4]  David Hawking,et al.  Panoptic Expert: Searching for experts not just for documents , 2001 .

[5]  Djoerd Hiemstra,et al.  Using language models for information retrieval , 2001 .

[6]  Mark S. Ackerman,et al.  Expertise networks in online communities: structure and algorithms , 2007, WWW '07.

[7]  Lada A. Adamic,et al.  Knowledge sharing and yahoo answers: everyone knows something , 2008, WWW.

[8]  F. Gobet,et al.  The Cambridge handbook of expertise and expert performance , 2006 .

[9]  Craig MacDonald,et al.  High Quality Expertise Evidence for Expert Search , 2008, ECIR.

[10]  Michael Idinopulos,et al.  Do you Know who your Experts are , 2006 .

[11]  Daniel Gruhl,et al.  The web beyond popularity: a really simple system for web scale RSS , 2006, WWW '06.

[12]  Irma Becerra-Fernandez Facilitating the Online Search of Experts at NASA using Expert Seeker People-Finder , 2000, PAKM.

[13]  Gianluca Demartini Finding Experts Using Wikipedia , 2007, FEWS.

[14]  Judit Bar-Ilan,et al.  Which h-index? — A comparison of WoS, Scopus and Google Scholar , 2008, Scientometrics.

[15]  M. de Rijke,et al.  Broad expertise retrieval in sparse data environments , 2007, SIGIR.

[16]  Ethan A. Kolek,et al.  Online Disclosure: An Empirical Examination of Undergraduate Facebook Profiles , 2008 .

[17]  Haiqiang Chen,et al.  Social Network Structure Behind the Mailing Lists: ICT-IIIS at TREC 2006 Expert Finding Track , 2006, TREC.

[18]  Craig MacDonald,et al.  Voting for candidates: adapting data fusion techniques for an expert search task , 2006, CIKM '06.

[19]  Mike Thelwall,et al.  Blog search engines , 2007, Online Inf. Rev..

[20]  Nick Craswell,et al.  Overview of the TREC 2005 Enterprise Track , 2005, TREC.

[21]  Paul J. Feltovich,et al.  The Cambridge handbook of expertise and expert performance , 2006 .

[22]  Michael L. Nelson,et al.  Agreeing to disagree: search engines and their public interfaces , 2007, JCDL '07.

[23]  Steven Skiena,et al.  Newspapers vs. Blogs: Who Gets the Scoop? , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[24]  Mark T. Maybury,et al.  Expert Finding Systems , 2006 .

[25]  Jonathan Grudin,et al.  Crossing Boundaries: A Case Study of Employee Blogging , 2007, 2007 40th Annual Hawaii International Conference on System Sciences (HICSS'07).

[26]  Chris Sherman Google Power: Unleash the Full Potential of Google , 2005 .

[27]  Taher H. Haveliwala Topic-sensitive PageRank , 2002, IEEE Trans. Knowl. Data Eng..

[28]  Thomas H. Davenport,et al.  Ten principles of knowledge management and four case studies , 1997 .

[29]  Cai-Nicolas Ziegler,et al.  Towards Automated Reputation and Brand Monitoring on the Web , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[30]  Tao Qin,et al.  Supervised rank aggregation , 2007, WWW '07.

[31]  Javed A. Aslam,et al.  Models for metasearch , 2001, SIGIR '01.

[32]  Rachel K. E. Bellamy,et al.  BlogCentral: the role of internal blogs at work , 2007, CHI Extended Abstracts.

[33]  Filip Radlinski,et al.  A support vector method for optimizing average precision , 2007, SIGIR.

[34]  Ronald Fagin,et al.  Comparing top k lists , 2003, SODA '03.