Formal models for expert finding in enterprise corpora

Searching an organization's document repositories for experts provides a cost effective solution for the task of expert finding. We present two general strategies to expert searching given a document collection which are formalized using generative probabilistic models. The first of these directly models an expert's knowledge based on the documents that they are associated with, whilst the second locates documents on topic, and then finds the associated expert. Forming reliable associations is crucial to the performance of expert finding systems. Consequently, in our evaluation we compare the different approaches, exploring a variety of associations along with other operational parameters (such as topicality). Using the TREC Enterprise corpora, we show that the second strategy consistently outperforms the first. A comparison against other unsupervised techniques, reveals that our second model delivers excellent performance.

[1]  Paul P. Maglio,et al.  Expertise identification using email communications , 2003, CIKM '03.

[2]  Mark Sanderson,et al.  Information retrieval system evaluation: effort, sensitivity, and reliability , 2005, SIGIR '05.

[3]  Dawit Yimam,et al.  Expert Finding Systems for Organizations: Domain Analysis and The DEMOIR Approach , 1999 .

[4]  Paul Thompson,et al.  An Inductive Search System: Theory, Design, and Implementation , 1986, IEEE Transactions on Systems, Man, and Cybernetics.

[5]  John D. Lafferty,et al.  A study of smoothing methods for language models applied to Ad Hoc information retrieval , 2001, SIGIR '01.

[6]  Raymond J. D'Amore Expertise community detection , 2004, SIGIR '04.

[7]  Morten Hertzum,et al.  The information-seeking practices of engineers: searching for documents as well as for people , 2000, Inf. Process. Manag..

[8]  Thomas H. Davenport,et al.  Book review:Working knowledge: How organizations manage what they know. Thomas H. Davenport and Laurence Prusak. Harvard Business School Press, 1998. $29.95US. ISBN 0‐87584‐655‐6 , 1998 .

[9]  Alfred Kobsa,et al.  Expert-Finding Systems for Organizations: Problem and Domain Analysis and the DEMOIR Approach , 2003, J. Organ. Comput. Electron. Commer..

[10]  Audris Mockus,et al.  Expertise Browser: a quantitative approach to identifying expertise , 2002, Proceedings of the 24th International Conference on Software Engineering. ICSE 2002.

[11]  Richard M. Schwartz,et al.  A hidden Markov model information retrieval system , 1999, SIGIR '99.

[12]  Bart Selman,et al.  Agent Amplified Communication , 1996, AAAI/IAAI, Vol. 1.

[13]  Nick Craswell,et al.  Overview of the TREC 2005 Enterprise Track , 2005, TREC.

[14]  Mark S. Ackerman,et al.  Expertise recommender: a flexible recommendation system and architecture , 2000, CSCW '00.

[15]  David Hawking,et al.  Challenges in Enterprise Search , 2004, ADC.

[16]  Djoerd Hiemstra,et al.  Using language models for information retrieval , 2001 .

[17]  Makoto Yokoo,et al.  Socialware: multiagent systems for supporting network communities , 1999, CACM.

[18]  David Hawking,et al.  Panoptic Expert: Searching for experts not just for documents , 2001 .