Using Rank Aggregation for Expert Search in Academic Digital Libraries

The task of expert finding has been getting increasing attention in information retrieval literature. However, the current state-of-the-art is still lacking in principled approaches for combining different sources of evidence. This paper explores the usage of unsupervised rank aggregation methods as a principled approach for combining multiple estimators of expertise, derived from the textual contents, from the graph-structure of the citation patterns for the community of experts, and from profile information about the experts. We specifically experimented two unsupervised rank aggregation approaches well known in the information retrieval literature, namely CombSUM and CombMNZ. Experiments made over a dataset of academic publications for the area of Computer Science attest for the adequacy of these methods.

[1]  M. de Rijke,et al.  Formal models for expert finding in enterprise corpora , 2006, SIGIR.

[2]  Julien Ah-Pine,et al.  On data fusion in information retrieval using different aggregation operators , 2011, Web Intell. Agent Syst..

[3]  Enrico Motta,et al.  The Open University at TREC 2006 Enterprise Track Expert Search Task , 2006, TREC.

[4]  J. E. Hirsch,et al.  An index to quantify an individual's scientific research output , 2005, Proc. Natl. Acad. Sci. USA.

[5]  Yannis Manolopoulos,et al.  Generalized comparison of graph-based ranking algorithms for publications and authors , 2006, J. Syst. Softw..

[6]  Nick Craswell,et al.  Overview of the TREC 2006 Enterprise Track , 2006, TREC.

[7]  Edward A. Fox,et al.  Combination of Multiple Searches , 1993, TREC.

[8]  Pavel Serdyukov,et al.  Search for expertise : going beyond direct evidence , 2009 .

[9]  Moni Naor,et al.  Rank aggregation methods for the Web , 2001, WWW '01.

[10]  Xiangji Huang,et al.  Modeling document features for expert finding , 2008, CIKM '08.

[11]  W. Bruce Croft,et al.  Proximity-based document representation for named entity retrieval , 2007, CIKM '07.

[12]  Mônica G. Campiteli,et al.  Is it possible to compare researchers with different scientific interests? , 2006, Scientometrics.

[13]  Chun-Ting Zhang,et al.  The e-Index, Complementing the h-Index for Excess Citations , 2009, PloS one.

[14]  Craig MacDonald,et al.  Voting techniques for expert search , 2008, Knowledge and Information Systems.

[15]  Michael G. Banks An extension of the Hirsch index: Indexing scientific topics and compounds , 2006, Scientometrics.

[16]  Sergei Maslov,et al.  Finding scientific gems with Google's PageRank algorithm , 2006, J. Informetrics.

[17]  Mohamed Farah,et al.  An outranking approach for rank aggregation in information retrieval , 2007, SIGIR.

[18]  Yannis Manolopoulos,et al.  Generalized h-index for Disclosing Latent Facts in Citation Networks , 2006, ArXiv.

[19]  Johan Bollen,et al.  Co-authorship networks in the digital library research community , 2005, Inf. Process. Manag..

[20]  L. Egghe,et al.  Theory and practise of the g-index , 2006, Scientometrics.

[21]  Bo Wang,et al.  Expert2Bólè: From Expert Finding to Bólè Search , 2009 .

[22]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[23]  Yannis Manolopoulos,et al.  Generalized Hirsch h-index for disclosing latent facts in citation networks , 2007, Scientometrics.

[24]  Luísa Coheur,et al.  Towards the Rapid Development of a Natural Language Understanding Module , 2011, IVA.

[25]  Nick Craswell,et al.  Overview of the TREC 2005 Enterprise Track , 2005, TREC.

[26]  Yannis Manolopoulos,et al.  A citation-based system to assist prize awarding , 2005, SGMD.

[27]  Shenghua Bao,et al.  Research on Expert Search at Enterprise Track of TREC 2006 , 2005, TREC.

[28]  ChengXiang Zhai,et al.  Probabilistic Models for Expert Finding , 2007, ECIR.

[29]  Hongbo Deng,et al.  Formal Models for Expert Finding on DBLP Bibliography Data , 2008, 2008 Eighth IEEE International Conference on Data Mining.