A Unified Approach for Learning Expertise and Authority in Digital Libraries

Managing individual expertise is a major concern within any industry-wide organization. Although previous work has extensively studied the related issues of expertise and authority profiling, it assumes that these two key concepts are semantically independent. In digital libraries, state-of-the-art models generally summarize researchers’ profiles using textual information alone. Consequently, authors with a large number of publications are mechanically favored, to the detriment of less prolific authors who may have higher expertise. To overcome this drawback, we propose to merge the two representations of expertise and authority and to balance the results by capturing a mutual reinforcement principle between these two notions. Based on a graph representation of the library, the expert profiling task is formulated as an optimization problem in which latent expertise and authority representations are learned simultaneously, thereby unbiasing the expertise scores of individuals with a large number of publications. The proposal is instantiated on a public scientific bibliographic dataset, where researchers’ publications are considered a source of evidence of individual expertise and citation relations a source of authoritative signals. Results from our experiments on the Microsoft Academic Search database demonstrate a significant efficiency improvement over state-of-the-art models for the expert retrieval task.
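
The mutual reinforcement idea can be illustrated with a toy sketch. It is not the optimization model proposed in the paper, which learns latent representations; it is a simple fixed-point iteration in the spirit of topic-sensitive PageRank, in which a score derived from textual evidence is blended with support received through citations. The function name `mutual_reinforcement`, the parameter `alpha`, and the three-author example data are all hypothetical and chosen only for illustration.

```python
def mutual_reinforcement(text_evidence, citations, alpha=0.5, iters=50):
    """Toy fixed-point iteration (an assumption, NOT the paper's method).

    text_evidence: dict author -> non-negative score from publications
    citations:     dict (citing_author, cited_author) -> citation count
    alpha:         weight of textual evidence versus citation support
    """
    authors = sorted(text_evidence)

    # Normalize textual evidence into a prior that sums to 1.
    total = sum(text_evidence.values())
    prior = {a: text_evidence[a] / total for a in authors}

    # Total number of citations given by each author.
    out_weight = {a: 0.0 for a in authors}
    for (src, _dst), w in citations.items():
        out_weight[src] += w

    score = dict(prior)
    for _ in range(iters):
        # Each author passes its current score along its citations.
        support = {a: 0.0 for a in authors}
        for (src, dst), w in citations.items():
            support[dst] += score[src] * w / out_weight[src]

        # Blend textual evidence with citation-based support.
        new = {a: alpha * prior[a] + (1 - alpha) * support[a]
               for a in authors}

        # Renormalize so that the scores remain comparable.
        norm = sum(new.values())
        score = {a: v / norm for a, v in new.items()}
    return score


# Hypothetical data: A is prolific but never cited; B publishes
# less but is cited by both A and C.
text_evidence = {"A": 10, "B": 2, "C": 2}
citations = {("A", "B"): 3, ("C", "B"): 2, ("B", "C"): 1}
scores = mutual_reinforcement(text_evidence, citations)
```

In this hypothetical example, author A dominates on textual evidence alone, yet the blended score ranks the less prolific but well-cited author B above A, which is the kind of rebalancing the abstract describes.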
