Distributed Representations of Expertise

Collaborative networks, in which domain experts work together to solve tasks issued by customers, are common in practice. Modeling the proficiency of experts is critical to understanding and optimizing such networks. Traditional expertise models, such as topic-model-based methods, cannot simultaneously capture two aspects of human expertise: specialization (which areas is an expert good at?) and proficiency level (to what degree?). In this paper, we propose new models that overcome this limitation. We embed all historical task data in a lower-dimensional space and learn vector representations of expertise from both solved and unsolved tasks. Specifically, our first model assumes that each expert handles only tasks whose difficulty level matches his or her proficiency level, while the second model assumes that experts accept tasks whose difficulty level is equal to or lower than their proficiency level. Experiments on real-world datasets show that both models outperform topic-model-based approaches and standard classifiers such as logistic regression and support vector machines in terms of prediction accuracy. The learned vector representations can be used to compare expertise across a large organization and to optimize expert allocation.
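To make the contrast between the two assumptions concrete, the following is a minimal toy sketch and not the formulation used in the paper. It assumes that experts and tasks live in the same low-dimensional space, that each coordinate stands for a hypothetical skill area, and that larger values mean higher proficiency (for experts) or higher difficulty (for tasks). The function names `match_score` and `threshold_score`, and the Gaussian and sigmoid scoring forms, are illustrative choices only.

```python
import math


def sigmoid(x):
    """Logistic function mapping a real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def match_score(expert, task):
    """Toy version of the matching assumption (hypothetical form).

    The score is highest when the task difficulty equals the expert's
    proficiency in every skill area, and decays as the two vectors
    move apart in either direction (task too hard or too easy).
    """
    dist_sq = sum((e - t) ** 2 for e, t in zip(expert, task))
    return math.exp(-dist_sq)


def threshold_score(expert, task):
    """Toy version of the threshold assumption (hypothetical form).

    The score is high when the task difficulty is at or below the
    expert's proficiency in every skill area, and drops once any
    coordinate of the task exceeds the expert's level.
    """
    score = 1.0
    for e, t in zip(expert, task):
        score *= sigmoid(e - t)
    return score


if __name__ == "__main__":
    # Assumed example vectors: skill area 0 is the expert's specialty.
    expert = [3.0, 0.5]
    tasks = {
        "easy": [1.0, 0.0],
        "matched": [3.0, 0.5],
        "hard": [5.0, 0.5],
    }
    for name, task in tasks.items():
        print(name, match_score(expert, task), threshold_score(expert, task))
```

Under these assumptions, the matching score favors only the task whose difficulty coincides with the expert's level, whereas the threshold score also favors easier tasks and penalizes only those that exceed the expert's level in some skill area.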
