Harnessing global expertise: A comparative study of expertise profiling methods for online communities

Building expertise profiles in global online communities is a critical step in leveraging the range of expertise available in the global knowledge economy. In this paper we introduce a three-stage framework that automatically generates expertise profiles of online community members. In the first two stages, document-topic relevance and user-document association are estimated for calculating users’ expertise levels on individual topics. We empirically compare two state-of-the-art information retrieval techniques, the vector space model and the language model, with a Latent Dirichlet Allocation (LDA) based model for computing document-topic relevance as well as the direct and indirect association models for computing user-document association. In the third stage we test whether a filtering strategy can improve the performance of expert profiling. Our experimental results using two real datasets provide useful insights on how to select the best models for profiling users’ expertise in online communities that can work across a range of global communities.

[1]  Ulrich Remus,et al.  Critical Success Factors for Managing Offshore Software Development Projects , 2009 .

[2]  Alexander Ardichvili,et al.  Motivation and barriers to participation in virtual knowledge-sharing communities of practice , 2003, J. Knowl. Manag..

[3]  W. Bruce Croft,et al.  LDA-based document models for ad-hoc retrieval , 2006, SIGIR.

[4]  Maozhen Li,et al.  Preface , 2016, Int. J. Pattern Recognit. Artif. Intell..

[5]  Volker Wulf,et al.  Bridging Artifacts and Actors: Expertise Sharing in Organizational Ecosystems , 2012, Computer Supported Cooperative Work (CSCW).

[6]  S. Ghoshal,et al.  Social Capital, Intellectual Capital, and the Organizational Advantage , 1998 .

[7]  Eoghan Casey,et al.  Handbook of Digital Forensics and Investigation , 2009 .

[8]  Jonathon N. Cummings Work Groups, Structural Diversity, and Knowledge Sharing in a Global Organization , 2004, Manag. Sci..

[9]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[10]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Charles L. A. Clarke,et al.  Efficient construction of large test collections , 1998, SIGIR '98.

[12]  Eric T. G. Wang,et al.  Understanding knowledge sharing in virtual communities: An integration of social capital and social cognitive theories , 2006, Decis. Support Syst..

[13]  Prashant C. Palvia Global Information Technology Research: Past, Present And Future1 , 1998 .

[14]  Dik Lun Lee,et al.  Document Ranking and the Vector-Space Model , 1997, IEEE Softw..

[15]  Ronald E. Rice,et al.  Technology Adaptation: The Case of a Computer-Supported Inter-Organizational Virtual Team , 2000, MIS Q..

[16]  Wolfgang Nejdl,et al.  A Vector Space Model for Ranking Entities and Its Application to Expert Search , 2009, ECIR.

[17]  Gregor Heinrich Parameter estimation for text analysis , 2009 .

[18]  Andrew McCallum,et al.  Expertise modeling for matching papers with reviewers , 2007, KDD '07.

[19]  B. Everitt,et al.  Statistical methods for rates and proportions , 1973 .

[20]  Thomas L. Griffiths,et al.  Probabilistic author-topic models for information discovery , 2004, KDD.

[21]  Massimo Melucci,et al.  On rank correlation in information retrieval evaluation , 2007, SIGF.

[22]  John Seely Brown,et al.  Book Reviews : The Social Life of Information By John Seely Brown and Paul Duguid. Boston: Harvard Business School Press, 2000. 320 pages , 2000 .

[23]  Ronald E. Rice,et al.  Technology adaption: the case of a computer-supported inter-organizational virtual team 1 , 2000 .

[24]  Paul P. Maglio,et al.  Expertise identification using email communications , 2003, CIKM '03.

[25]  Tim S. Roberts Self, Peer and Group Assessment in E-Learning. , 2006 .

[26]  Wei Li,et al.  Cultural influences on knowledge sharing through online communities of practice , 2006, J. Knowl. Manag..

[27]  Volker Wulf,et al.  Expert Recommender: Designing for a Network Organization , 2009, Learning in Communities.

[28]  Aditya Johri,et al.  Sociomaterial bricolage: The creation of location-spanning work practices by global software developers , 2011, Inf. Softw. Technol..

[29]  Volker Wulf,et al.  Sharing Expertise: Beyond Knowledge Management , 2002 .

[30]  Ilan Oshri,et al.  Social ties, knowledge sharing and successful collaboration in globally distributed system development projects , 2005, Eur. J. Inf. Syst..

[31]  John C. Thomas,et al.  The knowledge management puzzle: Human and social factors in knowledge management , 2001, IBM Syst. J..

[32]  Djoerd Hiemstra,et al.  Modeling multi-step relevance propagation for expert finding , 2008, CIKM '08.

[33]  Daniel M. Herzig,et al.  Multilingual Expert Search using Linked Open Data as Interlingual Representation , 2010, CLEF.

[34]  CHENGXIANG ZHAI,et al.  A study of smoothing methods for language models applied to information retrieval , 2004, TOIS.

[35]  Mark S. Ackerman,et al.  Expertise recommender: a flexible recommendation system and architecture , 2000, CSCW '00.

[36]  Samer Faraj,et al.  Why Should I Share? Examining Social Capital and Knowledge Contribution in Electronic Networks of Practice , 2005, MIS Q..

[37]  Weiguo Fan,et al.  A Qualitative Study of Web-Based Knowledge Communities: Examining Success Factors , 2009, Int. J. e Collab..

[38]  John M. Silvester,et al.  The Social Life of Information: Brown, J.S., & Duguid, P. (2000). Cambridge, MA: Harvard Business School Publishing. ISBN 0-87584-762-5. 320 pages , 2000, Internet High. Educ..

[39]  J. Brown,et al.  Knowledge and Organization: A Social-Practice Perspective , 2001 .

[40]  A. Bandura Social Foundations of Thought and Action: A Social Cognitive Theory , 1985 .

[41]  M. de Rijke,et al.  A language modeling framework for expert finding , 2009, Inf. Process. Manag..

[42]  Corey S. Reitz What is Electronic Discovery , 2012 .

[43]  Pamela J. Hinds,et al.  Distributed Work , 2002 .

[44]  T. G.,et al.  Logic in Practice , 1934, Nature.

[45]  Geoffrey J. McLachlan,et al.  Analyzing Microarray Gene Expression Data , 2004 .

[46]  Mark S. Ackerman,et al.  Just talk to me: a field study of expertise location , 1998, CSCW '98.

[47]  Anastasia Papazafeiropoulou,et al.  Inter-Country Analysis of Electronic Commerce Adoption in South Eastern Europe: Policy Recommendations for the Region , 2004 .

[48]  S. Kiesler,et al.  Applying Common Identity and Bond Theory to Design of Online Communities , 2007 .

[49]  S H Strogatz,et al.  Random graph models of social networks , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[50]  Mark S. Ackerman,et al.  Competing to Share Expertise: The Taskcn Knowledge Sharing Community , 2021, ICWSM.

[51]  Mark S. Ackerman,et al.  Expertise networks in online communities: structure and algorithms , 2007, WWW '07.