Extracting evolutionary communities in community question answering

With the rapid growth of Web 2.0, community question answering (CQA) has become a prevalent information seeking channel, in which users form interactive communities by posting questions and providing answers. Communities may evolve over time, because of changes in users' interests, activities, and new users joining the network. To better understand user interactions in CQA communities, it is necessary to analyze the community structures and track community evolution over time. Existing work in CQA focuses on question searching or content quality detection, and the important problems of community extraction and evolutionary pattern detection have not been studied. In this article, we propose a probabilistic community model (PCM) to extract overlapping community structures and capture their evolution patterns in CQA. The empirical results show that our algorithm appears to improve the community extraction quality. We show empirically, using the iPhone data set, that interesting community evolution patterns can be discovered, with each evolution pattern reflecting the variation of users' interests over time. Our analysis suggests that individual users could benefit to gain comprehensive information from tracking the transition of products. We also show that the communities provide a decision‐making basis for business.

[1]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[2]  E A Leicht,et al.  Community structure in directed networks. , 2007, Physical review letters.

[3]  Yulin Fang,et al.  Understanding Sustained Participation in Open Source Software Projects , 2009, J. Manag. Inf. Syst..

[4]  Doug Schuler,et al.  Social computing , 1994, CACM.

[5]  Chen Wang,et al.  Detecting Overlapping Community Structures in Networks with Global Partition and Local Expansion , 2008, APWeb.

[6]  Paul A. Pavlou,et al.  Building Effective Online Marketplaces with Institution-Based Trust , 2004, Inf. Syst. Res..

[7]  Philip S. Yu,et al.  GraphScope: parameter-free mining of large time-evolving graphs , 2007, KDD '07.

[8]  C. Lee Giles,et al.  Efficient identification of Web communities , 2000, KDD '00.

[9]  M. Newman,et al.  Finding community structure in networks using the eigenvectors of matrices. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  Daniel Dajun Zeng,et al.  User community discovery from multi-relational networks , 2013, Decis. Support Syst..

[11]  Xiaolong Zheng,et al.  Analyzing open-source software systems as complex networks , 2008 .

[12]  Lada A. Adamic,et al.  The political blogosphere and the 2004 U.S. election: divided they blog , 2005, LinkKDD '05.

[13]  Daniel Zeng,et al.  How Useful Are Tags? - An Empirical Analysis of Collaborative Tagging for Web Page Recommendation , 2008, ISI Workshops.

[14]  Yun Chi,et al.  Evolutionary spectral clustering by incorporating temporal smoothness , 2007, KDD '07.

[15]  Matthias Trier,et al.  Research Note - Towards Dynamic Visualization for Understanding Evolution of Digital Communication Networks , 2008, Inf. Syst. Res..

[16]  Wenji Mao,et al.  Social Computing: From Social Informatics to Social Intelligence , 2007, IEEE Intell. Syst..

[17]  Daniel Dajun Zeng,et al.  Evolutionary Community Discovery from Dynamic Multi-relational CQA Networks , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[18]  Daniel Dajun Zeng,et al.  Collaborative filtering in social tagging systems based on joint item-tag recommendations , 2010, CIKM.

[19]  Gilad Mishne,et al.  Finding high-quality content in social media , 2008, WSDM '08.

[20]  Yihong Gong,et al.  A Bayesian Approach Toward Finding Communities and Their Evolutions in Dynamic Social Networks , 2009, SDM.

[21]  Bei Wang,et al.  Spatial Scan Statistics for Graph Clustering , 2008, SDM.

[22]  James A. Hendler,et al.  A Study of the Human Flesh Search Engine: Crowd-Powered Expansion of Online Knowledge , 2010, Computer.

[23]  Lorin M. Hitt,et al.  Self Selection and Information Role of Online Product Reviews , 2007, Inf. Syst. Res..

[24]  Yun Chi,et al.  Analyzing communities and their evolutions in dynamic social networks , 2009, TKDD.

[25]  Bin Gu,et al.  Competition Among Virtual Communities and User Valuation: The Case of Investing-Related Communities , 2007, Inf. Syst. Res..

[26]  Daniel Dajun Zeng,et al.  Guest Editors' Introduction: Social Computing , 2007, IEEE Intell. Syst..

[27]  Cecil Eng Huang Chua,et al.  The Role of Online Trading Communities in Managing Internet Auction Fraud , 2007, MIS Q..

[28]  Yun Chi,et al.  Facetnet: a framework for analyzing communities and their evolutions in dynamic networks , 2008, WWW.

[29]  Jennifer Preece,et al.  Toward Virtual Community Knowledge Evolution , 2002, J. Manag. Inf. Syst..

[30]  Jan Marco Leimeister,et al.  Design, Implementation, and Evaluation of Trust-Supporting Components in Virtual Communities for Patients , 2005, J. Manag. Inf. Syst..

[31]  Ann Majchrzak,et al.  Enabling Customer-Centricity Using Wikis and the Wiki Way , 2006, J. Manag. Inf. Syst..

[32]  Eugene Agichtein,et al.  Learning to recognize reliable users and content in social media with coupled mutual reinforcement , 2009, WWW '09.

[33]  Steve Gregory,et al.  An Algorithm to Find Overlapping Community Structure in Networks , 2007, PKDD.

[34]  Volker Tresp,et al.  Soft Clustering on Graphs , 2005, NIPS.

[35]  Andrew B. Whinston,et al.  Health of Electronic Communities: An Evolutionary Game Approach , 2004, J. Manag. Inf. Syst..

[36]  Huawei Shen,et al.  Quantifying and identifying the overlapping community structure in networks , 2009, 0905.2666.

[37]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[38]  David M. Blei,et al.  Probabilistic topic models , 2012, Commun. ACM.

[39]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[40]  Chris H. Q. Ding,et al.  A min-max cut algorithm for graph partitioning and data clustering , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[41]  Ritu Agarwal,et al.  Through a Glass Darkly: Information Technology Design, Identity Verification, and Knowledge Contribution in Online Communities , 2007, Inf. Syst. Res..

[42]  John Yen,et al.  An LDA-based Community Structure Discovery Approach for Large-Scale Social Networks , 2007, 2007 IEEE Intelligence and Security Informatics.

[43]  David J. Marchette,et al.  Scan Statistics on Enron Graphs , 2005, Comput. Math. Organ. Theory.

[44]  Eugene Agichtein,et al.  Finding the right facts in the crowd: factoid question answering over social media , 2008, WWW.

[45]  Padhraic Smyth,et al.  A Spectral Clustering Approach To Finding Communities in Graph , 2005, SDM.

[46]  Yong Yu,et al.  Recommending questions using the mdl-based tree cut model , 2008, WWW.

[47]  Anindya Ghose,et al.  Examining the Relationship Between Reviews and Sales: The Role of Reviewer Identity Disclosure in Electronic Markets , 2008, Inf. Syst. Res..

[48]  Amany R. Elbanna,et al.  From Control to Drift: The Dynamics of Corporate Information Infrastructures , 2001 .

[49]  Srinivasan Parthasarathy,et al.  An event-based framework for characterizing the evolutionary behavior of interaction graphs , 2007, KDD '07.

[50]  O. Hanseth,et al.  From Control to Drift: The Dynamics of Corporate Information Infrastructures , 2000 .