A user ranking algorithm for efficient information management of community sites using spectral clustering and folksonomy

Community question answering (CQA) sites are the major platform for information sharing where posts are created by users as questions and answers. A large number of posts are created on a day-to-day basis, which raise the problem of information management of these sites. Multiple techniques are suggested in existing research for efficient management of CQA sites. Many of the existing techniques used the user ranking for managing the CQA sites but ignored the tagging data and user subject area. In this article, a user ranking method is derived using spectral clustering for posts management by considering the tagging data of CQA sites. Folksonomy is used to build relationship between tags, posts and users. The proposed method is developed in three stages. In first stage, the folksonomy relation is created and user similarity graph is built with the help of tag frequency-inverse post frequency and text similarity techniques. In the second stage, spectral clustering algorithm is applied on user similarity graph to group the similar users. Finally, in third stage, rank of users is identified from the clusters based on user’s information. The clustered users and rank of the users are generated as the output of the proposed algorithm that can provide a way of efficient information management. The experimental results show that the proposed user ranking algorithm outperforms the other considered ranking algorithms and can be helpful for information management of CQA sites. Some real-life applications of information management in CQA sites using the proposed work are also demonstrated in this article.

[1]  Daniel Zeng,et al.  Mining Evolutionary Topic Patterns in Community Question Answering Systems , 2011, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[2]  Yichuan Jiang,et al.  Diffusion in Social Networks: A Multiagent Perspective , 2015, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[3]  Jianxin Wu,et al.  Structured Learning of Binary Codes with Column Generation for Optimizing Ranking Measures , 2016, International Journal of Computer Vision.

[4]  Manuel Lama,et al.  A Keyword Recommendation Experiment to Support Information Organization and Folksonomies in Edu-AREA , 2015, IEEE Revista Iberoamericana de Tecnologias del Aprendizaje.

[5]  Gilad Lerman,et al.  Spectral Clustering Based on Local PCA , 2013, J. Mach. Learn. Res..

[6]  Dong Zhou,et al.  Query Expansion with Enriched User Profiles for Personalized Search Utilizing Folksonomy Data , 2017, IEEE Transactions on Knowledge and Data Engineering.

[7]  Pierre Vandergheynst,et al.  Compressive Spectral Clustering , 2016, ICML.

[8]  Wilfred Ng,et al.  Expert Finding for Question Answering via Graph Regularized Matrix Completion , 2015, IEEE Transactions on Knowledge and Data Engineering.

[9]  Shengrui Wang,et al.  Identifying authoritative actors in question-answering forums: the case of Yahoo! answers , 2008, KDD.

[10]  Arkaitz Zubiaga,et al.  Using Fuzzy Logic to Leverage HTML Markup for Web Page Representation , 2016, IEEE Transactions on Fuzzy Systems.

[11]  Uwe Matzat,et al.  Online Reputation Systems , 2019, The Oxford Handbook of Gossip and Reputation.

[12]  Jeffrey C. Carver,et al.  Building reputation in StackOverflow: An empirical investigation , 2013, 2013 10th Working Conference on Mining Software Repositories (MSR).

[13]  Leif Singer,et al.  Assessing Technical Candidates on the Social Web , 2013, IEEE Software.

[14]  Fang Chen,et al.  Spectral clustering of high-dimensional data via Nonnegative Matrix Factorization , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[15]  Shie-Jue Lee,et al.  A Similarity Measure for Text Classification and Clustering , 2014, IEEE Transactions on Knowledge and Data Engineering.

[16]  Mark S. Ackerman,et al.  Expertise networks in online communities: structure and algorithms , 2007, WWW '07.

[17]  Thibault Espinasse,et al.  Reconstructing undirected graphs from eigenspaces , 2016, J. Mach. Learn. Res..

[18]  Ralf Klamma,et al.  Community-aware ranking algorithms for expert identification in question-answer forums , 2015, I-KNOW.

[19]  Bernhard Hoisl,et al.  Social Rewarding in Wiki Systems - Motivating the Community , 2009, HCI.

[20]  A. K. Singh,et al.  TAGme: A Topical Folksonomy Based Collaborative Filtering for Tag Recommendation in Community Sites , 2017, MISNC '17.

[21]  Licheng Jiao,et al.  A Sparse Spectral Clustering Framework via Multiobjective Evolutionary Algorithm , 2016, IEEE Transactions on Evolutionary Computation.

[22]  Xiao Liu,et al.  Revisit tag-based profiles in the folksonomy: How many tags are sufficient for profiling? , 2017, 2017 IEEE International Conference on Big Data and Smart Computing (BigComp).

[23]  John Riedl,et al.  Folksonomy Formation , 2011, Computer.

[24]  Reyyan Yeniterzi Effective and Efficient Approaches to Retrieving and Using Expertise in Social Media , 2016, SIGF.

[25]  Evangelos E. Milios,et al.  Finding expert users in community question answering , 2012, WWW.

[26]  Jure Leskovec,et al.  Discovering value from community activity on focused question answering sites: a case study of stack overflow , 2012, KDD.

[27]  Naresh Kumar Nagwani,et al.  A Comment on "A Similarity Measure for Text Classification and Clustering" , 2015, IEEE Trans. Knowl. Data Eng..

[28]  Chen-Kun Tsung,et al.  A Spectral Clustering Approach Based on Modularity Maximization for Community Detection Problem , 2016, 2016 International Computer Symposium (ICS).

[29]  A. Rinaldo,et al.  Consistency of spectral clustering in stochastic block models , 2013, 1312.2050.

[30]  Hongfei Lin,et al.  Predicting Best Answerers for New Questions: An Approach Leveraging Distributed Representations of Words in Community Question Answering , 2015, 2015 Ninth International Conference on Frontier of Computer Science and Technology.

[31]  Bryce Glass,et al.  Building Web Reputation Systems , 2010 .

[32]  Jun Sun,et al.  Joint Latent Dirichlet Allocation for Social Tags , 2018, IEEE Transactions on Multimedia.

[33]  Suresh Manandhar,et al.  Tag-based expert recommendation in community question answering , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[34]  Xiaohua Hu,et al.  Learning the Multilingual Translation Representations for Question Retrieval in Community Question Answering via Non-Negative Matrix Factorization , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[35]  Naresh Kumar Nagwani,et al.  Folksonomy Based Trend Analysis on Community Question Answering Sites: A Perspective on Software Technologies , 2016, IEEE Access.

[36]  Yong Yu,et al.  Tapping on the potential of q&a community by recommending answer providers , 2008, CIKM '08.

[37]  Shamik Sural,et al.  Similarity between Euclidean and cosine angle distance for nearest neighbor queries , 2004, SAC '04.

[38]  Yike Guo,et al.  Fast graph clustering with a new description model for community detection , 2017, Inf. Sci..

[39]  Stephan Lukosch,et al.  Reputation in Peer-based Learning Environments , 2012 .

[40]  Xiaolong Wang,et al.  Answer Sequence Learning with Neural Networks for Answer Selection in Community Question Answering , 2015, ACL.

[41]  John G. Breslin,et al.  Evolution of Social Networks Based on Tagging Practices , 2013, IEEE Transactions on Services Computing.

[42]  Ahmed E. Hassan,et al.  What are developers talking about? An analysis of topics and trends in Stack Overflow , 2014, Empirical Software Engineering.

[43]  Abdulmotaleb El-Saddik,et al.  Folksonomy link prediction based on a tripartite graph for tag recommendation , 2012, Journal of Intelligent Information Systems.

[44]  Fei Xu,et al.  Dual role model for question recommendation in community question answering , 2012, SIGIR '12.

[45]  Feng Xu,et al.  Detecting high-quality posts in community question answering sites , 2015, Inf. Sci..

[46]  Carey E. Priebe,et al.  Community Detection and Classification in Hierarchical Stochastic Blockmodels , 2015, IEEE Transactions on Network Science and Engineering.