Understanding User’s Interests in NoSQL Databases in Stack Overflow

NoSQL (Not only SQL) has gained popularity with emerging demands of scalable database with big data. Despite the great interest of users toward NoSQL technology, an attempt to analyze how the actual users react to NoSQL has not been made yet. Thus, the present work utilizes question-answer data acquired from Stack Overflow, a question and answer site that works as a large knowledge repository to understand how people perceive NoSQL technology. To this end, LDA topic modeling techniques are used to find out the trend of NoSQL databases. In addition, we proposed topic discrimination value in attempt to find topics that distinguish each NoSQL databases.

[1]  Guan Le,et al.  Survey on NoSQL database , 2011, 2011 6th International Conference on Pervasive Computing and Applications.

[2]  Yang Wang,et al.  Middleware design for integrating relational database and NOSQL based on data dictionary , 2011, Proceedings 2011 International Conference on Transportation, Mechanical, and Electrical Engineering (TMEE).

[3]  Ruslan Salakhutdinov,et al.  Evaluation methods for topic models , 2009, ICML '09.

[4]  Hector Garcia-Molina,et al.  Clustering the tagged web , 2009, WSDM '09.

[5]  Gokhan Tur,et al.  LDA Based Similarity Modeling for Question Answering , 2010, HLT-NAACL 2010.

[6]  Ahmed E. Hassan,et al.  What are developers talking about? An analysis of topics and trends in Stack Overflow , 2014, Empirical Software Engineering.

[7]  Antonio Messina,et al.  Keep It Simple, Fast and Scalable: A Multi-model NoSQL DBMS as an (eb) XML-over-SOAP Service , 2016, 2016 30th International Conference on Advanced Information Networking and Applications Workshops (WAINA).

[8]  Jorge Bernardino,et al.  Choosing the right NoSQL database for the job: a quality attribute evaluation , 2015, Journal of Big Data.

[9]  Alessandro Bozzon,et al.  Sparrows and Owls: Characterisation of Expert Behaviour in StackOverflow , 2014, UMAP.

[10]  Cristian Bucur,et al.  A comparison between several NoSQL databases with comments and notes , 2011, 2011 RoEduNet International Conference 10th Edition: Networking in Education and Research.

[11]  Kuldeep Singh,et al.  NoSQL, A Solution for Distributed Database Management System , 2013 .

[12]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[13]  Hongfei Yan,et al.  Comparing Twitter and Traditional Media Using Topic Models , 2011, ECIR.