Mining User Opinions to Support Requirement Engineering: An Empirical Study

App reviews provide a rich source of user opinions that can support requirement engineering activities. Analysing them manually to find these opinions, however, is challenging due to their large quantity and noisy nature. To overcome the problem, automated approaches have been proposed for so-called opinion mining. These approaches facilitate the analysis by extracting features discussed in app reviews and identifying their associated sentiments. The effectiveness of these approaches has been evaluated using different methods and datasets. Unfortunately, replicating these studies to confirm their results and to provide benchmarks of different approaches is a challenging problem. We address the problem by extending previous evaluations and performing a comparison of these approaches. In this paper, we present an empirical study in which, we evaluated feature extraction and sentiment analysis approaches on the same dataset. The results show these approaches achieve lower effectiveness than reported originally, and raise an important question about their practical use.

[1]  Thomas Zimmermann,et al.  Information needs for software development analytics , 2012, 2012 34th International Conference on Software Engineering (ICSE).

[2]  Yuanyuan Zhang,et al.  A Survey of App Store Analysis for Software Engineering , 2017, IEEE Transactions on Software Engineering.

[3]  Xiaodong Gu,et al.  "What Parts of Your Apps are Loved by Users?" (T) , 2015, 2015 30th IEEE/ACM International Conference on Automated Software Engineering (ASE).

[4]  Walid Maalej,et al.  User feedback in the appstore: An empirical study , 2013, 2013 21st IEEE International Requirements Engineering Conference (RE).

[5]  Dietmar Pfahl,et al.  Is the SAFE Approach Too Simple for App Feature Extraction? A Replication Study , 2019, REFSQ.

[6]  ThelwallMike,et al.  Sentiment strength detection in short informal text , 2010 .

[7]  Anton van den Hengel,et al.  Image-Based Recommendations on Styles and Substitutes , 2015, SIGIR.

[8]  Dietmar Pfahl,et al.  Simulating the Impact of Annotation Guidelines and Annotated Data on Extracting App Features from App Reviews , 2019, ICSOFT.

[9]  W. Bruce Croft,et al.  Search Engines - Information Retrieval in Practice , 2009 .

[10]  H. Cunningham,et al.  Developing Language Processing Components with GATE , 2001 .

[11]  Walid Maalej,et al.  SAFE: A Simple Approach for Feature Extraction from App Descriptions and App Reviews , 2017, 2017 IEEE 25th International Requirements Engineering Conference (RE).

[12]  John Mylopoulos,et al.  Panel: Context-Dependent Evaluation of Tools for NL RE Tasks: Recall vs. Precision, and Beyond , 2017, 2017 IEEE 25th International Requirements Engineering Conference (RE).

[13]  Ido Dagan,et al.  Synthesis Lectures on Human Language Technologies , 2009 .

[14]  Tung Thanh Nguyen,et al.  Mining User Opinions in Mobile App Reviews: A Keyword-Based Approach (T) , 2015, 2015 30th IEEE/ACM International Conference on Automated Software Engineering (ASE).

[15]  Gabriele Bavota,et al.  Sentiment Analysis for Software Engineering: How Far Can We Go? , 2018, 2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE).

[16]  Andi Rexha,et al.  An unsupervised aspect extraction strategy for monitoring real-time reviews stream , 2019, Inf. Process. Manag..

[17]  Mike Thelwall,et al.  Sentiment in short strength detection informal text , 2010 .

[18]  James Pustejovsky,et al.  Natural Language Annotation for Machine Learning - a Guide to Corpus-Building for Applications , 2012 .

[19]  Walid Maalej,et al.  How Do Users Like This Feature? A Fine Grained Sentiment Analysis of App Reviews , 2014, 2014 IEEE 22nd International Requirements Engineering Conference (RE).

[20]  Anna Perini,et al.  Finding and Analyzing App Reviews Related to Specific Features: A Research Preview , 2019, REFSQ.

[21]  Mark Harman,et al.  App Store Effects on Software Engineering Practices , 2019, IEEE Transactions on Software Engineering.

[22]  Lei Zhang,et al.  Sentiment Analysis and Opinion Mining , 2017, Encyclopedia of Machine Learning and Data Mining.

[23]  Fabiano Dalpiaz,et al.  RE-SWOT: From User Feedback to Requirements via Competitor Analysis , 2019, REFSQ.

[24]  Jan Ole Johanssen,et al.  How do Practitioners Capture and Utilize User Feedback During Continuous Software Engineering? , 2019, 2019 IEEE 27th International Requirements Engineering Conference (RE).

[25]  Andrew Begel,et al.  Analyze this! 145 questions for data scientists in software engineering , 2013, ICSE.