PAID: Prioritizing app issues for developers by tracking user reviews over versions

User review analysis is critical to the bug-fixing and version-modification process for app developers. Many research efforts have been put to user review mining in discovering app issues, including laggy user interface, high memory overhead, privacy leakage, etc. Existing exploration of app reviews generally depends on static collections. As a result, they largely ignore the fact that user reviews are tightly related to app versions. Furthermore, the previous approaches require a developer to spend much time on filtering out trivial comments and digesting the informative textual data. This would be labor-intensive especially to popular apps with tremendous reviews. In the paper, we target at designing a framework in Prioritizing App Issues for Developers (PAID) with minimal manual power and good accuracy. The PAID design is based on the fact that the issues presented in the level of phrase, i.e., a couple of consecutive words, can be more easily understood by developers than in long sentences. Hence, we aim at recommending phrase-level issues of an app to its developers by tracking reviews over the release versions of the app. To assist developers in better comprehending the app issues, PAID employs ThemeRiver to visualize the analytical results to developers. Finally, PAID also allows the developers to check the most related reviews, when they want to obtain a deep insight of a certain issue. In contrast to the traditional evaluation methods such as manual labeling or examining the discussion forum, our experimental study exploits the first-hand information from developers, i.e., app changelogs, to measure the performance of PAID. We analyze millions of user reviews from 18 apps with 117 app versions and the results show that the prioritized issues generated by PAID match the official changelogs with high precision.

[1]  Ning Chen,et al.  AR-miner: mining informative reviews for developers from mobile app marketplace , 2014, ICSE.

[2]  Kristina Winbladh,et al.  Analysis of user comments: An approach for software requirements evolution , 2013, 2013 35th International Conference on Software Engineering (ICSE).

[3]  Christos Faloutsos,et al.  Why people hate your app: making sense of user feedback in a mobile app store , 2013, KDD.

[4]  Barry Smyth,et al.  Learning to recommend helpful hotel reviews , 2009, RecSys '09.

[5]  Yasuhiro Hashimoto,et al.  Heat Map Scope Technique for Stacked Time-series Data Visualization , 2012, 2012 16th International Conference on Information Visualisation.

[6]  Andrey A. Puretskiy,et al.  Survey of Text Visualization Techniques , 2010 .

[7]  Mark Harman,et al.  RN / 14 / 12 Life and Death in the App Store : Theory and Analysis of Feature Migration , 2014 .

[8]  John D. Lafferty,et al.  Dynamic topic models , 2006, ICML.

[9]  Judith Gebauer,et al.  User requirements of mobile technology: results from a content analysis of user reviews , 2008, Inf. Syst. E Bus. Manag..

[10]  Zuhair Bandar,et al.  Sentence similarity based on semantic nets and corpus statistics , 2006, IEEE Transactions on Knowledge and Data Engineering.

[11]  Hui Xu,et al.  AR-Tracker: Track the Dynamics of Mobile Apps via User Review Mining , 2015, 2015 IEEE Symposium on Service-Oriented System Engineering.

[12]  Jae-Gil Lee,et al.  Facilitating developer-user interactions with mobile app review digests , 2013, CHI Extended Abstracts.

[13]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[14]  Bojan Cukic,et al.  Defect Prediction between Software Versions with Active Learning and Dimensionality Reduction , 2014, 2014 IEEE 25th International Symposium on Software Reliability Engineering.

[15]  Weiwei Cui,et al.  A visual approach to text corpora comparison , 2010, IUI 2010.

[16]  Robert C. Moore On Log-Likelihood-Ratios and the Significance of Rare Events , 2004, EMNLP.

[17]  Elisabeth Platzer,et al.  Opportunities of automated motive-based user review analysis in the context of mobile app acceptance , 2011 .

[18]  Rachel Harrison,et al.  Retrieving and analyzing mobile apps feature requests from online reviews , 2013, 2013 10th Working Conference on Mining Software Repositories (MSR).

[19]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[20]  Lucy T. Nowell,et al.  ThemeRiver: Visualizing Thematic Changes in Large Document Collections , 2002, IEEE Trans. Vis. Comput. Graph..

[21]  W. Bradford Paley,et al.  TextArc: Showing Word Frequency and Distribution in Text , 2002 .

[22]  Walid Maalej,et al.  How Do Users Like This Feature? A Fine Grained Sentiment Analysis of App Reviews , 2014, 2014 IEEE 22nd International Requirements Engineering Conference (RE).

[23]  Ahmed E. Hassan,et al.  Analyzing and automatically labelling the types of user issues that are raised in mobile app reviews , 2015, Empirical Software Engineering.

[24]  Keun Soo Yim Norming to Performing: Failure Analysis and Deployment Automation of Big Data Software Developed by Highly Iterative Models , 2014, 2014 IEEE 25th International Symposium on Software Reliability Engineering.

[25]  ChengXiang Zhai,et al.  Automatic labeling of multinomial topic models , 2007, KDD '07.

[26]  Kon Mouzakis,et al.  A preliminary analysis of mobile app user reviews , 2012, OZCHI.