论文信息 - Studying software logging using topic models

Studying software logging using topic models

Software developers insert logging statements in their source code to record important runtime information; such logged information is valuable for understanding system usage in production and debugging system failures. However, providing proper logging statements remains a manual and challenging task. Missing an important logging statement may increase the difficulty of debugging a system failure, while too much logging can increase system overhead and mask the truly important information. Intuitively, the actual functionality of a software component is one of the major drivers behind logging decisions. For instance, a method maintaining network communications is more likely to be logged than getters and setters. In this paper, we used automatically-computed topics of a code snippet to approximate the functionality of a code snippet. We studied the relationship between the topics of a code snippet and the likelihood of a code snippet being logged (i.e., to contain a logging statement). Our driving intuition is that certain topics in the source code are more likely to be logged than others. To validate our intuition, we conducted a case study on six open source systems, and we found that i) there exists a small number of “log-intensive” topics that are more likely to be logged than other topics; ii) each pair of the studied systems share 12% to 62% common topics, and the likelihood of logging such common topics has a statistically significant correlation of 0.35 to 0.62 among all the studied systems; and iii) our topic-based metrics help explain the likelihood of a code snippet being logged, providing an improvement of 3% to 13% on AUC and 6% to 16% on balanced accuracy over a set of baseline metrics that capture the structural information of a code snippet. Our findings highlight that topics contain valuable information that can help guide and drive developers’ logging decisions.

[1] Abram Hindle,et al. Do topics make sense to managers and developers? , 2014, Empirical Software Engineering.

[2] Anas N. Al-Rabadi,et al. A comparison of modified reconstructability analysis and Ashenhurst‐Curtis decomposition of Boolean functions , 2004 .

[3] Domenico Cotroneo,et al. Industry Practices and Event Logging: Assessment of a Critical Software Development Process , 2015, 2015 IEEE/ACM 37th IEEE International Conference on Software Engineering.

[4] Tibor Gyimóthy,et al. Modeling class cohesion as mixtures of latent topics , 2009, 2009 IEEE International Conference on Software Maintenance.

[5] Trevor Hastie,et al. Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent. , 2011, Journal of statistical software.

[6] Denys Poshyvanyk,et al. Using Latent Dirichlet Allocation for automatic categorization of software , 2009, 2009 6th IEEE International Working Conference on Mining Software Repositories.

[7] Yang Liu,et al. Be conservative: enhancing failure diagnosis with proactive logging , 2012, OSDI 2012.

[8] Bin Li,et al. Modeling the evolution of development topics using Dynamic Topic Models , 2015, 2015 IEEE 22nd International Conference on Software Analysis, Evolution, and Reengineering (SANER).

[9] Michael I. Jordan,et al. Detecting large-scale system problems by mining console logs , 2009, SOSP '09.

[10] Trevor Hastie,et al. Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[11] Qiang Fu,et al. Where do developers log? an empirical study on logging practices in industry , 2014, ICSE Companion.

[12] Michael I. Jordan,et al. Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[13] Bin Li,et al. Exploring topic models in software engineering data analysis: A survey , 2016, 2016 17th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD).

[14] Chong Wang,et al. Reading Tea Leaves: How Humans Interpret Topic Models , 2009, NIPS.

[15] Ahmed E. Hassan,et al. Validating the Use of Topic Models for Software Evolution , 2010, 2010 10th IEEE Working Conference on Source Code Analysis and Manipulation.

[16] Ahmed E. Hassan,et al. Studying the relationship between logging characteristics and the code quality of platform software , 2015, Empirical Software Engineering.

[17] Sushil Krishna Bajracharya,et al. A theory of aspects as latent topics , 2008, OOPSLA.

[18] Gabriele Bavota,et al. Methodbook: Recommending Move Method Refactorings via Relational Topic Models , 2014, IEEE Transactions on Software Engineering.

[19] Michael W. Godfrey,et al. An Exploratory Study of the Evolution of Communicated Information about the Execution of Large Software Systems , 2011, WCRE.

[20] Jeffrey S. Chase,et al. Correlating Instrumentation Data to System States: A Building Block for Automated Diagnosis and Control , 2004, OSDI.

[21] Max Kuhn,et al. Applied Predictive Modeling , 2013 .

[22] R. Ledesma,et al. Cliff's Delta Calculator: A non-parametric effect size program for two groups of observations , 2010 .

[23] Cor-Paul Bezemer,et al. Examining the stability of logging statements , 2016, 2016 IEEE 23rd International Conference on Software Analysis, Evolution, and Reengineering (SANER).

[24] Yu Luo,et al. Simple Testing Can Prevent Most Critical Failures: An Analysis of Production Failures in Distributed Data-Intensive Systems , 2014, OSDI.

[25] Ahmed E. Hassan,et al. CacheOptimizer: helping developers configure caching frameworks for hibernate-based database-centric web applications , 2016, SIGSOFT FSE.

[26] Ahmed E. Hassan,et al. Explaining software defects using topic models , 2012, 2012 9th IEEE Working Conference on Mining Software Repositories (MSR).

[27] Zhen Ming Jiang,et al. Characterizing and Detecting Anti-Patterns in the Logging Code , 2017, 2017 IEEE/ACM 39th International Conference on Software Engineering (ICSE).

[28] François Yvon,et al. Using LDA to detect semantically incoherent documents , 2008, CoNLL.

[29] Bin Li,et al. What Information in Software Historical Repositories Do We Need to Support Software Maintenance Tasks? An Approach Based on Topic Model , 2015, Computer and Information Science.

[30] Manabu Ichino. Similarity and Dissimilarity Measures for Mixed Feature-type Symbolic Data , 2016 .

[31] Yann-Gaël Guéhéneuc,et al. Feature Location Using Probabilistic Ranking of Methods Based on Execution Scenarios and Information Retrieval , 2007, IEEE Transactions on Software Engineering.

[32] Cristina V. Lopes,et al. An Application of Latent Dirichlet Allocation to Analyzing Software Evolution , 2008, 2008 Seventh International Conference on Machine Learning and Applications.

[33] Robert Kabacoff. R in Action , 2011 .

[34] Stéphane Ducasse,et al. Semantic clustering: Identifying topics in source code , 2007, Inf. Softw. Technol..

[35] A. Ardeshir Goshtasby. Image Registration: Principles, Tools and Methods , 2012 .

[36] Andrea De Lucia,et al. Labeling source code with information retrieval methods: an empirical study , 2013, Empirical Software Engineering.

[37] Armando Fox,et al. Ensembles of models for automated diagnosis of system performance problems , 2005, 2005 International Conference on Dependable Systems and Networks (DSN'05).

[38] Tu Minh Phuong,et al. Topic-based defect prediction. , 2011, ICSE 2011.

[39] Leonardo Mariani,et al. Automated Identification of Failure Causes in System Logs , 2008, 2008 19th International Symposium on Software Reliability Engineering (ISSRE).

[40] Daniel Jurafsky,et al. Studying the History of Ideas Using Topic Models , 2008, EMNLP.

[41] Ding Yuan,et al. SherLog: error diagnosis by connecting clues from run-time logs , 2010, ASPLOS XV.

[42] Ying Zou,et al. Towards just-in-time suggestions for log changes , 2016, Empirical Software Engineering.

[43] Andrea De Lucia,et al. Using IR methods for labeling source code artifacts: Is it worthwhile? , 2012, 2012 20th IEEE International Conference on Program Comprehension (ICPC).

[44] Tu Minh Phuong,et al. Topic-based defect prediction: NIER track , 2011, 2011 33rd International Conference on Software Engineering (ICSE).

[45] Ahmed E. Hassan,et al. Topic-based software defect explanation , 2017, J. Syst. Softw..

[46] Wei Xu,et al. Advances and challenges in log analysis , 2011, Commun. ACM.

[47] Andrea De Lucia,et al. Parameterizing and Assembling IR-Based Solutions for SE Tasks Using Genetic Algorithms , 2016, 2016 IEEE 23rd International Conference on Software Analysis, Evolution, and Reengineering (SANER).

[48] Ahmed E. Hassan,et al. Modeling the evolution of topics in source code histories , 2011, MSR '11.

[49] Andrea De Lucia,et al. How to effectively use topic models for software engineering tasks? An approach based on Genetic Algorithms , 2013, 2013 35th International Conference on Software Engineering (ICSE).

[50] Yang Xiao,et al. Linux auditing: Overhead and adaptation , 2015, 2015 IEEE International Conference on Communications (ICC).

[51] Thomas L. Griffiths,et al. Probabilistic Topic Models , 2007 .

[52] Heng Li,et al. Which log level should developers choose for a new logging statement? , 2017, Empirical Software Engineering.

[53] Robert L. Mercer,et al. Class-Based n-gram Models of Natural Language , 1992, CL.

[54] Michael J. Campbell,et al. Statistics at Square One , 1976, British medical journal.

[55] Ahmed E. Hassan,et al. Analytics-Driven Load Testing: An Industrial Experience Report on Load Testing of Large-Scale Systems , 2017, 2017 IEEE/ACM 39th International Conference on Software Engineering: Software Engineering in Practice Track (ICSE-SEIP).

[56] Avinash C. Kak,et al. Retrieval from software libraries for bug localization: a comparative study of generic and composite text models , 2011, MSR '11.

[57] Ahmed E. Hassan,et al. Leveraging Performance Counters and Execution Logs to Diagnose Memory-Related Performance Issues , 2013, 2013 IEEE International Conference on Software Maintenance.

[58] Ahmed E. Hassan,et al. Studying software evolution using topic models , 2014, Sci. Comput. Program..

[59] Ahmed E. Hassan,et al. A survey on the use of topic models when mining software repositories , 2015, Empirical Software Engineering.

[60] Richard N. Taylor,et al. Software traceability with topic modeling , 2010, 2010 ACM/IEEE 32nd International Conference on Software Engineering.

[61] Ashish Sureka,et al. LogOpt: Static Feature Extraction from Source Code for Automated Catch Block Logging Prediction , 2016, ISEC.

[62] Ding Yuan,et al. Characterizing logging practices in open-source software , 2012, 2012 34th International Conference on Software Engineering (ICSE).

[63] Qiang Fu,et al. Learning to Log: Helping Developers Make Informed Logging Decisions , 2015, 2015 IEEE/ACM 37th IEEE International Conference on Software Engineering.

[64] Alexander Golbraikh,et al. Does Rational Selection of Training and Test Sets Improve the Outcome of QSAR Modeling? , 2012, J. Chem. Inf. Model..

[65] Richard A. Groeneveld,et al. Measuring Skewness and Kurtosis , 1984 .

[66] Ding Yuan,et al. Improving Software Diagnosability via Log Enhancement , 2012, TOCS.

[67] Santonu Sarkar,et al. Mining business topics in source code using latent dirichlet allocation , 2008, ISEC '08.

[68] David W. Binkley,et al. Understanding LDA in source code analysis , 2014, ICPC 2014.

[69] Andrew McCallum,et al. Rethinking LDA: Why Priors Matter , 2009, NIPS.

[70] R. Tibshirani. Regression Shrinkage and Selection via the Lasso , 1996 .

[71] Michael English,et al. An empirical analysis of information retrieval based concept location techniques in software comprehension , 2008, Empirical Software Engineering.

[72] Hareton K. N. Leung,et al. MSR4SM: Using topic models to effectively mining software repositories for software maintenance tasks , 2015, Inf. Softw. Technol..

[73] J. Bring. How to Standardize Regression Coefficients , 1994 .

[74] อนิรุธ สืบสิงห์,et al. Data Mining Practical Machine Learning Tools and Techniques , 2014 .