论文信息 - FastLane: Test Minimization for Rapidly Deployed Large-Scale Online Services

FastLane: Test Minimization for Rapidly Deployed Large-Scale Online Services

Today, we depend on numerous large-scale services for basic operations such as email. These services, built on the basis of Continuous Integration/Continuous Deployment (CI/CD) processes, are extremely dynamic: developers continuously commit code and introduce new features, functionality and fixes. Hundreds of commits may enter the code-base in a single day. Therefore one of the most time-critical, yet resource-intensive tasks towards ensuring code-quality is effectively testing such large code-bases. This paper presents FastLane, a system that performs data-driven test minimization. FastLane uses light-weight machine-learning models built upon a rich history of test and commit logs to predict test outcomes. Tests for which we predict outcomes need not be explicitly run, thereby saving us precious test-time and resources. Our evaluation on a large-scale email and collaboration platform service shows that our techniques can save 18.04%, i.e., almost a fifth of test-time while obtaining a test outcome accuracy of 99.99%.

[1] Andreas Zeller,et al. Predicting faults from cached history , 2008, ISEC '08.

[2] Harald C. Gall,et al. Don't touch my code!: examining the effects of ownership on software quality , 2011, ESEC/FSE '11.

[3] Premkumar T. Devanbu,et al. How, and why, process metrics are better , 2013, 2013 35th International Conference on Software Engineering (ICSE).

[4] Neelam Gupta,et al. Improving Fault Detection Capability by Selectively Retaining Test Cases during Test Suite Reduction , 2007, IEEE Transactions on Software Engineering.

[5] Emanuel Melachrinoudis,et al. Bi-criteria models for all-uses test suite reduction , 2004, Proceedings. 26th International Conference on Software Engineering.

[6] Gregg Rothermel,et al. Test Case Prioritization: A Family of Empirical Studies , 2002, IEEE Trans. Software Eng..

[7] Andreas Zeller,et al. Predicting defects using change genealogies , 2013, 2013 IEEE 24th International Symposium on Software Reliability Engineering (ISSRE).

[8] Daniel M. Germán,et al. What do large commits tell us?: a taxonomical study of large commits , 2008, MSR '08.

[9] Victor R. Basili,et al. A validation of object oriented metrics as quality indicators , 1996 .

[10] Christopher M. Bishop,et al. Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[11] Qi Luo,et al. A large-scale empirical comparison of static and dynamic test case prioritization techniques , 2016, SIGSOFT FSE.

[12] Lingming Zhang,et al. Hybrid Regression Test Selection , 2018, 2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE).

[13] Witold Pedrycz,et al. A comparative analysis of the efficiency of change metrics and static code attributes for defect prediction , 2008, 2008 ACM/IEEE 30th International Conference on Software Engineering.

[14] Gregg Rothermel,et al. Empirical studies of test‐suite reduction , 2002, Softw. Test. Verification Reliab..

[15] Nachiappan Nagappan,et al. Using Software Dependencies and Churn Metrics to Predict Field Failures: An Empirical Case Study , 2007, First International Symposium on Empirical Software Engineering and Measurement (ESEM 2007).

[16] Rajiv Gupta,et al. A methodology for controlling the size of a test suite , 1990, Proceedings. Conference on Software Maintenance 1990.

[17] Mark Harman,et al. Regression testing minimization, selection and prioritization: a survey , 2012, Softw. Test. Verification Reliab..

[18] Ali Mesbah,et al. Fine-Grained Test Minimization , 2018 .

[19] Yoram Singer,et al. Pegasos: primal estimated sub-gradient solver for SVM , 2011, Math. Program..

[20] Mary Jean Harrold,et al. Test-suite reduction and prioritization for modified condition/decision coverage , 2001, Proceedings IEEE International Conference on Software Maintenance. ICSM 2001.

[21] A. Zeller,et al. Predicting Defects for Eclipse , 2007, Third International Workshop on Predictor Models in Software Engineering (PROMISE'07: ICSE Workshops 2007).

[22] Yann-Gaël Guéhéneuc,et al. On the Relationship between Program Evolution and Fault-Proneness: An Empirical Study , 2013, 2013 17th European Conference on Software Maintenance and Reengineering.

[23] J. Friedman. Greedy function approximation: A gradient boosting machine. , 2001 .

[24] Prasoon Goyal,et al. Local Deep Kernel Learning for Efficient Non-linear SVM Prediction , 2013, ICML.

[25] Hairong Kuang,et al. The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).

[26] Gregg Rothermel,et al. Prioritizing test cases for regression testing , 2000, ISSTA '00.

[27] Michael J. Franklin,et al. Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing , 2012, NSDI.

[28] Amin Milani Fard,et al. An empirical study of bugs in test code , 2015, 2015 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[29] Tsong Yueh Chen,et al. A new heuristic for test suite reduction , 1998, Inf. Softw. Technol..

[30] Darko Marinov,et al. An empirical analysis of flaky tests , 2014, SIGSOFT FSE.

[31] James H. Andrews,et al. Testing using log file analysis: tools, methods, and issues , 1998, Proceedings 13th IEEE International Conference on Automated Software Engineering (Cat. No.98EX239).

[32] Lu Zhang,et al. An Empirical Study of Execution-Data Classification Based on Machine Learning , 2012, SEKE.

[33] Ahmed E. Hassan,et al. The Impact of Class Rebalancing Techniques on the Performance and Interpretation of Defect Prediction Models , 2018, IEEE Transactions on Software Engineering.

[34] Tianqi Chen,et al. XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[35] Ramakrishnan Srikant,et al. Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[36] John Micco,et al. Taming Google-Scale Continuous Testing , 2017, 2017 IEEE/ACM 39th International Conference on Software Engineering: Software Engineering in Practice Track (ICSE-SEIP).

[37] Chris F. Kemerer,et al. A Metrics Suite for Object Oriented Design , 2015, IEEE Trans. Software Eng..