Measuring the Diversity of a Test Set With Distance Entropy

Most existing metrics that we call white-box metrics, such as coverage metrics, require white-box information, like program structure information, and historical runtime information, to evaluate the fault detection capability of a test set. In practice, such white-box information is usually unavailable or difficult to obtain, which means they often cannot be used. In this paper, we propose a black-box metric, distance entropy, based on the diversification idea behind many published diversity-based techniques. Distance entropy provides a possible solution for test set evaluation when white-box information is not available. The empirical study illustrates that distance entropy can effectively evaluate test sets if the distance metric between tests is well defined. Meanwhile, distance entropy outperforms simple diversity metrics without increasing time complexity.

[1]  William G. Cochran,et al.  Experimental Designs, 2nd Edition , 1950 .

[2]  Hong Cheng,et al.  Identifying bug signatures using discriminative graph mining , 2009, ISSTA.

[3]  Lionel C. Briand,et al.  Adaptive random testing: an illusion of effectiveness? , 2011, ISSTA '11.

[4]  Lionel C. Briand,et al.  Achieving scalable model-based testing through test case diversity , 2013, TSEM.

[5]  Vishwani D. Agrawal,et al.  An Information Theoretic Approach to Digital Fault Testing , 1981, IEEE Transactions on Computers.

[6]  Gregg Rothermel,et al.  An experimental determination of sufficient mutant operators , 1996, TSEM.

[7]  Mark Harman,et al.  Augmenting test suites effectiveness by increasing output diversity , 2012, 2012 34th International Conference on Software Engineering (ICSE).

[8]  Andreas Zeller,et al.  Covering and Uncovering Equivalent Mutants , 2013, Softw. Test. Verification Reliab..

[9]  Sarfraz Khurshid,et al.  A family of generalized entropies and its application to software fault localization , 2012, 2012 6th IEEE International Conference Intelligent Systems.

[10]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[11]  Lionel C. Briand,et al.  Empirical Investigation of the Effects of Test Suite Properties on Similarity-Based Test Case Selection , 2011, 2011 Fourth IEEE International Conference on Software Testing, Verification and Validation.

[12]  Yan Zhou,et al.  Clustering with Minimum Spanning Trees , 2011, Int. J. Artif. Intell. Tools.

[13]  Lionel C. Briand,et al.  An Industrial Investigation of Similarity Measures for Model-Based Test Case Selection , 2010, 2010 IEEE 21st International Symposium on Software Reliability Engineering.

[14]  Tsong Yueh Chen,et al.  Adaptive Random Testing: The ART of test case diversity , 2010, J. Syst. Softw..

[15]  Borislav Nikolik,et al.  Test diversity , 2006, Inf. Softw. Technol..

[16]  Atif M. Memon,et al.  Introducing a test suite similarity metric for event sequence-based test cases , 2009, 2009 IEEE International Conference on Software Maintenance.

[17]  Reid Holmes,et al.  Coverage is not strongly correlated with test suite effectiveness , 2014, ICSE.

[18]  Lionel C. Briand,et al.  Is mutation an appropriate tool for testing experiments? , 2005, ICSE.

[19]  Jacob Cohen Statistical Power Analysis for the Behavioral Sciences , 1969, The SAGE Encyclopedia of Research Design.

[20]  A. Jefferson Offutt,et al.  Combination testing strategies: a survey , 2005, Softw. Test. Verification Reliab..

[21]  Mark Harman,et al.  An Analysis and Survey of the Development of Mutation Testing , 2011, IEEE Transactions on Software Engineering.

[22]  Richard J. Lipton,et al.  Hints on Test Data Selection: Help for the Practicing Programmer , 1978, Computer.

[23]  A. Jefferson Offutt,et al.  Introduction to Software Testing , 2008 .

[24]  A. Baron Experimental Designs , 1990, The Behavior analyst.

[25]  R. K. Shyamasundar,et al.  Introduction to algorithms , 1996 .

[26]  Tao Xie,et al.  Is operator-based mutant selection superior to random mutant selection? , 2010, 2010 ACM/IEEE 32nd International Conference on Software Engineering.

[27]  Rong Jin,et al.  Distance Metric Learning: A Comprehensive Survey , 2006 .

[28]  Sarfraz Khurshid,et al.  Regression mutation testing , 2012, ISSTA 2012.

[29]  Mark Harman,et al.  Fault localization prioritization: Comparing information-theoretic and coverage-based approaches , 2013, TSEM.

[30]  Alexandre Petrenko,et al.  Using String Distances for Test Case Prioritisation , 2009, 2009 IEEE/ACM International Conference on Automated Software Engineering.

[31]  Lionel C. Briand,et al.  Reducing the Cost of Model-Based Testing through Test Case Diversity , 2010, ICTSS.

[32]  Huai Liu,et al.  On Test Case Distributions of Adaptive Random Testing , 2007, SEKE.

[33]  Pierre A. Humblet,et al.  A Distributed Algorithm for Minimum-Weight Spanning Trees , 1983, TOPL.

[34]  T. Y. Chen,et al.  Adaptive Random Testing , 2004, ASIAN.

[35]  Andreas Zeller,et al.  The Impact of Equivalent Mutants , 2009, 2009 International Conference on Software Testing, Verification, and Validation Workshops.

[36]  M. E. Johnson,et al.  Minimax and maximin distance designs , 1990 .

[37]  Petri Ihantola,et al.  Mutation analysis vs. code coverage in automated assessment of students' testing skills , 2010, SPLASH/OOPSLA Companion.

[38]  Tony Gorschek,et al.  Searching for Cognitively Diverse Tests: Towards Universal Test Diversity Metrics , 2008, 2008 IEEE International Conference on Software Testing Verification and Validation Workshop.

[39]  Gregg Rothermel,et al.  Supporting Controlled Experimentation with Testing Techniques: An Infrastructure and its Potential Impact , 2005, Empirical Software Engineering.

[40]  Mark Harman,et al.  Clustering test cases to achieve effective and scalable prioritisation incorporating expert knowledge , 2009, ISSTA.

[41]  Akbar Siami Namin,et al.  The influence of size and coverage on test suite effectiveness , 2009, ISSTA.

[42]  Tsong Yueh Chen,et al.  An upper bound on software testing effectiveness , 2008, TSEM.

[43]  Nadine Mandran,et al.  Prioritizing test cases with string distances , 2011, Automated Software Engineering.