论文信息 - Design and use of reference data sets for testing scientific software

Design and use of reference data sets for testing scientific software

Abstract A general methodology for evaluating the accuracy of the results produced by scientific software has been developed at the National Physical Laboratory. The basis of the approach is the design and use of reference data sets and corresponding reference results to undertake black-box testing. The approach enables reference data sets and results to be generated in a manner consistent with the functional specification of the problem addressed by the software. The results returned by the software for the reference data are compared objectively with the reference results. Quality metrics are used for this purpose that account for the key aspects of the problem. In this paper it is shown how reference data sets can be designed for testing software implementations of solutions to a broad class of problems arising throughout science. It is shown how these data sets can be used in practice and how the results provided by software under test can properly be compared with reference results. The approach is illustrated with three examples: (i) mean and standard deviation, (ii) straight-line fitting, and (iii) principal components analysis. Software for such problems is used routinely in many fields, including optical spectrometry.

M. G. Cox | P. M. Harris | M. Cox | P. Harris

[1] W. Van Snyder. Testing functions of one and two arguments , 1996, Quality of Numerical Software.

[2] Philip E. Gill,et al. Practical optimization , 1981 .

[3] Gene H. Golub,et al. Matrix computations (3rd ed.) , 1996 .

[4] Alistair B. Forbes,et al. Reference software for finding Chebyshev best-fit geometric elements , 1996 .

[5] Gene H. Golub,et al. Matrix computations , 1983 .

[6] Daniel W. Lozier. A proposed software test service for special functions , 1996, Quality of Numerical Software.

[7] Maurice G Cox,et al. Development of data sets for the validation of analytical instrumentation , 1994 .

[8] Michael T. Heath,et al. Scientific Computing , 2018 .

[9] Bernard Butler,et al. A methodology for testing classes of approximation and optimisation , 1996, Quality of Numerical Software.

[10] M G Cox,et al. Strategies for testing form assessment software. , 1999 .

[11] Richard G. Brereton,et al. Chemometrics: Applications of Mathematics and Statistics to Laboratory Systems , 1991 .

[12] Ronald F. Boisvert,et al. The Quality of Numerical Software: Assessment and Enhancement , 1996, Quality of Numerical Software.

[13] Anne Lohrli. Chapman and Hall , 1985 .

[14] J. N. Lyness. Performance profiles and software evaluation , 1978 .

[15] C. W. Clenshaw. Chebyshev series for mathematical functions , 1962 .