An empirical evaluation of a testing and debugging methodology for Excel

Spreadsheets are one of the most commonly used types of programs in the world, and it is important that they be sufficiently dependable. To help end users who create spreadsheets do so more reliably, we have created a testing and debugging methodology and environment for use in spreadsheets, known as the WYSIWYT methodology. Our prior experiments with WYSIWYT show that users can utilize it to ensure that their spreadsheets are more dependable, but these experiments to date have considered only an unfamiliar prototype spreadsheet environment, and have not involved spreadsheet creation tasks. In this work we conducted a controlled experiment that addresses these limitations. The results of this study indicate that the use of WYSIWYT did not affect the correctness of spreadsheets created by users, but it did significantly reduce the amount of effort required to create them. Further, the subjects' evaluation of the help provided by WYSIWYT was very positive. Our results provide several insights into the use of the WYSIWYT methodology by end users.

[1]  Gregg Rothermel,et al.  Scaling a Dataflow Testing Methodology to the MultiparadigmWorld of Commercial Spreadsheets , 2006, 2006 17th International Symposium on Software Reliability Engineering.

[2]  Shriram Krishnamurthi,et al.  A type system for statically detecting spreadsheet errors , 2003, 18th IEEE International Conference on Automated Software Engineering, 2003. Proceedings..

[3]  M. Erwig,et al.  Automatic generation and maintenance of correct spreadsheets , 2005, Proceedings. 27th International Conference on Software Engineering, 2005. ICSE 2005..

[4]  Matthias Felleisen,et al.  Validating the unit correctness of spreadsheet programs , 2004, Proceedings. 26th International Conference on Software Engineering.

[5]  Jorma Sajaniemi Modeling Spreadsheet Audit: A Rigorous Approach to Automatic Visualization , 2000, J. Vis. Lang. Comput..

[6]  Gregg Rothermel,et al.  Slicing spreadsheets: an integrated methodology for spreadsheet testing and debugging , 1999, DSL '99.

[7]  Raymond R. Panko,et al.  What we know about spreadsheet errors , 1998 .

[8]  Gregg Rothermel,et al.  Test reuse in the spreadsheet paradigm , 2002, 13th International Symposium on Software Reliability Engineering, 2002. Proceedings..

[9]  Gregg Rothermel,et al.  Automated test case generation for spreadsheets , 2002, ICSE '02.

[10]  Margaret M. Burnett,et al.  Reasoning about many-to-many requirement relationships in spreadsheets , 2002, Proceedings IEEE 2002 Symposia on Human Centric Computing Languages and Environments.

[11]  G. Rothermel,et al.  An empirical study of fault localization for end-user programmers , 2005, Proceedings. 27th International Conference on Software Engineering, 2005. ICSE 2005..

[12]  Gregg Rothermel,et al.  Incorporating incremental validation and impact analysis into spreadsheet maintenance: an empirical study , 2001, Proceedings IEEE International Conference on Software Maintenance. ICSM 2001.

[13]  Gregg Rothermel,et al.  Testing Homogeneous Spreadsheet Grids with the "What You See Is What You Test" Methodology , 2002, IEEE Trans. Software Eng..

[14]  Gregg Rothermel,et al.  End-user software engineering with assertions in the spreadsheet paradigm , 2003, 25th International Conference on Software Engineering, 2003. Proceedings..

[15]  Margaret M. Burnett,et al.  End-user software visualizations for fault localization , 2003, SoftVis '03.

[16]  Susan Wiedenbeck,et al.  End-user training: an empirical study comparing on-line practice methods , 1995, CHI '95.

[17]  Roland Mittermeir,et al.  Auditing Large Spreadsheet Programs , 2003 .

[18]  Gregg Rothermel,et al.  A methodology for testing spreadsheets , 2001, TSEM.

[19]  Raymond R. Panko,et al.  Spreadsheets on trial: a survey of research on spreadsheet risks , 1996, Proceedings of HICSS-29: 29th Hawaii International Conference on System Sciences.

[20]  Gregg Rothermel,et al.  Testing strategies for form-based visual programs , 1997, Proceedings The Eighth International Symposium on Software Reliability Engineering.

[21]  Brad A. Myers,et al.  Designing the whyline: a debugging interface for asking questions about program behavior , 2004, CHI.

[22]  Margaret M. Burnett,et al.  Effectiveness of end-user debugging software features: are there gender issues? , 2005, CHI.

[23]  Gregg Rothermel,et al.  WYSIWYT testing in the spreadsheet paradigm: an empirical evaluation , 2000, Proceedings of the 2000 International Conference on Software Engineering. ICSE 2000 the New Millennium.

[24]  Markus Clermont Analyzing large spreadsheet programs , 2003, 10th Working Conference on Reverse Engineering, 2003. WCRE 2003. Proceedings..

[25]  Margaret M. Burnett,et al.  Forms/3: A first-order visual language to explore the boundaries of the spreadsheet paradigm , 2001, Journal of Functional Programming.

[26]  Martin Erwig,et al.  Header and Unit Inference for Spreadsheets Through Spatial Analyses , 2004, 2004 IEEE Symposium on Visual Languages - Human Centric Computing.

[27]  Gregg Rothermel,et al.  Harnessing curiosity to increase correctness in end-user programming , 2003, CHI '03.