Evaluating data from behavioral analysis: visual inspection or statistical models?

Traditional behavior analysis relies upon single-subject study designs and visual inspection of graphed data to evaluate the efficacy of experimental manipulations. Attempts to apply statistical inferential procedures to analyze data have been successfully opposed for many decades, despite problems with visual inspection and increasingly cogent arguments to utilize inferential statistics. In a series of experiments, we show that trained behavior analysts often identify level shifts in responding during intervention phases ('treatment effect') in modestly autocorrelated data, but trends are either misconstrued as level treatment effects or go completely unnoticed. Errors in trend detection illustrate the liabilities of using visual inspection as the sole means by which to analyze behavioral data. Meanwhile, because of greatly increased computer power and advanced mathematical techniques, previously undeveloped or underutilized statistical methods have become far more sophisticated and have been brought to bear on a variety of problems associated with repeated measures data. I present several nonparametric procedures and other statistical techniques to evaluate traditional behavioral data to augment, not replace, visual inspection procedures.

[1]  D. Wolfe,et al.  Testing for ordered alternatives by combining independent distribution-free block statistics , 1977 .

[2]  J. Crosbie,et al.  Interrupted time-series analysis with brief single-subject data. , 1993, Journal of consulting and clinical psychology.

[3]  Bruce E. Wampold,et al.  Randomization tests for multiple-baseline designs. , 1986 .

[4]  Gene S. Fisch,et al.  Visual inspection of data: does the eyeball fit the trend? , 1994, Electronic Imaging.

[5]  Bruce E. Wampold,et al.  The heuristics of visual inference , 1981 .

[6]  G. A. Mack,et al.  On the Use of a Friedman-Type Statistic in Balanced and Unbalanced Block Designs , 1981 .

[7]  H. B. Mann Nonparametric Tests Against Trend , 1945 .

[8]  M Davison,et al.  Statistical inference in behavior analysis: Having my cake and eating it? , 1999, The Behavior analyst.

[9]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1972 .

[10]  B. J. Winer Statistical Principles in Experimental Design , 1992 .

[11]  A. Kazdin,et al.  Artifact, bias, and complexity of assessment: the ABCs of reliability. , 1977, Journal of applied behavior analysis.

[12]  D. Quade Sturdy statistics: nonparametrics and order statistics , 1975 .

[13]  Mark Appelbaum,et al.  Bias in the Analysis of Repeated-Measures Designs: Some Alternative Approaches. , 1973 .

[14]  J. Levin,et al.  Single-case research design and analysis : new directions for psychology and education , 1992 .

[15]  Patrick W. Colgan,et al.  Statistical analysis of behavioural data By Patsy Haccou and Evert Meelis. Oxford University, Oxford, 1992, xvii + 396 pp, ISBN 0-19-854663-7 , 1993, Behavioural Processes.

[16]  B. F. Skinner,et al.  A case history in scientific method. , 1956 .

[17]  John M. Ferron,et al.  Analyzing Single-Case Data: The Power of Randomization Tests. , 1995 .

[18]  Jacob Cohen,et al.  The statistical power of abnormal-social psychological research: a review. , 1962, Journal of abnormal and social psychology.

[19]  A. Deprospero,et al.  Inconsistent visual analyses of intrasubject data. , 1979, Journal of applied behavior analysis.

[20]  B. Wampold,et al.  Intervention effects and relative variation as dimensions in experts' use of visual inference. , 1982, Journal of applied behavior analysis.

[21]  C. Borror Nonparametric Statistical Methods, 2nd, Ed. , 2001 .

[22]  D P Hartmann,et al.  Interrupted time-series analysis and its application to behavioral data. , 1980, Journal of applied behavior analysis.

[23]  B. Wildman,et al.  A probability-based formula for calculating interobserver agreement. , 1977, Journal of applied behavior analysis.

[24]  T. Matyas,et al.  Visual analysis of single-case time series: Effects of variability, serial dependence, and magnitude of intervention effects. , 1990, Journal of applied behavior analysis.