Visual Inspection of Data Revisited: Do the Eyes Still Have It?

In behavior analysis, visual inspection of graphic information is the standard by which data are evaluated. Efforts to supplement visual inspection using inferential statistical procedures to assess intervention effects (e.g., analysis of variance or time-series analysis) have met with opposition. However, when serial dependence is present in the data, the use of visual inspection by itself may prove to be problematic. Previously published reports demonstrate that autocorrelated data influence trained observers’ ability to identify level treatment effects and trends that occur in the intervention phase of experiments. In this report, four recent studies are presented in which autoregressive equations were used to produce point-to-point functions to simulate experimental data. In each study, various parameters were manipulated to assess trained observers’ responses to changes in point-to-point functions from the baseline condition to intervention. Level shifts over baseline behavior (treatment effect), as well as no change from baseline (no treatment effect or trend), were most readily identified by observers, but trends were rarely recognized. Furthermore, other factors previously thought to augment and improve observers’ responses had no impact. Results are discussed in terms of the use of visual inspection and the training of behavior analysts.

[1]  D P Hartmann,et al.  Interrupted time-series analysis and its application to behavioral data. , 1980, Journal of applied behavior analysis.

[2]  R. R. Jones,et al.  Effects of serial dependency on the agreement between visual and statistical inference. , 1978, Journal of applied behavior analysis.

[3]  Bradley E. Huitema,et al.  Autocorrelation in applied behavior analysis: A myth. , 1985 .

[4]  G. Barrie Wetherill,et al.  Statistical Process Control , 1991 .

[5]  T Kratochwill,et al.  A further consideration in the application of an analysis-of-variance model for the intrasubject replication design. , 1974, Journal of applied behavior analysis.

[6]  A. R. Crathorne,et al.  Economic Control of Quality of Manufactured Product. , 1933 .

[7]  W. Cleveland,et al.  Graphical Perception: Theory, Experimentation, and Application to the Development of Graphical Methods , 1984 .

[8]  Patsy Haccou,et al.  Statistical Analysis of Behavioural Data: An Approach Based on Time-structured Models , 1992 .

[9]  R McGill,et al.  Graphical Perception and Graphical Methods for Analyzing Scientific Data , 1985, Science.

[10]  J. Crosbie,et al.  Interrupted time-series analysis with brief single-subject data. , 1993, Journal of consulting and clinical psychology.

[11]  B. Törestad Problems and methods in longitudinal research : stability and change , 1991 .

[12]  G. Glass,et al.  Design and analysis of time-series experiments , 1975 .

[13]  R. M. Elliott,et al.  Behavior of Organisms , 1991 .

[14]  Gene S. Fisch,et al.  Visual inspection of data: does the eyeball fit the trend? , 1994, Electronic Imaging.

[15]  S. Lewandowsky,et al.  Discriminating strata in scatterplots , 1989 .

[16]  W. Velicer,et al.  A Comparison of Alternative Approaches to the Analysis of Interrupted Time-Series. , 1985, Multivariate behavioral research.

[17]  Luis A. Escobar,et al.  Statistical Intervals: A Guide for Practitioners , 1991 .

[18]  B. Huitema Autocorrelation in Behavioral Research Wherefore Art Thou , 1986 .

[19]  D. Bailey,et al.  Effects of lines of progress and semilogarithmic charts on ratings of charted data. , 1984, Journal of applied behavior analysis.

[20]  M. Sidman A note on functional relations obtained from group data. , 1952, Psychological bulletin.

[21]  Bruce E. Wampold,et al.  The heuristics of visual inference , 1981 .

[22]  D P Hartmann,et al.  Considerations in the choice of interobserver reliability estimates. , 1977, Journal of applied behavior analysis.

[23]  Estes Wk The problem of inference from curves based on group data. , 1956 .

[24]  T. Matyas,et al.  Visual analysis of single-case time series: Effects of variability, serial dependence, and magnitude of intervention effects. , 1990, Journal of applied behavior analysis.

[25]  A. Kazdin,et al.  Artifact, bias, and complexity of assessment: the ABCs of reliability. , 1977, Journal of applied behavior analysis.

[26]  Stephen M. Stigler,et al.  Estimating serial correlation by visual inspection of diagnostic plots , 1986 .

[27]  T. Kratochwill,et al.  Observer agreement, credibility, and judgment: some considerations in presenting observer agreement data. , 1977, Journal of applied behavior analysis.

[28]  C. Sharpley,et al.  Autocorrelation in behavioral data: An alternative perspective. , 1988 .

[29]  Patrick E. Shrout,et al.  Should Significance Tests be Banned? Introduction to a Special Section Exploring the Pros and Cons , 1997 .

[30]  Wayne F. Velicer,et al.  The Reliability and Accuracy of Time Series Model Identification , 1983 .

[31]  J. Elashoff,et al.  "An analysis-of-variance model for intrasubject replicaiton design": some additional comments. , 1974, Journal of applied behavior analysis.

[32]  Jacob Cohen Statistical Power Analysis for the Behavioral Sciences , 1969, The SAGE Encyclopedia of Research Design.

[33]  W. Edwards Deming,et al.  On the Distinction between Enumerative and Analytic Surveys , 1953 .

[34]  D. Bakan,et al.  A generalization of Sidman's results on group and individual functions, and a criterion. , 1954, Psychological bulletin.

[35]  A. Deprospero,et al.  Inconsistent visual analyses of intrasubject data. , 1979, Journal of applied behavior analysis.

[36]  H J Keselman,et al.  Concerning the statistical procedures enumerated by Gentile et al.: another perspective. , 1974, Journal of applied behavior analysis.

[37]  Thomas A. Matyas,et al.  Problems in the estimation of autocorrelation in brief time series and some implications for behavioral data , 1991 .

[38]  B. Wildman,et al.  A probability-based formula for calculating interobserver agreement. , 1977, Journal of applied behavior analysis.

[39]  I. Spence Visual psychophysics of simple graphical elements. , 1990, Journal of experimental psychology. Human perception and performance.

[40]  D M Baer,et al.  "Perhaps it would be better not to know everything.". , 1977, Journal of applied behavior analysis.

[41]  Larry E. Toothaker,et al.  N = 1 Designs: The Failure of ANOVA-Based Tests , 1983 .

[42]  J. Gentile,et al.  An analysis-of-variance model for the intrasubject replication design. , 1972, Journal of applied behavior analysis.

[43]  K J Ottenbacher,et al.  Reliability and accuracy of visually analyzing graphed data from single-subject designs. , 1986, The American journal of occupational therapy : official publication of the American Occupational Therapy Association.

[44]  M. B. Kelly,et al.  A review of the observational data-collection and reliability procedures reported in The Journal of Applied Behavior Analysis. , 1977, Journal of applied behavior analysis.

[45]  K. A. Ericsson,et al.  Verbal reports as data. , 1980 .

[46]  D. Coleman Statistical Process Control—Theory and Practice , 1993 .

[47]  Bradley E. Huitema Autocorrelation in Behavioral Research , 1986 .

[48]  D J Wheeler,et al.  Applying statistical process control to clinical data: an illustration. , 1992, Journal of applied behavior analysis.

[49]  J Michael,et al.  Statistical inference for individual organism research: mixed blessing or curse? , 1974, Journal of applied behavior analysis.

[50]  B. Wampold,et al.  Intervention effects and relative variation as dimensions in experts' use of visual inference. , 1982, Journal of applied behavior analysis.

[51]  Robert McGill,et al.  Graphical Perception: The Visual Decoding of Quantitative Information on Graphical Displays of Data , 1987 .

[52]  Kenneth M. Greenwood,et al.  Problems with the application of interrupted time series analysis for brief single-subject data , 1990 .

[53]  R. R. Jones,et al.  Time-series analysis in operant research. , 1977, Journal of applied behavior analysis.

[54]  D. Magnusson,et al.  13. Problems and Methods in Longitudinal Research: Stability and Change , 1996 .