Is the selection of statistical methods governed by level of measurement

AbstractThe notion that nonparametric methods are required as a replacement of parametric statistical methods when the scale of measurement in a research study does not achieve a certain level was discussed in light of recent developments in representational measurement theory. A new approach to examining the problem via computer simulation was introduced. Some of the beliefs that have been widely held by psychologists for several decades were examined by means of a computer simulation study that mimicked measurement of an underlying empirical structure and performed two - sample Student t - tests on the resulting sample data. It was concluded that there is no need to replace parametric statistical tests by nonparametric methods when the scale of measurement is ordinal and not interval.Stevens' (1946) classic paper on the theory of scales of measurement triggered one of the longest standing debates in behavioural science methodology. The debate -- referred to as the levels of measurement controversy, or measurement - statistics debate -- is over the use of parametric and nonparametric statistics and its relation to levels of measurement. Stevens (1946; 1951; 1959; 1968), Siegel (1956), and most recently Siegel and Castellan (1988) and Conover (1980) argue that parametric statistics should be restricted to data of interval scale or higher. Furthermore, nonparametric statistics should be used on data of ordinal scale. Of course, since each scale of measurement has all of the properties of the weaker measurement, statistical methods requiring only a weaker scale may be used with the stronger scales. A detailed historical review linking Stevens' work on scales of measurement to the acceptance of psychology as a science, and a pedagogical presentation of fundamental axiomatic (i.e., representational) measurement can be found in Zumbo and Zimmerman (1991).Many modes of argumentation can be seen in the debate about levels of measurement and statistics. This paper focusses almost exclusively on an empirical form of rhetoric using experimental mathematics (Ripley, 1987). The term experimental mathematics comes from mathematical physics. It is loosely defined as the mimicking of the rules of a model of some kind via random processes. In the methodological literature this is often referred to as monte carlo simulation. However, for the purpose of this paper, the terms experimental mathematics or computer simulation are preferred to monte carlo because the latter is typically referred to when examining the robustness of a test in relation to particular statistical assumptions. Measurement level is not an assumption of the parametric statistical model (see Zumbo & Zimmerman, 1991 for a discussion of this issue) and to call the method used herein "monte carlo" would imply otherwise. The term experimental mathematics emphasizes the modelling aspect of the present approach to the debate.The purpose of this paper is to present a new paradigm using experimental mathematics to examine the claims made in the levels of measurement controversy. As Michell (1986) demonstrated, the concern over levels of measurement is inextricably tied to the differing notions of measurement and scaling. Michell further argued that fundamental axiomatic measurement or representational theory (see, for example, Narens & Luce, 1986) is the only measurement theory which implies a relation between measurement scales and statistics. Therefore, the approach advocated in this paper is linked closely to representational theory. The novelty of this approach, to the authors knowledge, is in the use of experimental mathematics to mimic representational measurement. Before describing the methodology used in this paper, we will briefly review its motivation.Admissible TransformationsRepresentational theory began in the late 1950's with Scott and Suppes (1958) and later with Suppes and Zinnes (1963), Pfanzagl (1968), and Krantz, Luce, Suppes & Tversky (1971). …

[1]  Louis Narens,et al.  Measurement: The theory of numerical assignments. , 1986 .

[2]  W. W. Stine,et al.  Meaningful inference: The role of measurement in statistics. , 1989 .

[3]  J. Leeuw,et al.  Abstract Measurement Theory. , 1986 .

[4]  Johann Pfanzagl,et al.  Theory of measurement , 1970 .

[5]  M. R. Novick,et al.  Statistical Theories of Mental Test Scores. , 1971 .

[6]  D. W. Zimmerman,et al.  The Relative Power of the Wilcoxon-Mann-Whitney Test and Student t Test Under Simple Bounded Transformations. , 1990, The Journal of general psychology.

[7]  R. Iman,et al.  Rank Transformations as a Bridge between Parametric and Nonparametric Statistics , 1981 .

[8]  C. West Churchman,et al.  Measurement: Definitions and Theories , 1960 .

[9]  T. Gregoire,et al.  Analysis of ordinal data to detect population differences. , 1987 .

[10]  L. Devroye Non-Uniform Random Variate Generation , 1986 .

[11]  Patrick Suppes,et al.  Basic measurement theory , 1962 .

[12]  D. W. Zimmerman A Simplified Probability Model of Error of Measurement , 1969 .

[13]  S. S. Stevens Mathematics, measurement, and psychophysics. , 1951 .

[14]  C. A. Boneau,et al.  The effects of violations of assumptions underlying the test. , 1960, Psychological bulletin.

[15]  R. Lehman Computer simulation and modeling : an introduction , 1978 .

[16]  F. Roberts Measurement Theory with Applications to Decisionmaking, Utility, and the Social Sciences: Measurement Theory , 1984 .

[17]  Frederic M. Lord,et al.  On the Statistical Treatment of Football Numbers. , 1953 .

[18]  S. S. Steivens Measurement, Statistics, and the Schemapiric View , 1968 .

[19]  A. Tversky,et al.  Foundations of Measurement, Vol. I: Additive and Polynomial Representations , 1991 .

[20]  W. J. Conover,et al.  Practical Nonparametric Statistics , 1972 .

[21]  Ernest W. Adams,et al.  A theory of appropriate statistics , 1965, Psychometrika.

[22]  P. A. W. Lewis,et al.  A Pseudo-Random Number Generator for the System/360 , 1969, IBM Syst. J..

[23]  James T. Townsend,et al.  Measurement Scales and Statistics: The Misconception Misconceived , 1984 .

[24]  William W. Rozeboom,et al.  Foundations of the theory of prediction , 1966 .

[25]  Ian T. Jolliffe,et al.  Simulation Methodology for Statisticians, Operations Analysts, and Engineers, Vol. 1. , 1990 .

[26]  Eugene Galanter,et al.  Handbook of mathematical psychology: I. , 1963 .

[27]  W. W. Stine Interobserver relational agreement , 1989 .

[28]  J. Michell Measurement scales and statistics: A clash of paradigms. , 1986 .

[29]  S. Siegel,et al.  Nonparametric Statistics for the Behavioral Sciences , 2022, The SAGE Encyclopedia of Research Design.

[30]  M. E. Muller,et al.  A Note on the Generation of Random Normal Deviates , 1958 .

[31]  R. Allan Reese,et al.  Practical Nonparametric Statistics , 1983 .

[32]  Mark L. Davison,et al.  Parametric statistics and levels of measurement : factorial designs and multiple regression , 1990 .

[33]  S S Stevens,et al.  On the Theory of Scales of Measurement. , 1946, Science.

[34]  D. W. Zimmerman Test Theory With Minimal Assumptions , 1976 .

[35]  S. S. Stevens,et al.  Handbook of experimental psychology , 1951 .

[36]  Donald W. Zimmerman Probability spaces, hilbert spaces, and the axioms of test theory , 1975 .

[37]  Brian D. Ripley,et al.  Stochastic Simulation , 2005 .

[38]  B. Morgan Elements of Simulation , 1984 .

[39]  Patrick Suppes,et al.  Foundational aspects of theories of measurement , 1958, Journal of Symbolic Logic.

[40]  Mark L. Davison,et al.  Parametric statistics and levels of measurement. , 1988 .

[41]  Stanley Wasserman,et al.  A Handbook for Data Analysis in the Behavioral Sciences:@@@(Vol. 1) Statistical Issues@@@(Vol. 2) Methodological Issues. , 1994 .

[42]  S. Sawilowsky Nonparametric Tests of Interaction in Experimental Design , 1990 .

[43]  R. Blair,et al.  A more realistic look at the robustness and Type II error properties of the t test to departures from population normality. , 1992 .

[44]  J. J. Higgins,et al.  An Investigation of the Type I Error and Power Properties of the Rank Transform Procedure in Factorial ANOVA , 1989 .

[45]  J. L. Rasmussen Analysis of Likert-scale data: A reinterpretation of Gregoire and Driver. , 1989 .

[46]  Bruno D. Zumbo,et al.  The relative power of parametric and nonparametric statistical methods. , 1993 .