An Analysis of Variance Test for Normality (Complete Samples)

The main intent of this paper is to introduce a new statistical procedure for testing a complete sample for normality. The test statistic is obtained by dividing the square of an appropriate linear combination of the sample order statistics by the usual symmetric estimate of variance. This ratio is both scale and origin invariant and hence the statistic is appropriate for a test of the composite hypothesis of normality. Testing for distributional assumptions in general and for normality in particular has been a major area of continuing statistical research-both theoretically and practically. A possible cause of such sustained interest is that many statistical procedures have been derived based on particular distributional assumptions-especially that of normality. Although in many cases the techniques are more robust than the assumptions underlying them, still a knowledge that the underlying assumption is incorrect may temper the use and application of the methods. Moreover, the study of a body of data with the stimulus of a distributional test may encourage consideration of, for example, normalizing transformations and the use of alternate methods such as distribution-free techniques, as well as detection of gross peculiarities such as outliers or errors. The test procedure developed in this paper is defined and some of its analytical properties described in ? 2. Operational information and tables useful in employing the test are detailed in ? 3 (which may be read independently of the rest of the paper). Some examples are given in ? 4. Section 5 consists of an extract from an empirical sampling study of the comparison of the effectiveness of various alternative tests. Discussion and concluding remarks are given in ?6. 2. THE W TEST FOR NORMALITY (COMPLETE SAMPLES) 2 1. Motivation and early work This study was initiated, in part, in an attempt to summarize formally certain indications of probability plots. In particular, could one condense departures from statistical linearity of probability plots into one or a few 'degrees of freedom' in the manner of the application of analysis of variance in regression analysis? In a probability plot, one can consider the regression of the ordered observations on the expected values of the order statistics from a standardized version of the hypothesized distribution-the plot tending to be linear if the hypothesis is true. Hence a possible method of testing the distributional assumptionis by means of an analysis of variance type procedure. Using generalized least squares (the ordered variates are correlated) linear and higher-order

[1]  R. Geary THE RATIO OF THE MEAN DEVIATION TO THE STANDARD DEVIATION AS A TEST OF NORMALITY , 1935 .

[2]  A. C. Aitken IV.—On Least Squares and Linear Combination of Observations , 1936 .

[3]  F. Mosteller,et al.  Low Moments for Small Samples: A Comparative Study of Order Statistics , 1947 .

[4]  N. L. Johnson,et al.  Systems of frequency curves generated by methods of translation. , 1949, Biometrika.

[5]  E. H. Lloyd LEAST-SQUARES ESTIMATION OF LOCATION AND SCALE PARAMETERS USING ORDER STATISTICS , 1952 .

[6]  O. L. Davies,et al.  Design and analysis of industrial experiments , 1954 .

[7]  D. Darling,et al.  A Test of Goodness of Fit , 1954 .

[8]  H. A. David,et al.  THE DISTRIBUTION OF THE RATIO, IN A SINGLE NORMAL SAMPLE, OF RANGE TO STANDARD DEVIATION , 1954 .

[9]  J. Hammersley A Million Random Digits with 100,000 Normal Deviates. , 1955 .

[10]  R. Plackett Linear Estimation from Censored Data , 1958 .

[11]  A. E. Sarhan,et al.  Estimation of Location and Scale Parameters by Order Statistics from Singly and Doubly Censored Samples: Part II. Tables for the Normal Distribution for Samples of Size $11 \leqq n \leqq 15$ , 1958 .

[12]  C. Daniel Use of Half-Normal Plots in Interpreting Factorial Two-Level Experiments , 1959 .

[13]  James Durbin,et al.  Some methods of constructing exact tests , 1961 .

[14]  H. Harter Expected values of normal order statistics , 1961 .

[15]  M. Kendall,et al.  The advanced theory of statistics , 1945 .

[16]  E. S. Pearson,et al.  The ratio of range to standard deviation in the same normal sample , 1964 .