On a Criterion for the Rejection of Observations and the Distribution of the Ratio of Deviation to Sample Standard Deviation

Criteria for the rejection of outlying observations may be designed to reject a given fraction of all observations, or a proportion varying with the size of the sample. Irwin' has discussed several criteria based on sampling from a normal population which had been used previously, as well as one which he proposed. This is based on the principal of fixing the expectation of rejecting an observation from a sample independently of the aggregate number, N, of the sample. The criterion, X, is b/a times the interval between successive observations in ascending order of magnitude, where a is the standard deviation of the sampled population. In the same paper he gave, for different values of N, a table of P1(X) and P2(X), respectively probabffities of exceeding given values of X for the first or second such interval from either end. In actual use, however, a is estimated from the sample standard deviation, and we are left to decide whether observations in question are to be included or not in estimating the standard deviation as also whether or not to modify this by addition or subtraction of an estimate of its probable error. The object of the present communication is to develop a criterion free from defects of this nature, depending only on the assumption of random sampling from a normal universe. For this purpose we develop the distribution of T defined by