SUMMARY It is proposed that bivariate data should be trimmed of those points which define the convex hull, for robust estimation of the product-moment correlation coefficient. Properties of this method are examined by a Monte Carlo investigation. Other applications are mentioned. THE product-moment correlation coefficient, like many other parametric estimators, is sensitive to outliers and disturbances in the tails of the bivariate distribution of quantitative variables. For this reason there is much to recommend the routine application of a trimming procedure before this statistic is calculated. Previous authors, e.g. Nath (1971), have investigated what may be termed a rectangular trimming procedure: that is, each distribution independently is truncated in each tail. While this method has the virtue of simplicity, there may be certain objections to its use. Firstly, it takes no account of the multivariate structure of the data. This may well be important particularly if the truncated sample will be used in more complex multivariate procedures. Secondly, the rectangular trimmed product-moment correlation coefficient is almost certain to be biased toward zero as an estimate of the population correlation. Nath (1971) gives an example of a correlation of 079 which reduced to 065 on single truncation of either distribution resulting in 23 per cent of the sample being eliminated. A bias correction may be calculated assuming a bivariate normal distribution, but is computationally tedious, even though Dyer (1973) has proposed a method avoiding complex iteration.
[1]
J. Tukey.
A survey of sampling from contaminated distributions
,
1960
.
[2]
B. Efron.
The convex hull of a random set of points
,
1965
.
[3]
G. Baikunth Nath,et al.
Estimation in Truncated Bivariate Normal Distributions
,
1971
.
[4]
C. Kowalski.
On the Effects of Non‐Normality on the Distribution of the Sample Product‐Moment Correlation Coefficient
,
1972
.
[5]
Bell Telephone,et al.
ROBUST ESTIMATES, RESIDUALS, AND OUTLIER DETECTION WITH MULTIRESPONSE DATA
,
1972
.
[6]
Danny D. Dyer.
On Moments Estimation of the Parameters of a Truncated Bivariate Normal Distribution
,
1973
.
[7]
Henry R. Neave,et al.
On Using the Box‐Müller Transformation with Multiplicative Congruential Pseudo‐Random Number Generators
,
1973
.
[8]
V. Barnett.
The Ordering of Multivariate Data
,
1976
.