A Multidimensional Goodness-of-Fit Test Based on Interpoint Distances

Abstract Distributional assumptions can be examined with multidimensional goodness-of-fit tests. We propose a conceptually simple test with an appealing logic and accessible asymptotic properties, which is generalizable to a variety of problems and appears to work well against diverse alternatives. To test whether a k-dimensional random sample X1, …, X n follows the distribution G, consider a triangle formed by two randomly selected data points X i and X j and a variable Y ∼ G. Our statistic estimates the likelihood that the side formed by the line from X i to X j is the smallest, the middle, or the largest side of the triangle.

[1]  M GREENWOOD,et al.  The statistical study of infectious diseases. , 1946, Journal of the Royal Statistical Society. Series A.

[2]  David S. Moore,et al.  Chi-square tests for multivariate normality with application to common stock prices , 1981 .

[3]  Barry W. Brown,et al.  On estimating the growth function of tumors , 1983 .

[4]  Mark Schilling,et al.  Goodness of Fit Testing in $\mathbb{R}^m$ Based on the Weighted Empirical Distribution of Certain Nearest Neighbor Statistics , 1983 .

[5]  D. Darling On a Class of Problems Related to the Random Division of an Interval , 1953 .

[6]  D. E. Coleman How To Test Normality And Other Distributional Assumptions , 1986 .

[7]  M. Stephens EDF Statistics for Goodness of Fit and Some Comparisons , 1974 .

[8]  N. Henze,et al.  A consistent test for multivariate normality based on the empirical characteristic function , 1988 .

[9]  Regina Y. Liu On a Notion of Data Depth Based on Random Simplices , 1990 .

[10]  Douglas A. Wolfe,et al.  Introduction to the Theory of Nonparametric Statistics. , 1980 .

[11]  Norbert Henze,et al.  A class of invariant consistent tests for multivariate normality , 1990 .

[12]  T. W. Epps,et al.  A test for normality based on the empirical characteristic function , 1983 .

[13]  H. Oja Descriptive Statistics for Multivariate Distributions , 1983 .

[14]  P. Bickel,et al.  Sums of Functions of Nearest Neighbor Distances, Moment Bounds, Limit Theorems and a Goodness of Fit Test , 1983 .

[15]  Lorenzo Fattorini Remarks on the use of Shapiro-Wilk statistic for testing multivariate normality , 1986 .

[16]  S. Jammalamadaka,et al.  Goodness of fit in multidimensions based on nearest neighbour distances , 1993 .

[17]  R. Randles,et al.  Introduction to the Theory of Nonparametric Statistics , 1991 .

[18]  S. Shapiro,et al.  An Analysis of Variance Test for Normality (Complete Samples) , 1965 .

[19]  A. Afifi,et al.  On Tests for Multivariate Normality , 1973 .

[20]  R. Bartoszynski,et al.  Reducing multidimensional two-sample data to one-dimensional interpoint comparisons , 1996 .

[21]  G. Gregory,et al.  Large Sample Theory for $U$-Statistics and Tests of Fit , 1977 .

[22]  Adolfo J. Quiroz,et al.  Some new tests for multivariate normality , 1991 .

[23]  Douglas E. Critchlow,et al.  THE TRIPLES DISTANCE FOR ROOTED BIFURCATING PHYLOGENETIC TREES , 1996 .