New test for the multivariate two-sample problem based on the concept of minimum energy

We introduce a new statistical quantity, the energy, to test whether two samples originate from the same distributions. The energy is a simple logarithmic function of the distances of the observations in the variate space. The distribution of the test statistic is determined by a resampling method. The power of the energy test in one dimension was studied for a variety of different test samples and compared to several nonparametric tests. In two and four dimensions, a comparison was performed with the Friedman–Rafsky and nearest neighbor tests. The two-sample energy test was shown to be especially powerful in multidimensional applications.

[1]  Herbert Büning Robuste und adaptive Tests , 1991 .

[2]  H. Büning,et al.  KOLMOGOROV-SMIRNOV- AND CRAMÈR-VON MISES TYPE TWO-SAMPLE TESTS WITH VARIOUS WEIGHT FUNCTIONS , 2001 .

[3]  Y. Lepage A combination of Wilcoxon's and Ansari-Bradley's statistics , 1971 .

[4]  Julia Kastner,et al.  Introduction to Robust Estimation and Hypothesis Testing , 2005 .

[5]  N. Henze,et al.  On the multivariate runs test , 1999 .

[6]  B. S. Duran A survey of nonparametric tests for scale , 1976 .

[7]  D. W. Scott,et al.  Multivariate Density Estimation, Theory, Practice and Visualization , 1992 .

[8]  M. Schilling Multivariate Two-Sample Tests Based on Nearest Neighbors , 1986 .

[9]  J. Wolfowitz,et al.  On a Test Whether Two Samples are from the Same Population , 1940 .

[10]  Helmut Rieder Robuste und adaptive tests , 1995 .

[11]  L. Devroye Non-Uniform Random Variate Generation , 1986 .

[12]  J. Friedman,et al.  Multivariate generalizations of the Wald--Wolfowitz and Smirnov two-sample tests , 1979 .

[13]  N. Henze A MULTIVARIATE TWO-SAMPLE TEST BASED ON THE NUMBER OF NEAREST NEIGHBOR TYPE COINCIDENCES , 1988 .

[14]  David W. Scott,et al.  Multivariate Density Estimation: Theory, Practice, and Visualization , 1992, Wiley Series in Probability and Statistics.

[15]  M. E. Johnson,et al.  A Comparative Study of Tests for Homogeneity of Variances, with Applications to the Outer Continental Shelf Bidding Data , 1981 .

[16]  A. Bowman,et al.  Adaptive Smoothing and Density-Based Tests of Multivariate Normality , 1993 .

[17]  Joachim H. Ahrens,et al.  Pseudo-random numbers , 2005, Computing.