Lens data depth and median

We define the lens depth (LD) function LD(t; F) of a vector t∈R d with respect to a distribution function F to be the probability that t is contained in a random hyper-lens formed by the intersection of two closed balls centred at two i.i.d observations from F. We show that LD is a statistical depth function and explore its properties, including affine invariance, symmetry, maximality at the centre and monotonicity. We define the sample LD and investigate its uniform consistency, asymptotic normality and computational complexity in high-dimensional settings. We define the lens median (LM), a multivariate analogue of the univariate median, as the point where the LD is maximised. The sample LM is the vector that is covered by the most number of hyper-lenses formed between any two sample observations. We state its asymptotic consistency and normality and examine its breakdown point and relative efficiency. The sample LM is robust and efficient for estimating the centre of a unimodal distribution. A comparison of LD and LM to existing data depth functions and medians in terms of computational complexity, robustness, efficiency and breakdown point is presented.

[1]  D. Donoho,et al.  Breakdown Properties of Location Estimates Based on Halfspace Depth and Projected Outlyingness , 1992 .

[2]  John A. Nelder,et al.  A Simplex Method for Function Minimization , 1965, Comput. J..

[3]  P. Mahalanobis On the generalized distance in statistics , 1936 .

[4]  Regina Y. Liu On a Notion of Data Depth Based on Random Simplices , 1990 .

[5]  R. Serfling,et al.  General notions of statistical depth function , 2000 .

[6]  Cun-Hui Zhang,et al.  The multivariate L1-median and associated data depth. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[7]  H. Oja Descriptive Statistics for Multivariate Distributions , 1983 .

[8]  J. Kruskal On the shortest spanning subtree of a graph and the traveling salesman problem , 1956 .

[9]  P. Rousseeuw,et al.  Breakdown Points of Affine Equivariant Estimators of Multivariate Location and Covariance Matrices , 1991 .

[10]  R. Serfling Survey on (Some) Nonparametric and Robust Multivariate Methods , 2007 .

[11]  Greg Aloupis,et al.  Geometric Measures of Data Depth , 2022 .

[12]  R. Sokal,et al.  A New Statistical Approach to Geographic Variation Analysis , 1969 .

[13]  E. Giné,et al.  Estimators Related to $U$-Processes with Applications to Multivariate Medians: Asymptotic Normality , 1994 .

[14]  Ryan T. Elmore,et al.  Spherical data depth and a multivariate median , 2003, Data Depth: Robust Multivariate Analysis, Computational Geometry and Applications.

[15]  J. Norris Appendix: probability and measure , 1997 .

[16]  Godfried T. Toussaint,et al.  The relative neighbourhood graph of a finite planar set , 1980, Pattern Recognit..

[17]  Stephen J. Wright,et al.  Numerical Optimization , 2018, Fundamental Statistical Inference.

[18]  Anja Vogler,et al.  An Introduction to Multivariate Statistical Analysis , 2004 .

[19]  T. W. Anderson An Introduction to Multivariate Statistical Analysis , 1959 .

[20]  R. Serfling Equivariance and invariance properties of multivariate quantile and related functions, and the role of standardisation , 2010 .

[21]  Regina Y. Liu,et al.  Multivariate analysis by data depth: descriptive statistics, graphics and inference, (with discussion and a rejoinder by Liu and Singh) , 1999 .

[22]  Ing Rj Ser Approximation Theorems of Mathematical Statistics , 1980 .

[23]  D. Kirkpatrick,et al.  A Framework for Computational Morphology , 1985 .

[24]  謙太郎 野間口,et al.  仮説に制約条件がある場合の Bivariate Sign Test , 1986 .

[25]  J. Tukey Mathematics and the Picturing of Data , 1975 .

[26]  E. Giné,et al.  Limit Theorems for $U$-Processes , 1993 .