Metrics and barycenters for point pattern data

We introduce the transport–transform and the relative transport–transform metrics between finite point patterns on a general space, which provide a unified framework for earlier point pattern metrics, in particular the generalized spike time and the normalized and unnormalized optimal subpattern assignment metrics. Our main focus is on barycenters, i.e., minimizers of a q -th-order Fréchet functional with respect to these metrics. We present a heuristic algorithm that terminates in a local minimum and is shown to be fast and reliable in a simulation study. The algorithm serves as a general plug-in method that can be applied to point patterns on any state space where an appropriate algorithm for solving the location problem for individual points is available. We present applications to geocoded data of crimes in Euclidean space and on a street network, illustrating that barycenters serve as informative summary statistics. Our work is a first step toward statistical inference in covariate-based models of repeated point pattern observations.

[1]  M. Fréchet Les éléments aléatoires de nature quelconque dans un espace distancié , 1948 .

[2]  Ba-Ngu Vo,et al.  A Consistent Metric for Performance Evaluation of Multi-Object Filters , 2008, IEEE Transactions on Signal Processing.

[3]  Charles D. Woody,et al.  Algorithms for computing spike time distance and point process prototypes with application to feline neuronal responses to acoustic stimuli , 2012, Journal of Neuroscience Methods.

[4]  Steffen Borgwardt,et al.  Discrete Wasserstein barycenters: optimal transport for discrete data , 2015, Mathematical Methods of Operations Research.

[5]  F. Schoenberg,et al.  Description of earthquake aftershock sequences using prototype point patterns , 2007 .

[6]  H. Muller,et al.  Total Variation Regularized Fréchet Regression for Metric-Space Valued Data , 2019, 1904.09647.

[7]  Franz Hlawatsch,et al.  Rate-Distortion Theory of Finite Point Processes , 2017, IEEE Transactions on Information Theory.

[8]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[9]  M. N. M. Lieshout,et al.  ADRIAN BADDELEY, EGE RUBAK, AND ROLF TURNER, Spatial Point Patterns: Methodology and Applications with R. Boca Raton, FL: CRC Press , 2016 .

[10]  Arnaud Doucet,et al.  Fast Computation of Wasserstein Barycenters , 2013, ICML.

[11]  S. L. Hakimi,et al.  Optimum Locations of Switching Centers and the Absolute Centers and Medians of a Graph , 1964 .

[12]  M. Shirosaki Another proof of the defect relation for moving targets , 1991 .

[13]  Aihua Xia,et al.  A new metric between distributions of point processes , 2007, Advances in Applied Probability.

[14]  Steffen Borgwardt,et al.  Improved Linear Programs for Discrete Barycenters , 2018, INFORMS Journal on Optimization.

[15]  Steffen Borgwardt,et al.  An LP-based, strongly-polynomial 2-approximation algorithm for sparse Wasserstein barycenters , 2017, Oper. Res..

[16]  Thomas E. Nichols,et al.  Bayesian log‐Gaussian Cox process regression: applications to meta‐analysis of neuroimaging working memory studies , 2017, Journal of the Royal Statistical Society. Series C, Applied statistics.

[17]  Frits C. R. Spieksma,et al.  Approximation Algorithms for Multi-Dimensional Assignment Problems with Decomposable Costs , 1994, Discret. Appl. Math..

[18]  H. Muller,et al.  Fréchet regression for random objects with Euclidean predictors , 2016, The Annals of Statistics.

[19]  Martin Haenggi,et al.  Stochastic Geometry Analysis of Cellular Networks , 2018 .

[20]  Virgilio Gómez-Rubio,et al.  Spatial Point Patterns: Methodology and Applications with R , 2016 .

[21]  J. Mateu,et al.  On Kernel-Based Intensity Estimation of Spatial Point Patterns on Linear Networks , 2018 .

[22]  H. Muller,et al.  Fréchet analysis of variance for random objects , 2017, Biometrika.

[23]  Raphael Huser,et al.  Point process-based modeling of multiple debris flow landslides using INLA: an application to the 2009 Messina disaster , 2017, Stochastic Environmental Research and Risk Assessment.

[24]  Dominic Schuhmacher,et al.  Bayesian spatial modelling of childhood cancer incidence in Switzerland using exact point data: a nationwide study during 1985–2015 , 2019, International Journal of Health Geographics.

[25]  Tilman M. Davies,et al.  Fast Kernel Smoothing of Point Patterns on a Large Network using Two‐dimensional Convolution , 2019, International Statistical Review.

[26]  Jonathan D. Victor,et al.  Metric-space analysis of spike trains: theory, algorithms and application , 1998, q-bio/0309031.

[27]  David G. Luenberger,et al.  Linear and nonlinear programming , 1984 .

[29]  Juan Antonio Cuesta-Albertos,et al.  Robust clustering tools based on optimal transportation , 2016, Statistics and Computing.

[30]  Guillaume Carlier,et al.  Barycenters in the Wasserstein Space , 2011, SIAM J. Math. Anal..

[31]  Jorge Mateu,et al.  First- and Second-Order Characteristics of Spatio-Temporal Point Processes on Linear Networks , 2020 .

[32]  Giuseppe Savaré,et al.  Optimal Entropy-Transport problems and a new Hellinger–Kantorovich distance between positive measures , 2015, 1508.07941.

[33]  Jonatan A. González,et al.  On measures of dissimilarity between point patterns: Classification based on prototypes and multidimensional scaling , 2015, Biometrical journal. Biometrische Zeitschrift.

[34]  Jean-Luc Starck,et al.  Wasserstein Dictionary Learning: Optimal Transport-based unsupervised non-linear dictionary learning , 2017, SIAM J. Imaging Sci..

[35]  François-Xavier Vialard,et al.  Scaling algorithms for unbalanced optimal transport problems , 2017, Math. Comput..

[36]  H. Kuhn The Hungarian method for the assignment problem , 1955 .

[37]  R. K. Shyamasundar,et al.  Introduction to algorithms , 1996 .

[38]  Josip Lorincz,et al.  What is the Best Spatial Distribution to Model Base Station Density? A Deep Dive into Two European Mobile Networks , 2016, IEEE Access.

[39]  D. Bertsekas The auction algorithm: A distributed relaxation method for the assignment problem , 1988 .

[40]  Peter J. Diggle,et al.  Statistical Analysis of Spatial and Spatio-Temporal Point Patterns , 2013 .