Subtrajectory Clustering: Finding Set Covers for Set Systems of Subcurves

We study subtrajectory clustering under the Fréchet distance. Given one or more trajectories, the task is to split the trajectories into several parts, such that the parts have a good clustering structure. We approach this problem via a new set cover formulation, which we think provides a natural formalization of the problem as it is studied in many applications. Given a polygonal curve P with n vertices in fixed dimension, integers k, ` ≥ 1, and a real value ∆ > 0, the goal is to find k center curves of complexity at most ` such that every point on P is covered by a subtrajectory that has small Fréchet distance to one of the k center curves (≤ ∆). In many application scenarios, one is interested in finding clusters of small complexity, which is controlled by the parameter `. Our main result is a tri-criterial approximation algorithm: if there exists a solution for given parameters k, `, and ∆, then our algorithm finds a set of k′ center curves of complexity at most `′ with covering radius ∆′ with k′ ∈ O(k` log(k`)), `′ ≤ 2`, and ∆′ ≤ 19∆. Moreover, within these approximation bounds, we can minimize k while keeping the other parameters fixed. If ` is a constant independent of n, then, the approximation factor for the number of clusters k is O(log k) and the approximation factor for the radius ∆ is constant. In this case, the algorithm has expected running time in Õ ( km +mn ) and uses space in O(n+m), where m = d L ∆e and L is the total arclength of the curve P . For the important case of clustering with line segments (`=2) we obtain bi-criteria approximation algorithms, where the approximation criteria are the number of clusters and the radius of the clustering. Department of Computer Science, University of Massachusetts Lowell, USA. Department of Computer Science, University of Bonn, Germany Department of Computer Science, Saint Louis University, USA. Hausdorff Center for Mathematics, University of Bonn, Germany. ar X iv :2 10 3. 06 04 0v 2 [ cs .C G ] 1 9 A ug 2 02 1

[1]  Kyle Fox,et al.  Subtrajectory Clustering: Models and Algorithms , 2018, PODS.

[2]  Micha Sharir,et al.  Efficient algorithms for geometric optimization , 1998, CSUR.

[3]  Vasek Chvátal,et al.  A Greedy Heuristic for the Set-Covering Problem , 1979, Math. Oper. Res..

[4]  Kenneth Holmqvist,et al.  Eye tracking: a comprehensive guide to methods and measures , 2011 .

[5]  Kenneth L. Clarkson,et al.  Las Vegas algorithms for linear and integer programming when the dimension is small , 1995, JACM.

[6]  Christian Sohler,et al.  Clustering time series under the Fréchet distance , 2015, SODA.

[7]  Pankaj K. Agarwal,et al.  Approximation algorithms for projective clustering , 2000, SODA '00.

[8]  Joachim Gudmundsson,et al.  Improved Map Construction using Subtrajectory Clustering , 2020, LocalRec@SIGSPATIAL.

[9]  Pankaj K. Agarwal,et al.  Near-Linear Algorithms for Geometric Hitting Sets and Set Covers , 2014, Discrete & Computational Geometry.

[10]  David Haussler,et al.  Epsilon-nets and simplex range queries , 1986, SCG '86.

[11]  Maike Buchin,et al.  Inferring movement patterns from geometric similarity , 2020, J. Spatial Inf. Sci..

[12]  Jian Li,et al.  Real-time human gesture grading based on OpenPose , 2017, 2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI).

[13]  Kai Zheng,et al.  A survey of trajectory distance measures and performance evaluation , 2019, The VLDB Journal.

[14]  Ioannis Psarros,et al.  The VC Dimension of Metric Balls under Fréchet and Hausdorff Distances , 2019, Discrete & Computational Geometry.

[15]  Peter L. Bartlett,et al.  Neural Network Learning - Theoretical Foundations , 1999 .

[16]  Mark de Berg,et al.  Optimal Binary Space Partitions in the Plane , 2010, COCOON.

[17]  Andrew T Duchowski,et al.  A breadth-first survey of eye-tracking applications , 2002, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[18]  Nabil H. Mustafa,et al.  Near-Linear Time Approximation Algorithms for Curve Simplification , 2005, Algorithmica.

[19]  Anne Driemel,et al.  On the complexity of range searching among curves , 2017, SODA.

[20]  Timothy M. Chan,et al.  Faster Approximation Algorithms for Geometric Set Cover , 2020, SoCG.

[21]  Joachim Gudmundsson,et al.  Detecting Commuting Patterns by Clustering Subtrajectories , 2011, Int. J. Comput. Geom. Appl..

[22]  Frank Staals,et al.  Clustering Trajectories for Map Construction , 2017, SIGSPATIAL/GIS.

[23]  Helmut Alt,et al.  Computing the Fréchet distance between two polygonal curves , 1995, Int. J. Comput. Geom. Appl..

[24]  Jie Zhao,et al.  A review of moving object trajectory clustering algorithms , 2016, Artificial Intelligence Review.

[25]  Joachim Gudmundsson,et al.  Approximating $(k,\ell)$-center clustering for curves , 2018 .

[26]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[27]  Héctor H. González-Baños,et al.  A randomized art-gallery algorithm for sensor placement , 2001, SCG '01.

[28]  J. Shane Culpepper,et al.  A Survey on Trajectory Data Management, Analytics, and Learning , 2020, ArXiv.

[29]  Abhinandan Nath,et al.  k-Median clustering under discrete Fréchet and Hausdorff distances , 2020, SoCG.

[30]  Maike Buchin,et al.  Group Diagrams for Representing Trajectories , 2018, IWCTS@SIGSPATIAL.

[31]  Maike Buchin,et al.  Approximating $(k,\ell)$-Median Clustering for Polygonal Curves , 2020 .

[32]  Sanjeev Arora,et al.  The Multiplicative Weights Update Method: a Meta-Algorithm and Applications , 2012, Theory Comput..

[33]  Michael T. Goodrich,et al.  Almost optimal set covers in finite VC-dimension , 1995, Discret. Comput. Geom..