Mining frequency-based sequential trajectory co-clusters

Co-clustering is a type of clustering that finds groups of objects without necessarily considering all attributes. It has been shown to produce more consistent results than traditional clustering on high-dimensional sparse data. Existing trajectory co-clustering methods have two main limitations: first, the space and time dimensions must be constrained by user-defined thresholds; second, elements (trajectory points) are clustered without regard to their order in the trajectory, as if the points were independent of one another. To address these limitations, we propose a new trajectory co-clustering method for mining semantic trajectory co-clusters. It simultaneously clusters the trajectories and their elements, taking into account the order in which the elements appear. The method uses element frequency to identify candidate co-clusters and an objective cost function that automatically drives the co-clustering process, avoiding the need to constrain dimensions. We evaluate the proposed approach on a publicly available real-world dataset. The experimental results show that our proposal finds frequent and meaningful contiguous sequences that reveal mobility patterns and, thereby, the most relevant elements.
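As an illustration of the frequency-based idea only, the sketch below enumerates contiguous element sequences and records which trajectories contain each one, giving (sequence, trajectory set) pairs that could serve as candidate co-clusters. It is not the method proposed here: the function name `candidate_co_clusters`, the `max_len` cap, the ranking rule, and the toy data are all assumptions made for this example, and the objective cost function that drives the actual co-clustering is not modeled.

```python
from collections import defaultdict


def candidate_co_clusters(trajectories, max_len=3):
    """Map each contiguous element sequence to the ids of trajectories containing it.

    `trajectories` is a dict of trajectory id -> ordered list of semantic elements.
    `max_len` is a hypothetical cap on sequence length, used only to keep the
    example small.
    """
    support = defaultdict(set)
    for tid, elements in trajectories.items():
        n = len(elements)
        for length in range(1, max_len + 1):
            for start in range(n - length + 1):
                # A slice keeps elements adjacent, so the sequence is contiguous.
                seq = tuple(elements[start:start + length])
                support[seq].add(tid)
    # Assumed ranking: more supporting trajectories first, then longer
    # sequences, then lexicographic order to make the result deterministic.
    return sorted(
        support.items(),
        key=lambda kv: (-len(kv[1]), -len(kv[0]), kv[0]),
    )


# Toy semantic trajectories (invented for illustration).
trajectories = {
    "t1": ["home", "work", "gym", "home"],
    "t2": ["home", "work", "restaurant"],
    "t3": ["hotel", "work", "gym"],
}

ranked = candidate_co_clusters(trajectories)
for seq, tids in ranked[:5]:
    print(seq, sorted(tids))
```

Each pair groups a set of trajectories (rows) with the ordered elements they share (columns), which is the shape of a sequential co-cluster. Selecting among such candidates is where a cost function would come in.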
