From edit distance to augmented space-time-weighted edit distance: Detecting and clustering patterns of human activities in Puget Sound region

Abstract Considering and measuring the similarity of human activities remains challenging. Existing studies of similarity measures based on traditional edit distance (ED), specifically on activity patterns, do not reflect the spatiotemporal characteristics in the measurement model. Additionally, interdependence between activities is ignored in existing multidimensional sequence alignment methods. To address the gap, we initially extend the traditional edit distance to a space-time-weighted edit distance (STW-ED). Specifically, differences in distance and time between activities are considered cost functions in the operation cost calculation (insertion, deletion, and substitution). We advance STW-ED to an augmented space-time-weighted edit distance method (ASTW-ED) that integrates an optimum-trajectory-based multidimensional sequence alignment method (OT-MDSAM) with STW-ED, treating the nonspatiotemporal dimensions as augment factors. In addition, ontology is considered for the similarity measure for nonspatiotemporal dimensions. To show the feasibility of our proposed approach, we conduct an empirical study based on an activity-based travel survey in the Puget Sound Region. Eight clusters (homemakers, regular workers with a colorful life, regular workers with a monotonous life, part-time workers, recreation travelers, senior travelers, no-job travelers, and night owl adventurers) are identified based on ASTW-ED and ontology. To cluster the similarity matrix derived from the introduced methods, the affinity propagation (AP) clustering method is employed because it is free of prior knowledge for clustering and can produce exemplars of the clusters. The empirical study indicates that, relative to existing methods for multidimensional activity similarity measurement and clustering, ASTW-ED performs better in terms of within-group homogeneity and between-group heterogeneity of clusters. In addition, the results reveal that ontology can improve clustering performance if it is considered for nonspatiotemporal dimensions provide better understanding of human behavior for urban governance..

[1]  Ningchuan Xiao,et al.  Assessing Activity Pattern Similarity with Multidimensional Sequence Alignment based on a Multiobjective Optimization Evolutionary Algorithm. , 2014, Geographical analysis.

[2]  K. Axhausen,et al.  Observing the rhythms of daily life: A six-week travel diary , 2002 .

[3]  Liangxu Liu,et al.  Using Relative Distance and Hausdorff Distance to Mine Trajectory Clusters , 2013 .

[4]  W C Wilson,et al.  Activity Pattern Analysis by Means of Sequence-Alignment Methods , 1998 .

[5]  A. Santos,et al.  Summary of Travel Trends: 2009 National Household Travel Survey , 2011 .

[6]  Kay W. Axhausen,et al.  Processing Raw Data from Global Positioning Systems without Additional Information , 2009 .

[7]  Kathleen Stewart,et al.  Modeling Moving Geospatial Objects from an Event-based Perspective , 2007, Trans. GIS.

[8]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[9]  J. Gower A General Coefficient of Similarity and Some of Its Properties , 1971 .

[10]  Davy Janssens,et al.  Ambient Systems , Networks and Technologies ( ANT 2013 ) An Activity-based Carpooling Microsimulation using Ontology , 2013 .

[11]  Robert Weibel,et al.  Movement similarity assessment using symbolic representation of trajectories , 2012, Int. J. Geogr. Inf. Sci..

[12]  Eric I. Pas,et al.  A Flexible and Integrated Methodology for Analytical Classification of Daily Travel-Activity Behavior , 1983 .

[13]  Kay W. Axhausen,et al.  Identifying trips and activities and their characteristics from GPS raw data without further information , 2008 .

[14]  Alessandro Vespignani,et al.  Multiscale mobility networks and the spatial spreading of infectious diseases , 2009, Proceedings of the National Academy of Sciences.

[15]  Frederico T. Fonseca,et al.  Using Ontologies for Integrated Geographic Information Systems , 2002, Trans. GIS.

[16]  Martin Raubal,et al.  Measuring similarity of mobile phone user trajectories– a Spatio-temporal Edit Distance method , 2014, Int. J. Geogr. Inf. Sci..

[17]  Tieniu Tan,et al.  Similarity based vehicle trajectory clustering and anomaly detection , 2005, IEEE International Conference on Image Processing 2005.

[18]  Achille C. Varzi,et al.  Ontological Tools for Geographic Representation , 1998 .

[19]  J. Scheiner,et al.  Travel Distances in Daily Travel and Long-Distance Travel: What Role is Played by Urban Form? , 2014 .

[20]  Muzaffer Uysal,et al.  Travel Motivations of Japanese Overseas Travelers: A Factor-Cluster Segmentation Approach , 1995 .

[21]  Kirsi Virrantaus,et al.  Space–time density of trajectories: exploring spatio-temporal patterns in movement data , 2010, Int. J. Geogr. Inf. Sci..

[22]  Changjoo Kim,et al.  An examination of the jobs-housing balance of different categories of workers across 26 metropolitan regions , 2016 .

[23]  E. I. Pas The Effect of Selected Sociodemographic Characteristics on Daily Travel-Activity Behavior , 1984 .

[24]  Moshe Ben-Akiva,et al.  Integration of an Activity-based Model System and a Residential Location Model , 1998 .

[25]  W. Tobler A Computer Movie Simulating Urban Growth in the Detroit Region , 1970 .

[26]  Davy Janssens,et al.  Semantic Annotation of Global Positioning System Traces , 2013 .

[27]  M. Kwan Gis methods in time‐geographic research: geocomputation and geovisualization of human activity patterns , 2004 .

[28]  Ta Theo Arentze,et al.  Pattern Recognition in Complex Activity Travel Patterns: Comparison of Euclidean Distance, Signal-Processing Theoretical, and Multidimensional Sequence Alignment Methods , 2001 .

[29]  Bettina Speckmann,et al.  Similarity of trajectories taking into account geographic context , 2014, J. Spatial Inf. Sci..

[30]  C H Chang-Hyeon Joh,et al.  Measuring and predicting adaptation in multidimensional activity-travel patterns , 2004 .

[31]  Peter J. Taylor,et al.  A Kantian View of the City: A Factorial-Ecology Experiment in Space and Time , 1975 .

[32]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[33]  Martin Raubal,et al.  Correlating mobile phone usage and travel behavior - A case study of Harbin, China , 2012, Comput. Environ. Urban Syst..

[34]  F. Stuart Chapin,et al.  Human activity patterns in the city : things people do in time and in space , 1976 .

[35]  O. Persson,et al.  Torsten Hägerstrand in the Citation Time Web , 2012 .

[36]  Noam Shoval,et al.  Sequence Alignment as a Method for Human Activity Analysis in Space and Time , 2007 .

[37]  Pat Burnett,et al.  THE ANALYSIS OF TRAVEL AS AN EXAMPLE OF COMPLEX HUMAN BEHAVIOR IN SPATIALLY-CONSTRAINED SITUATIONS: DEFINITION AND MEASUREMENT ISSUES , 1982 .

[38]  D. Collia,et al.  The 2001 National Household Travel Survey: a look into the travel patterns of older Americans. , 2003, Journal of safety research.

[39]  Lei Chen,et al.  On The Marriage of Lp-norms and Edit Distance , 2004, VLDB.

[40]  Albert-László Barabási,et al.  Understanding the Spreading Patterns of Mobile Phone Viruses , 2009, Science.

[41]  Shan Jiang,et al.  Clustering daily patterns of human activities in the city , 2012, Data Mining and Knowledge Discovery.

[42]  Pip Forer,et al.  Movement beyond the snapshot - Dynamic analysis of geospatial lifelines , 2007, Comput. Environ. Urban Syst..

[43]  Yixiang Chen,et al.  Detecting Anomalous Trajectories and Behavior Patterns Using Hierarchical Clustering from Taxi GPS Data , 2018, ISPRS Int. J. Geo Inf..

[44]  Pierpaolo D'Urso,et al.  Fuzzy clustering of human activity patterns , 2013, Fuzzy Sets Syst..

[45]  Dino Pedreschi,et al.  Human mobility, social ties, and link prediction , 2011, KDD.

[46]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[47]  Shahrokh Valaee,et al.  Mobility-Based Clustering in VANETs Using Affinity Propagation , 2009, GLOBECOM 2009 - 2009 IEEE Global Telecommunications Conference.

[48]  Tao Wang,et al.  The Fusion Model of Intelligent Transportation Systems Based on the Urban Traffic Ontology , 2012 .

[49]  Peter R. Stopher,et al.  Deducing mode and purpose from GPS data , 2008 .

[50]  F. Agakov,et al.  Application of high-dimensional feature selection: evaluation for genomic prediction in man , 2015, Scientific Reports.

[51]  Lidia P. Kostyniuk,et al.  Using GPS Data to Understand Driving Behavior , 2008 .

[52]  Gloria Bordogna,et al.  Spatial Querying Supported by Domain and User Ontologies: An Approach for Web GIS Applications , 2015, FQAS.

[53]  M. Kwan The Uncertain Geographic Context Problem , 2012 .

[54]  Jun Ma,et al.  A dynamic analysis of person and household activity and travel patterns using data from the first two waves in the Puget Sound Transportation Panel , 1997 .

[55]  Albert-László Barabási,et al.  Understanding individual human mobility patterns , 2008, Nature.

[56]  Jae-Gil Lee,et al.  Trajectory clustering: a partition-and-group framework , 2007, SIGMOD '07.

[57]  S. Arnold A Test for Clusters , 1979 .

[58]  F. Koppelman,et al.  Activity-Based Modeling of Travel Demand , 2003 .

[59]  A. Maslow Motivation and Personality , 1954 .

[60]  Odile Papini,et al.  Ontology-Based Photogrammetry Survey for Medieval Archaeology: Toward a 3D Geographic Information System (GIS) , 2017 .