Clustering Activity–Travel Behavior Time Series using Topological Data Analysis

Over the last few years, traffic data has been exploding and the transportation discipline has entered the era of big data. It brings out new opportunities for doing data-driven analysis, but it also challenges traditional analytic methods. This paper proposes a new Divide and Combine based approach to do K means clustering on activity-travel behavior time series using features that are derived using tools in Time Series Analysis and Topological Data Analysis. Clustering data from five waves of the National Household Travel Survey ranging from 1990 to 2017 suggests that activity-travel patterns of individuals over the last three decades can be grouped into three clusters. Results also provide evidence in support of recent claims about differences in activity-travel patterns of different survey cohorts. The proposed method is generally applicable and is not limited only to activity-travel behavior analysis in transportation studies. Driving behavior, travel mode choice, household vehicle ownership, when being characterized as categorical time series, can all be analyzed using the proposed method.

[1]  Hernando Ombao,et al.  Topological Data Analysis of Single-Trial Electroencephalographic Signals. , 2018, The annals of applied statistics.

[2]  Peter Bubenik,et al.  Statistical topological data analysis using persistence landscapes , 2012, J. Mach. Learn. Res..

[3]  G. Madey,et al.  Uncovering individual and collective human dynamics from mobile phone records , 2007, 0710.2939.

[4]  M. McNally,et al.  Travel/activity analysis: Pattern recognition, classification and interpretation , 1985 .

[5]  Heather A. Harrington,et al.  Persistent homology of time-dependent functional networks constructed from coupled time series. , 2016, Chaos.

[6]  Konstadinos G. Goulias,et al.  LONGITUDINAL ANALYSIS OF ACTIVITY AND TRAVEL PATTERN DYNAMICS USING GENERALIZED MIXED MARKOV LATENT CLASS MODELS , 1999 .

[7]  Mei-Po Kwan,et al.  Interactive geovisualization of activity-travel patterns using three-dimensional geographical information systems: a methodological exploration with a large data set , 2000 .

[8]  Changhyun Kwon,et al.  Multi-day activity-travel pattern sampling based on single-day data , 2018 .

[9]  Leonardo Lima,et al.  Towards Smart Traffic Lights Using Big Data to Improve Urban Traffic , 2015 .

[10]  Alexander Russell,et al.  Computational topology: ambient isotopic approximation of 2-manifolds , 2003, Theor. Comput. Sci..

[11]  E. I. Pas Weekly travel-activity behavior , 1988 .

[12]  Carlo Ratti,et al.  Understanding individual mobility patterns from urban sensing data: A mobile phone trace example , 2013 .

[13]  John L. Shanks,et al.  Computation of the Fast Walsh-Fourier Transform , 1969, IEEE Transactions on Computers.

[14]  David S. Stoffer,et al.  Walsh-Fourier Analysis and its Statistical Applications , 1991 .

[15]  Ricardo Jardim-Goncalves,et al.  Big Data Processing and Storage Framework for ITS: A Case Study on Dynamic Tolling , 2016 .

[16]  Noam Shoval,et al.  Sequence Alignment as a Method for Human Activity Analysis in Space and Time , 2007 .

[17]  Jiangping Zhou,et al.  Tracking job and housing dynamics with smartcard data , 2018, Proceedings of the National Academy of Sciences.

[18]  Gunnar E. Carlsson,et al.  Topology and data , 2009 .

[19]  David J. Ketchen,et al.  THE APPLICATION OF CLUSTER ANALYSIS IN STRATEGIC MANAGEMENT RESEARCH: AN ANALYSIS AND CRITIQUE , 1996 .

[20]  Ta Theo Arentze,et al.  Pattern Recognition in Complex Activity Travel Patterns: Comparison of Euclidean Distance, Signal-Processing Theoretical, and Multidimensional Sequence Alignment Methods , 2001 .

[21]  Clarke Wilson,et al.  Activity Patterns of Canadian Women: Application of ClustalG Sequence Alignment Software , 2001 .

[22]  R. L. Thorndike Who belongs in the family? , 1953 .