Fluctuation Similarity Modeling for Traffic Flow Time Series: A Clustering Approach

Traffic time series analysis is important because of its use in traffic control and travel time prediction. In this paper, we discuss how to cluster traffic time series that have similar fluctuation patterns. We use simple average detrending method and only study the residual time series. Second, we use principle component analysis (PCA) on raw data and use the weight of the first d-components as the features of the time series. Third, we use k-means algorithm to cluster the traffic time series. Finally, we study the results of the clustering algorithm and discuss the origins of the clusters. In summary, the most important factors of clustering results are urban/rural area, direction and in/not in ramp entrance.

[1]  Yi Zhang,et al.  PPCA-Based Missing Data Imputation for Traffic Flow Volume: A Systematical Approach , 2009, IEEE Transactions on Intelligent Transportation Systems.

[2]  Eleni I. Vlahogianni,et al.  Statistical methods versus neural networks in transportation research: Differences, similarities and some insights , 2011 .

[3]  Jianming Hu,et al.  Comparative Study of PCA and ICA Based Traffic Flow Compression , 2009 .

[4]  Adam Meyerson,et al.  Fast and Accurate k-means For Large Datasets , 2011, NIPS.

[5]  T. Warren Liao,et al.  Clustering of time series data - a survey , 2005, Pattern Recognit..

[6]  Eleni I. Vlahogianni,et al.  Short-term traffic forecasting: Where we are and where we’re going , 2014 .

[7]  Xiaoying Wu,et al.  Integrated analysis of CFD data with K-means clustering algorithm and extreme learning machine for localized HVAC control , 2015 .

[8]  Ivan G. Guardiola,et al.  A functional approach to monitor and recognize patterns of daily traffic profiles , 2014 .

[9]  Yi Zhang,et al.  A BPCA based missing value imputing method for traffic flow volume data , 2008, 2008 IEEE Intelligent Vehicles Symposium.

[10]  G. McLachlan,et al.  The EM algorithm and extensions , 1996 .

[11]  Yin Wang,et al.  The retrieval of intra-day trend and its influence on traffic prediction , 2012 .

[12]  Shiliang Sun,et al.  A bayesian network approach to traffic flow forecasting , 2006, IEEE Transactions on Intelligent Transportation Systems.

[13]  Shiliang Sun,et al.  Bayesian Network Methods for Traffic Flow Forecasting with Incomplete Data , 2004, ECML.

[14]  Erkki Oja,et al.  Independent component analysis: algorithms and applications , 2000, Neural Networks.

[15]  Roland Chrobok,et al.  Different methods of traffic forecast based on real data , 2004, Eur. J. Oper. Res..

[16]  I. Jolliffe Principal Component Analysis , 2002 .

[17]  Fred Douglis,et al.  Transparent process migration: Design alternatives and the sprite implementation , 1991, Softw. Pract. Exp..

[18]  R. Redner,et al.  Mixture densities, maximum likelihood, and the EM algorithm , 1984 .

[19]  Billy M. Williams,et al.  Comparison of parametric and nonparametric models for traffic flow forecasting , 2002 .

[20]  Li Li,et al.  Robust causal dependence mining in big data network and its application to traffic flow predictions , 2015 .

[21]  G. W. STEWARTt ON THE EARLY HISTORY OF THE SINGULAR VALUE DECOMPOSITION * , 2022 .