Pattern recognition using clustering algorithm for scenario definition in traffic simulation-based decision support systems

This paper presents a scenario clustering approach intended to mine historical data warehouses to identify appropriate scenarios for simulation as a part of an evaluation of transportation projects or operational measures. As such, it provides a systematic and efficient approach to select and prepare effective input scenarios to a given traffic simulation model. The scenario clustering procedure has two main applications: travel time reliability analysis, and traffic estimation and prediction systems. The ability to systematically identify similarity and dissimilarity among weather scenarios can facilitate the selection of critical scenarios for reliability studies. It can also support real-time weather-responsive traffic management (WRTM) by quickly classifying a current or predicted weather condition into pre-defined categories and suggesting relevant WRTM strategies that can be tested via real-time traffic simulation before deployment. A detailed method for clustering weather time series data is presented and demonstrated using historical data. Two clustering algorithms with different similarity measures are compared. Clustering results using a K-means clustering algorithm with squared Euclidean distance are illustrated in an application to travel time reliability.

[1]  S. C. Johnson Hierarchical clustering schemes , 1967, Psychometrika.

[2]  T. Warren Liao,et al.  Clustering of time series data - a survey , 2005, Pattern Recognit..

[3]  Hani S. Mahmassani,et al.  Use of Mobile Data for Weather-Responsive Traffic Management Models , 2012 .

[4]  E. Lefevre,et al.  Classifiers and distance-based evidential fusion for road travel time estimation , 2006, SPIE Defense + Commercial Sensing.

[5]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[6]  Mark Hansen,et al.  Generating day-of-operation probabilistic capacity scenarios from weather forecasts , 2013 .

[7]  Jiwon Kim,et al.  Scenario-based Approach to Travel Time Reliability Analysis 2 Using Traffic Simulation Models , 2012 .

[8]  Haris N. Koutsopoulos,et al.  Dynamic data-driven local traffic state estimation and prediction , 2013 .

[9]  Paul E. Green,et al.  A Note on Proximity Measures and Cluster Analysis , 1969 .

[10]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[11]  Meead Saberi,et al.  Connecting Networkwide Travel Time Reliability and the Network Fundamental Diagram of Traffic Flow , 2013 .

[12]  Peter Willett,et al.  Recent trends in hierarchic document clustering: A critical review , 1988, Inf. Process. Manag..

[13]  Mehdi Azimi,et al.  Categorizing Freeway Flow Conditions by Using Clustering Methods , 2010 .

[14]  J. Gower,et al.  Minimum Spanning Trees and Single Linkage Cluster Analysis , 1969 .

[15]  Hani S. Mahmassani,et al.  Implementation and Evaluation of Weather-Responsive Traffic Management Strategies , 2013 .

[16]  E. Forgy,et al.  Cluster analysis of multivariate data : efficiency versus interpretability of classifications , 1965 .

[17]  George W. Adamson,et al.  A Comparison of the Performance of Some Similarity and Dissimilarity Measures in the Automatic Classification of Chemical Structures , 1975, J. Chem. Inf. Comput. Sci..

[18]  G. W. Milligan,et al.  Methodology Review: Clustering Methods , 1987 .

[19]  Nong Ye,et al.  Data Mining: Theories, Algorithms, and Examples , 2013 .

[20]  Sudipto Guha,et al.  CURE: an efficient clustering algorithm for large databases , 1998, SIGMOD '98.

[21]  Peter Vovsha,et al.  Scenario-Based Approach to Analysis of Travel Time Reliability with Traffic Simulation Models , 2013 .

[22]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[23]  Jiwon Kim,et al.  Calibration of Traffic Flow Models under Adverse Weather and Application in Mesoscopic Network Simulation , 2013 .

[24]  Peter H. A. Sneath,et al.  Numerical Taxonomy: The Principles and Practice of Numerical Classification , 1973 .