Outlier Detection in Traffic Data Based on the Dirichlet Process Mixture Model

Traffic data collections are valuable for road network management. Such collections are typically massive and contain errors, noise and abnormal traffic behaviour. These abnormalities are regarded as outliers because they are inconsistent with the rest of the data, and detecting them at this scale is a non-trivial problem. This paper presents a novel method for outlier detection (OD) in large-scale traffic data that models the data with a Dirichlet process mixture model (DPMM). In essence, input traffic signals are truncated and mapped to a covariance signal descriptor, whose dimension is then reduced by principal component analysis. The reduced signal vector is then modelled by a DPMM. Traffic signals generally share strong spatial-temporal similarities, both within signals and among various categories of traffic signals, and previous OD methods have proved incapable of properly discerning these similarities or differences. The contribution of this study is to represent real-world traffic data with a robust DPMM-based method and to perform unsupervised OD, achieving a detection rate of 96.67% in ten-fold cross-validation.
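
The pipeline described above (truncation, covariance descriptor, principal component analysis, DPMM) can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several details are assumptions made only for illustration: the descriptor is taken as the covariance of each signal with its first and second differences, the DPMM is approximated by scikit-learn's truncated variational `BayesianGaussianMixture`, and a signal is flagged as an outlier when its log-likelihood falls below a low quantile. The names `covariance_descriptor` and `detect_outliers` and the parameters `length`, `n_components` and `quantile` are hypothetical.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.mixture import BayesianGaussianMixture


def covariance_descriptor(signal):
    """Assumed descriptor: covariance of (value, 1st diff, 2nd diff)."""
    x = np.asarray(signal, dtype=float)
    d1 = np.gradient(x)               # first-order difference
    d2 = np.gradient(d1)              # second-order difference
    cov = np.cov(np.vstack([x, d1, d2]))   # 3 x 3 covariance matrix
    return cov[np.triu_indices(3)]    # 6 unique entries of the symmetric matrix


def detect_outliers(signals, length=96, n_components=2, quantile=0.05, seed=0):
    """Return a boolean mask marking signals treated as outliers.

    Every signal must contain at least `length` samples.
    """
    # 1. Truncate every signal to a common length.
    truncated = np.asarray([s[:length] for s in signals], dtype=float)
    # 2. Map each signal to its covariance descriptor.
    descriptors = np.array([covariance_descriptor(s) for s in truncated])
    # 3. Reduce the descriptor dimension with PCA.
    reduced = PCA(n_components=n_components).fit_transform(descriptors)
    # 4. Fit a truncated Dirichlet process mixture (variational approximation).
    dpmm = BayesianGaussianMixture(
        n_components=10,  # truncation level, an upper bound on clusters
        weight_concentration_prior_type="dirichlet_process",
        max_iter=500,
        random_state=seed,
    ).fit(reduced)
    # 5. Assumed decision rule: low log-likelihood under the mixture.
    log_lik = dpmm.score_samples(reduced)
    return log_lik < np.quantile(log_lik, quantile)
```

Calling `detect_outliers(signals)` on a list of equally sampled daily traffic signals returns one flag per signal. The quantile threshold fixes the share of flagged signals in advance, so in practice it would be replaced by whatever decision rule the full paper specifies.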
