Correcting and complementing freeway traffic accident data using mahalanobis distance based outlier detection

A huge amount of traffic data is archived which can be used in data mining especially supervised learning. However, it is not being fully used due to lack of accurate accident information (labels). ...

[1]  Donald H. Burn,et al.  Switching the pooling similarity distances: Mahalanobis for Euclidean , 2006 .

[2]  Carlos Sun,et al.  Dynamic Incident Progression Curve for Classifying Secondary Traffic Crashes , 2010 .

[3]  Javier Palarea-Albaladejo,et al.  zCompositions — R package for multivariate imputation of left-censored data under a compositional approach , 2015 .

[4]  Brian Lee Smith,et al.  Identifying Nearest Neighbors in a Large-Scale Incident Data Archive , 2004 .

[5]  Jiawei Han,et al.  Multidimensional Data Mining of Traffic Anomalies on Large-Scale Road Networks , 2011 .

[6]  Hojjat Adeli,et al.  Wavelet‐Clustering‐Neural Network Model for Freeway Incident Detection , 2003 .

[7]  Yong Shi,et al.  ON-LINE TESTING OF THE MCMASTER INCIDENT DETECTION ALGORITHM UNDER RECURRENT CONGESTION , 1993 .

[8]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[9]  Tom Thomas,et al.  Detection of incidents and events in urban networks , 2008 .

[10]  P. Rousseeuw,et al.  A fast algorithm for the minimum covariance determinant estimator , 1999 .

[11]  Qiang Guo,et al.  Design and Implementation of School Hospital Information Analysis and Mining System , 2014 .

[12]  Simon Janos,et al.  Web based distant monitoring and control for greenhouse systems using the Sun SPOT modules , 2009, 2009 7th International Symposium on Intelligent Systems and Informatics.

[13]  Sungbin Cho,et al.  A hybrid approach based on the combination of variable selection using decision trees and case-based reasoning using the Mahalanobis distance: For bankruptcy prediction , 2010, Expert Syst. Appl..

[14]  Yorgos J. Stephanedes,et al.  FREEWAY INCIDENT DETECTION THROUGH FILTERING , 1993 .

[15]  Zhijun Huang,et al.  A Kind of  Algorithms for Euclidean Distance-Based Outlier Mining and its Application to Expressway Toll Fraud Detection , 2009, 2009 International Asia Conference on Informatics in Control, Automation and Robotics.

[16]  Martti Juhola,et al.  Informal identification of outliers in medical data , 2000 .

[17]  Choujun Zhan,et al.  Semi-Supervised Image Classification Based on Local and Global Regression , 2015, IEEE Signal Processing Letters.

[18]  Mingchao Li,et al.  A multidimensional information model for managing construction information , 2015 .

[19]  Yaser E. Hawas,et al.  A Threshold-Based Real-Time Incident Detection System for Urban Traffic Networks , 2012 .

[20]  D. Massart,et al.  The Mahalanobis distance , 2000 .

[21]  Jj Allaire,et al.  Web Application Framework for R , 2016 .

[22]  Jiyoun Yeon,et al.  Differences in Freeway Capacity by Day of the Week, Time of Day, and Segment Type , 2009 .

[23]  Nathan J Morris,et al.  strum: an R package for structural modeling of latent variables for general pedigrees , 2015, BMC Genetics.

[24]  A. S. Bhatt Comparative analysis of attribute selection measures used for attribute selection in decision tree induction , 2012, 2012 International Conference on Radar, Communication and Computing (ICRCC).

[25]  H. M. Zhang,et al.  Fundamental Diagram of Traffic Flow , 2011 .

[26]  Calvin R. Maurer,et al.  A Linear Time Algorithm for Computing Exact Euclidean Distance Transforms of Binary Images in Arbitrary Dimensions , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  Jianhua Guo,et al.  Real time traffic flow outlier detection using short-term traffic conditional variance prediction , 2015 .

[28]  Yiannis Kamarianakis,et al.  Characterizing regimes in daily cycles of urban traffic using smooth-transition regressions , 2010 .

[29]  C. Pipper,et al.  [''R"--project for statistical computing]. , 2008, Ugeskrift for laeger.

[30]  M. C. Kiran,et al.  Quantifying and mapping biodiversity and ecosystem services : Utility of a multi-season NDVI based Mahalanobis distance surrogate , 2009 .

[31]  Fu-Ding Xie,et al.  Image segmentation using PSO and PCM with Mahalanobis distance , 2011, Expert Syst. Appl..

[32]  P. Rousseeuw,et al.  Unmasking Multivariate Outliers and Leverage Points , 1990 .

[33]  Carlos F. Daganzo,et al.  A simple detection scheme for delay-inducing freeway incidents , 1997 .

[34]  Salvatore J. Stolfo,et al.  Adaptive Model Generation: An Architecture for Deployment of Data Mining-Based Intrusion Detection Systems , 2002 .

[35]  Chunbo Zhang,et al.  ALTERNATIVE ROUTE STRATEGY FOR EMERGENCY TRAFFIC MANAGEMENT BASED ON ITS : A CASE STUDY OF XI ’ AN MING CITY WALL , 2013 .

[36]  Lara Lusa,et al.  medplot: A Web Application for Dynamic Summary and Analysis of Longitudinal Medical Data Based on R , 2015, PloS one.

[37]  Eleni I. Vlahogianni,et al.  Statistical methods for detecting nonlinearity and non-stationarity in univariate short-term time-series of traffic volume , 2006 .

[38]  R. Kadmon,et al.  Assessment of alternative approaches for bioclimatic modeling with special emphasis on the Mahalanobis distance , 2003 .

[39]  Jian Pei,et al.  Data Mining: Concepts and Techniques, 3rd edition , 2006 .

[40]  Fang Yuan,et al.  INCIDENT DETECTION USING SUPPORT VECTOR MACHINES , 2003 .

[41]  Clemens Reimann,et al.  Multivariate outlier detection in exploration geochemistry , 2005, Comput. Geosci..

[42]  Alfred Krzywicki,et al.  Exploiting Concept Clumping for Efficient Incremental E-Mail Categorization , 2010, ADMA.

[43]  M. M. Naidu,et al.  An Approach to Prediction of Precipitation Using Gini Index in SLIQ Decision Tree , 2013, 2013 4th International Conference on Intelligent Systems, Modelling and Simulation.

[44]  J. R. Alder,et al.  Web based visualization of large climate data sets , 2015, Environ. Model. Softw..