Missing traffic data imputation considering approximate intervals: A hybrid structure integrating adaptive network-based inference and fuzzy rough set

Abstract The incomplete traffic data will seriously influence the application of Intelligent Transportation System (ITS). In this study, a hybrid model, combining Adaptive Network-based Fuzzy Inference System (ANFIS) and Fuzzy Rough Set (FRS), is constructed to impute missing traffic volume data. Firstly, the upper and lower parameters of fuzzification in hybrid model are optimized through the combination of FRS and least square method. Then, a five-layer structure is designed to achieve imputation process under the integration of ANFIS and FRS. In addition, the imputation process is improved and the final imputation results can be estimated based on the upper and lower approximations of missing values. Finally, experiments are conducted to validate the effectiveness of the hybrid model using three evaluation indicators: Root Mean Square Error (RMSE), Correlation Coefficient (R) and Relative Accuracy (RA), under different missing rates. Though comparing with several candidate models under random missing and continuous missing types, the imputation results show that the hybrid method produces higher imputation accuracy and better stability imputation performance under different data missing rates, especially for high missing rate. The hybrid model, combining the advantages for dealing with uncertainty in FRS and strong learning ability to outliers in ANFIS, is an effective and feasible strategy to improve imputation performance for missing traffic flow data in transportation system.

[1]  J. Shao,et al.  Nearest Neighbor Imputation for Survey Data , 2000 .

[2]  Biswajit Basu,et al.  Time-series modelling for forecasting vehicular traffic flow in Dublin , 2005 .

[3]  Renato José Sassi,et al.  Biological image classification using rough-fuzzy artificial neural network , 2015, Expert Syst. Appl..

[4]  Hwasoo Yeo,et al.  Traffic Data Imputation Using Deep Convolutional Neural Networks , 2020, IEEE Access.

[5]  Carl E. Rasmussen,et al.  Gaussian Processes for Machine Learning (GPML) Toolbox , 2010, J. Mach. Learn. Res..

[6]  Hyun Kim,et al.  Missing data imputation for traffic flow speed using spatio-temporal cokriging , 2018 .

[7]  Jinjun Tang,et al.  Traffic flow prediction based on combination of support vector machine and data denoising schemes , 2019, Physica A: Statistical Mechanics and its Applications.

[8]  Billy M. Williams,et al.  Modeling and Forecasting Vehicular Traffic Flow as a Seasonal ARIMA Process: Theoretical Basis and Empirical Results , 2003, Journal of Transportation Engineering.

[9]  Bin Ran,et al.  Robust and flexible strategy for missing data imputation in intelligent transportation system , 2017 .

[10]  Yunlong Zhang,et al.  Forecasting of Short-Term Freeway Volume with v-Support Vector Machines , 2007 .

[11]  Huafeng Wu,et al.  Robust Ship Tracking via Multi-view Learning and Sparse Representation , 2018, Journal of Navigation.

[12]  Xiaofei Wang,et al.  Understanding characteristics in multivariate traffic flow time series from complex network structure , 2017 .

[13]  Guangdong Feng,et al.  A Tensor Based Method for Missing Traffic Data Completion , 2013 .

[14]  Yi Zhang,et al.  A BPCA based missing value imputing method for traffic flow volume data , 2008, 2008 IEEE Intelligent Vehicles Symposium.

[15]  D. T. Lee,et al.  Travel-time prediction with support vector regression , 2004, IEEE Transactions on Intelligent Transportation Systems.

[16]  Meng Li,et al.  A spatiotemporal approach for traffic data imputation with complicated missing patterns , 2020 .

[17]  Nathan Huynh,et al.  Kernel-Based Machine Learning Models for Predicting Daily Truck Volume at Seaport Terminals , 2010 .

[18]  Bruce Ramsey,et al.  AUTOCOUNTS: A WAY TO ANALYSE AUTOMATIC TRAFFIC COUNT DATA. , 1994 .

[19]  Robert Nowicki,et al.  Rough Neuro-Fuzzy Structures for Classification With Missing Data , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[20]  Li Li,et al.  Missing traffic data: comparison of imputation methods , 2014 .

[21]  Jieping Ye,et al.  Tensor Completion for Estimating Missing Values in Visual Data , 2013, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[23]  Yunlong Zhang,et al.  Application of Genetic Neural Networks to Real-Time Intersection Accident Detection Using Acoustic Signals , 2006 .

[24]  Mehran Amiri,et al.  Missing data imputation using fuzzy-rough methods , 2016, Neurocomputing.

[25]  Nancy L. Nihan,et al.  Can Single-Loop Detectors Do the Work of Dual-Loop Detectors? , 2003 .

[26]  Bin Ran,et al.  Perspectives on Future Transportation Research: Impact of Intelligent Transportation System Technologies on Next-Generation Transportation Modeling , 2012, J. Intell. Transp. Syst..

[27]  S. Popescu,et al.  Bayesian Learning with Gaussian Processes for Supervised Classification of Hyperspectral Data , 2008 .

[28]  Jiawei Wang,et al.  Missing traffic data imputation and pattern discovery with a Bayesian augmented tensor factorization model , 2019, Transportation Research Part C: Emerging Technologies.

[29]  Qiang Shen,et al.  Semantics-preserving dimensionality reduction: rough and fuzzy-rough-based approaches , 2004, IEEE Transactions on Knowledge and Data Engineering.

[30]  Yinhai Wang,et al.  A hybrid approach to integrate fuzzy C-means based imputation method with genetic algorithm for missing traffic volume data estimation , 2015 .

[31]  Jinjun Tang,et al.  Ship Type Recognition via a Coarse-to-Fine Cascaded Convolution Neural Network , 2020, Journal of Navigation.

[32]  Shawn Turner,et al.  IMPUTING MISSING VALUES IN ITS DATA ARCHIVES FOR INTERVALS UNDER 5 MINUTES , 2001 .

[33]  Ruifeng Yu,et al.  Loop Detector Segmentation Error and Its Impacts on Traffic Speed Estimation , 2009 .

[34]  Ruimin Ke,et al.  Innovative method for traffic data imputation based on convolutional neural network , 2018, IET Intelligent Transport Systems.

[35]  Yaying Zhang,et al.  Urban Traffic Data Imputation With Detrending and Tensor Decomposition , 2020, IEEE Access.

[36]  Krzysztof Siminski,et al.  Neuro-rough-fuzzy approach for regression modelling from missing data , 2012, Int. J. Appl. Math. Comput. Sci..

[37]  Alexandre M. Bayen,et al.  Arterial travel time forecast with streaming data: A hybrid approach of flow modeling and machine learning , 2012 .

[38]  Yi Zhang,et al.  PPCA-Based Missing Data Imputation for Traffic Flow Volume: A Systematical Approach , 2009, IEEE Transactions on Intelligent Transportation Systems.

[39]  Robert Nowicki,et al.  On classification with missing data using rough-neuro-fuzzy systems , 2010, Int. J. Appl. Math. Comput. Sci..

[40]  Qiang Shen,et al.  Fuzzy-Rough Sets Assisted Attribute Selection , 2007, IEEE Transactions on Fuzzy Systems.

[41]  Li Li,et al.  Efficient missing data imputing for traffic flow by considering temporal and spatial dependence , 2013 .