Local Polynomial Regression Models for Average Traffic Speed Estimation and Forecasting in Linear Constraint Databases

Constraint databases have the specific advantage of being able to represent infinite temporal relations by linear equations, linear inequalities, polynomial equations, and so on. This advantage can store a continuous time-line that naturally connects with other traffic attributes, such as traffic speed. In most cases, vehicle speed varies over time, that is, the speed is often nonlinear. However, the infinite representations allowed in current constraint database systems are only linear. Our article presents a new approach to estimate and forecast continuous average speed using linear constraint database systems. Our new approach to represent and query the nonlinear average traffic speed is based on a combination of local polynomial regression and piecewise-linear approximation algorithm. Experiments using the MLPQ constraint database system and queries show that our method has a high accuracy in predicting the average traffic speed. The actual accuracy is controllable by a parameter. We compared the local linear regression model with the local cubic model by using a field experiment. It was found that the local cubic model follows more closely the raw data than the linear model follows.

[1]  Lixin Li,et al.  A Spatiotemporal Database for Ozone in the Conterminous U.S. , 2006, Thirteenth International Symposium on Temporal Representation and Reasoning (TIME'06).

[2]  Brian Lee Smith,et al.  PARAMETRIC AND NONPARAMETRIC TRAFFIC VOLUME FORECASTING , 2000 .

[3]  Joachim Engel,et al.  An iterative bandwidth selector for kernel estimation of densities and their derivatives , 1994 .

[4]  W. Cleveland Robust Locally Weighted Regression and Smoothing Scatterplots , 1979 .

[5]  J. Faraway,et al.  Bootstrap choice of bandwidth for density estimation , 1990 .

[6]  H. Mahmassani,et al.  Travel time estimation based on piecewise truncated quadratic speed trajectory , 2008 .

[7]  S. Sheather A data-based algorithm for choosing the window width when estimating the density at a point , 1983 .

[8]  C. D. Kemp,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[9]  D. W. Scott,et al.  Biased and Unbiased Cross-Validation in Density Estimation , 1987 .

[10]  Jianqing Fan Design-adaptive Nonparametric Regression , 1992 .

[11]  N-E El Faouzi NONPARAMETRIC TRAFFIC FLOW PREDICTION USING KERNEL ESTIMATOR , 1996 .

[12]  Steven I-Jy Chien,et al.  DYNAMIC TRAVEL TIME PREDICTION WITH REAL-TIME AND HISTORICAL DATA , 2003 .

[13]  D. W. Scott,et al.  Kernel density estimation revisited , 1977 .

[14]  James Stephen Marron,et al.  Comparison of data-driven bandwith selectors , 1988 .

[15]  A. Bowman An alternative method of cross-validation for the smoothing of density estimates , 1984 .

[16]  H. Müller,et al.  Kernel estimation of regression functions , 1979 .

[17]  Peter Z. Revesz Introduction to Databases - From Biological to Spatio-Temporal , 2010, Texts in Computer Science.

[18]  Kan Chen,et al.  Advanced Traveler Information Systems , 2002 .

[19]  Jianqing Fan,et al.  Fast Implementations of Nonparametric Curve Estimators , 1994 .

[20]  Matthew P. Wand,et al.  Kernel Smoothing , 1995 .

[21]  M. C. Jones,et al.  A reliable data-based bandwidth selection method for kernel density estimation , 1991 .

[22]  Henry X. Liu,et al.  Use of Local Linear Regression Model for Short-Term Traffic Forecasting , 2003 .

[23]  Ruimin Li,et al.  Evaluation of speed-based travel time estimation models , 2006 .

[24]  M. R. Leadbetter,et al.  Hazard Analysis , 2018, System Safety Engineering and Risk Assessment.

[25]  H. Müller,et al.  Local Polynomial Modeling and Its Applications , 1998 .

[26]  Min Ouyang,et al.  Approximate query evaluation using linear constraint databases , 2001, Proceedings Eighth International Symposium on Temporal Representation and Reasoning. TIME 2001.

[27]  E. Nadaraya On Estimating Regression , 1964 .

[28]  Barbara von Halle,et al.  Handbook of relational database design , 1989 .

[29]  E. Nadaraya On Non-Parametric Estimates of Density Functions and Regression Curves , 1965 .

[30]  M. C. Jones,et al.  A Brief Survey of Bandwidth Selection for Density Estimation , 1996 .

[31]  Frank S. Koppelman,et al.  Perspectives on Driver Preferences for Dynamic Route Guidance Systems , 1997 .

[32]  M. Wand,et al.  An Effective Bandwidth Selector for Local Least Squares Regression , 1995 .

[33]  Peter Z. Revesz,et al.  Constraint Databases: A Survey , 1995, Semantics in Databases.