Optimization of Support Vector Regression Models for Stormwater Prediction

In this paper we propose a solution to a real-world time series regression problem: the prediction of fill levels of stormwater tanks. Our regression model is based on Support Vector Regression (SVR), but can easily be replaced with other data mining methods. The main intention of the work is to overcome frequently occuring problems in data mining by automatically tuning both preprocessing and hyperparameters. We highly believe that many models can be improved by a systematic preprocessing and hyperparameter tuning. The optimization of our model is presented in a step-by-step manner which can easily be adapted to other time series problems. We point out possible issues of parameter tuning, e.g., we analyze our tuned models with respect to overfitting and oversearching (which are effects that might lead to a reduced model generalizability) and present methods to circumvent such issues.

[1]  Thomas Bartz-Beielstein,et al.  Optimizing Support Vector Machines for Stormwater Prediction , 2010 .

[2]  Markus Reischl,et al.  Die MATLAB-Toolbox Gait-CAD , 2006 .

[3]  Thomas Bartz-Beielstein,et al.  Genetic Programming Applied to Predictive Control in Environmental Engineering , 2009 .

[4]  Richard A. Davis,et al.  Time Series: Theory and Methods , 2013 .

[5]  Chih-Jen Lin,et al.  IJCNN 2001 challenge: generalization ability and text decoding , 2001, IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222).

[6]  Thomas Bartz-Beielstein,et al.  SPOT: An R Package For Automatic and Interactive Tuning of Optimization Algorithms by Sequential Parameter Optimization , 2010, ArXiv.

[7]  Thomas Bartz-Beielstein,et al.  Parameter-Tuned Data Mining: A General Framework , 2010 .

[8]  Gunnar Rätsch,et al.  Predicting Time Series with Support Vector Machines , 1997, ICANN.

[9]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[10]  J. A. Stewart,et al.  Nonlinear Time Series Analysis , 2015 .

[11]  Thomas Bartz-Beielstein,et al.  Experimental Research in Evolutionary Computation - The New Experimentalism , 2010, Natural Computing Series.

[12]  Thomas Bartz-Beielstein,et al.  Optimierte Modellierung von Füllständen in Regenüberlaufbecken mittels CI-basierter ParameterselektionOptimized Modelling of Fill Levels in Stormwater Tanks Using CI-based Parameter Selection Schemes , 2009, Autom..

[13]  Simon Haykin,et al.  Support vector machines for dynamic reconstruction of a chaotic system , 1999 .

[14]  H. Theil,et al.  Economic Forecasts and Policy. , 1959 .

[15]  Andy J. Keane,et al.  Engineering Design via Surrogate Modelling - A Practical Guide , 2008 .

[16]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.