A Preliminary Study on Automatic Algorithm Selection for Short-Term Traffic Forecasting

Despite the broad range of Machine Learning (ML) algorithms, there are no clear baselines to find the best method and its configuration given a Short-Term Traffic Forecasting (STTF) problem. In ML, this is known as the Model Selection Problem (MSP). Although Automatic Algorithm Selection (AAS) has proved success dealing with MSP in other areas, it has hardly been explored in STTF. This paper deepens into the benefits of AAS in this field. To this end, we have used Auto-WEKA, a well-known AAS method, and compared it to the general approach (which consists of selecting the best of a set of algorithms) over a multi-class imbalanced classification STTF problem. Experimental results show AAS as a promising methodology in this area and allow important conclusions to be drawn on how to improve the performance of ASS methods when dealing with STTF.

[1]  Gerald Tesauro,et al.  Selecting Near-Optimal Learners via Incremental Data Allocation , 2015, AAAI.

[2]  Gang Luo,et al.  A review of automatic selection methods for machine learning algorithms and hyper-parameter values , 2016, Network Modeling Analysis in Health Informatics and Bioinformatics.

[3]  Chris Eliasmith,et al.  Hyperopt-Sklearn: Automatic Hyperparameter Configuration for Scikit-Learn , 2014, SciPy.

[4]  Aaron Klein,et al.  Efficient and Robust Automated Machine Learning , 2015, NIPS.

[5]  Kevin Leyton-Brown,et al.  Auto-WEKA: combined selection and hyperparameter optimization of classification algorithms , 2012, KDD.

[6]  Eleni I. Vlahogianni,et al.  Road Traffic Forecasting: Recent Advances and New Challenges , 2018, IEEE Intelligent Transportation Systems Magazine.

[7]  Eneko Osaba,et al.  A Hybrid Method for Short-Term Traffic Congestion Forecasting Using Genetic Algorithms and Cross Entropy , 2016, IEEE Transactions on Intelligent Transportation Systems.

[8]  Francisco Herrera,et al.  Advanced nonparametric tests for multiple comparisons in the design of experiments in computational intelligence and data mining: Experimental analysis of power , 2010, Inf. Sci..

[9]  Bartosz Krawczyk,et al.  Sentiment Classification from Multi-class Imbalanced Twitter Data Using Binarization , 2017, HAIS.

[10]  Yunpeng Wang,et al.  Large-Scale Transportation Network Congestion Evolution Prediction Using Deep Learning Theory , 2015, PloS one.

[11]  Eleni I. Vlahogianni,et al.  Optimization of traffic forecasting: Intelligent surrogate modeling , 2015 .

[12]  Mohamad Reza Ganjalikhany,et al.  In silico vaccine design against type 1 diabetes based on molecular modeling of coxsackievirus B4 epitopes , 2016, Network Modeling Analysis in Health Informatics and Bioinformatics.

[13]  Eleni I. Vlahogianni,et al.  Short-term traffic forecasting: Where we are and where we’re going , 2014 .

[14]  Hwasoo Yeo,et al.  Short-term Travel-time Prediction on Highway: A Review of the Data-driven Approach , 2015 .

[15]  Tim Kraska,et al.  Automating model search for large scale machine learning , 2015, SoCC.