An Architecture to Process Massive Vehicular Traffic Data

Fostered by the "big data" hype in mobility, many research efforts have aimed at improving techniques for modeling vehicular traffic patterns for mobility prediction. From a practical standpoint, however, the industry still faces many technological challenges in bringing solutions to market. Scalability and performance in particular raise major concerns, given the amount of spatio-temporal data to be processed. The common approach to these issues is to introduce constraints and/or simplifications on both the spatial component of the data and the algorithms employed, leading to somewhat limited results. To overcome these issues, in this paper we report on our experiences and approaches in providing a solution that meets industrial needs by leveraging the computational and storage capabilities of the Cloud to handle massive datasets for vehicular traffic prediction. In particular, we present an approach for dealing with real-world datasets that facilitates the knowledge discovery process on this data while meeting the business constraints of the industrial use case.
