A Distributed Big Data Analytics Architecture for Vehicle Sensor Data

The unceasingly increasing needs for data acquisition, storage and analysis in transportation systems have led to the adoption of new technologies and methods in order to provide efficient and reliable solutions. Both highways and vehicles, nowadays, host a vast variety of sensors collecting different types of highly fluctuating data such as speed, acceleration, direction, and so on. From the vast volume and variety of these data emerges the need for the employment of big data techniques and analytics in the context of state-of-the-art intelligent transportation systems (ITS). Moreover, the scalability needs of fleet and traffic management systems point to the direction of designing and deploying distributed architecture solutions that can be expanded in order to avoid technological and/or technical entrapments. Based on the needs and gaps detected in the literature as well as the available technologies for data gathering, storage and analysis for ITS, the aim of this study is to provide a distributed architecture platform to address these deficiencies. The architectural design of the system proposed, engages big data frameworks and tools (e.g., NoSQL Mongo DB, Apache Hadoop, etc.) as well as analytics tools (e.g., Apache Spark). The main contribution of this study is the introduction of a holistic platform that can be used for the needs of the ITS domain offering continuous collection, storage and data analysis capabilities. To achieve that, different modules of state-of-the-art methods and tools were utilized and combined in a unified platform that supports the entire cycle of data acquisition, storage and analysis in a single point. This leads to a complete solution for ITS applications which lifts the limitations imposed in legacy and current systems by the vast amounts of rapidly changing data, while offering a reliable system for acquisition, storage as well as timely analysis and reporting capabilities of these data.

[1]  Weijing Xu,et al.  Analysis of Intelligent Transportation System Application Based on Internet of Things and Big Data Technology under the Background of Information Society , 2022, Advances in Multimedia.

[2]  C. Tarhan,et al.  Application of Intelligent Transportation System Data using Big Data Technologies , 2022, 2022 Innovations in Intelligent Systems and Applications Conference (ASYU).

[3]  Jian Wang,et al.  How to improve urban transportation planning in big data era? A practice in the study of traffic analysis zone delineation , 2022, Transport Policy.

[4]  Zhihan Lv,et al.  Real-Time Intelligent Automatic Transportation Safety Based on Big Data Management , 2022, IEEE Transactions on Intelligent Transportation Systems.

[5]  H. Nguyen,et al.  Applications of Big Data Analytics in Traffic Management in Intelligent Transportation Systems , 2022, JOIV : International Journal on Informatics Visualization.

[6]  Evgenia F. Adamopoulou,et al.  A Quality Control Methodology for Heterogeneous Vehicular Data Streams , 2022, Italian National Conference on Sensors.

[7]  Khalid Elgazzar,et al.  Connected Vehicles: Technology Review, State of the Art, Challenges and Opportunities , 2021, Sensors.

[8]  Evgenia F. Adamopoulou,et al.  Driving Behaviour Analysis Using Machine and Deep Learning Methods for Continuous Streams of Vehicular Data , 2021, Sensors.

[9]  Evgenia Adamopoulou,et al.  Comparative Analysis of Machine Learning-Based Approaches for Anomaly Detection in Vehicular Data , 2021 .

[10]  Mohan Kubendiran,et al.  Survey on Big Data Techniques in Intelligent Transportation System (ITS) , 2021 .

[11]  Evgenia Adamopoulou,et al.  An Artificial Intelligence-Based Approach for the Controlled Access Ramp Metering Problem , 2021, Vehicles.

[12]  Joe Zhu,et al.  Big data algorithms and applications in intelligent transportation system: A review and bibliometric analysis , 2021, International Journal of Production Economics.

[13]  Sooyeon Shin,et al.  Implementation of a Sensor Big Data Processing System for Autonomous Vehicles in the C-ITS Environment , 2020, Applied Sciences.

[14]  Daniele Raimondi,et al.  On-Board Unit Big Data: Short-term Traffic Forecasting in Urban Transportation Networks , 2020, 2020 IEEE 7th International Conference on Data Science and Advanced Analytics (DSAA).

[15]  Celimuge Wu,et al.  Traffic big data assisted V2X communications toward smart transportation , 2019, Wireless Networks.

[16]  Yinhai Wang,et al.  Road surface friction prediction using long short-term memory neural network based on historical data , 2019, J. Intell. Transp. Syst..

[17]  Fahim Arif,et al.  Real-time data processing scheme using big data analytics in internet of things based smart transportation environment , 2019, J. Ambient Intell. Humaniz. Comput..

[18]  Indratmo,et al.  Systematic Review of the Literature on Big Data in the Transportation Domain: Concepts and Applications , 2019, Big Data Res..

[19]  Fecir Duran,et al.  The design and implementation of road condition warning system for drivers , 2019, Measurement and Control.

[20]  Raphaël Troncy,et al.  Modeling dangerous driving events based on in-vehicle data using Random Forest and Recurrent Neural Network , 2019, 2019 IEEE Intelligent Vehicles Symposium (IV).

[21]  Michael W. Levin,et al.  Vehicle sensor data-based transportation research: Modeling, analysis, and management , 2019, J. Intell. Transp. Syst..

[22]  Peng Xu,et al.  Design of Data Interchange Platform for Digital Highway , 2019, 2019 International Conference on Intelligent Transportation, Big Data & Smart City (ICITBS).

[23]  Tao Tang,et al.  Big Data Analytics in Intelligent Transportation Systems: A Survey , 2019, IEEE Transactions on Intelligent Transportation Systems.

[24]  Konstantinos Demestichas,et al.  Road Traffic Prediction Using Artificial Neural Networks , 2018, 2018 South-Eastern European Design Automation, Computer Engineering, Computer Networks and Society Media Conference (SEEDA_CECNSM).

[25]  Arif Ur Rahman,et al.  SMART TSS: Defining transportation system behavior using big data analytics in smart cities , 2018, Sustainable Cities and Society.

[26]  Javier Del Ser,et al.  Big Data for transportation and mobility: recent advances, trends and challenges , 2018, IET Intelligent Transport Systems.

[27]  Kamalrulnizam Abu Bakar,et al.  Fog Based Intelligent Transportation Big Data Analytics in The Internet of Vehicles Environment: Motivations, Architecture, Challenges, and Critical Issues , 2018, IEEE Access.

[28]  Elisabetta Raguseo,et al.  Big data technologies: An empirical investigation on their adoption, benefits and risks for companies , 2018, Int. J. Inf. Manag..

[29]  Santanu Chaudhury,et al.  Video-based road traffic monitoring and prediction using dynamic Bayesian networks , 2017 .

[30]  Lidong Wang,et al.  Heterogeneous Data and Big Data Analytics , 2017 .

[31]  Reynold Xin,et al.  Apache Spark , 2016 .

[32]  Fei-Yue Wang,et al.  Traffic Flow Prediction With Big Data: A Deep Learning Approach , 2015, IEEE Transactions on Intelligent Transportation Systems.

[33]  Nittaya Kerdprasop,et al.  The Clustering Validity with Silhouette and Sum of Squared Errors , 2015 .

[34]  Andreas Kassler,et al.  Distributed Architectures for Intelligent Transport Systems: A Survey , 2012, 2012 Second Symposium on Network Cloud Computing and Applications.

[35]  Kenneth G. Manton,et al.  Cluster Analysis: Overview , 2005 .

[36]  Jairo R. Montoya-Torres,et al.  Big Data Analytics and Intelligent Transportation Systems , 2021, IFAC-PapersOnLine.

[37]  Robithoh Annur,et al.  Information and Communication Technology (ICT) for Intelligent Transportation Systems (ITS) , 2020 .

[38]  H. Humaira,et al.  Determining The Appropiate Cluster Number Using Elbow Method for K-Means Algorithm , 2020, Proceedings of the Proceedings of the 2nd Workshop on Multidisciplinary and Applications (WMA) 2018, 24-25 January 2018, Padang, Indonesia.

[39]  Peter Conradi,et al.  An Automotive Distributed Mobile Sensor Data Collection with Machine Learning Based Data Fusion and Analysis on a Central Backend System , 2016 .

[40]  Jay Kreps,et al.  Kafka : a Distributed Messaging System for Log Processing , 2011 .

[41]  Jiawei Han,et al.  K-Means Clustering , 2021, Learn Data Mining Through Excel.

[42]  Ahmed K. Elmagarmid,et al.  Generalization of ACID Properties , 2009, Encyclopedia of Database Systems.