Efficient Streaming Mass Spatio-Temporal Vehicle Data Access in Urban Sensor Networks Based on Apache Storm

The efficient data access of streaming vehicle data is the foundation of analyzing, using and mining vehicle data in smart cities, which is an approach to understand traffic environments. However, the number of vehicles in urban cities has grown rapidly, reaching hundreds of thousands in number. Accessing the mass streaming data of vehicles is hard and takes a long time due to limited computation capability and backward modes. We propose an efficient streaming spatio-temporal data access based on Apache Storm (ESDAS) to achieve real-time streaming data access and data cleaning. As a popular streaming data processing tool, Apache Storm can be applied to streaming mass data access and real time data cleaning. By designing the Spout/bolt workflow of topology in ESDAS and by developing the speeding bolt and other bolts, Apache Storm can achieve the prospective aim. In our experiments, Taiyuan BeiDou bus location data is selected as the mass spatio-temporal data source. In the experiments, the data access results with different bolts are shown in map form, and the filtered buses’ aggregation forms are different. In terms of performance evaluation, the consumption time in ESDAS for ten thousand records per second for a speeding bolt is approximately 300 milliseconds, and that for MongoDB is approximately 1300 milliseconds. The efficiency of ESDAS is approximately three times higher than that of MongoDB.

[1]  Song Guo,et al.  A General Communication Cost Optimization Framework for Big Data Stream Processing in Geo-Distributed Data Centers , 2016, IEEE Transactions on Computers.

[2]  Ronald C. Taylor An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics , 2010, BMC Bioinformatics.

[3]  Wei Xie,et al.  A Real-Time Log Analyzer Based on MongoDB , 2014 .

[4]  Henry Muccini,et al.  CASSANDRA: An Online Failure Prediction Strategy for Dynamically Evolving Systems , 2014, ICFEM.

[5]  Shanika Karunasekera,et al.  Distributed stream clustering using micro-clusters on Apache Storm , 2017, J. Parallel Distributed Comput..

[6]  Radu Stoleru,et al.  Mobile storm: Distributed real-time stream processing for mobile clouds , 2015, 2015 IEEE 4th International Conference on Cloud Networking (CloudNet).

[7]  Kipton Barros,et al.  Distributed Database Kriging for Adaptive Sampling (D2KAS) , 2015, Comput. Phys. Commun..

[8]  Christoph Stasch,et al.  New Generation Sensor Web Enablement , 2011, Sensors.

[9]  Ying Wah Teh,et al.  Mining Personal Data Using Smartphones and Wearable Devices: A Survey , 2015, Sensors.

[10]  Nengcheng Chen,et al.  An Efficient Method of Sharing Mass Spatio-Temporal Trajectory Data Based on Cloudera Impala for Traffic Distribution Mapping in an Urban City , 2016, Sensors.

[11]  Kam-Wing Ng,et al.  A novel caching mechanism for peer-to-peer based media-on-demand streaming , 2008, J. Syst. Archit..

[12]  Gustavo Alonso,et al.  SwissQM: Next Generation Data Processing in Sensor Networks , 2007, CIDR.

[13]  Jian Pei,et al.  A spatiotemporal compression based approach for efficient big data processing on Cloud , 2014, J. Comput. Syst. Sci..

[14]  Lin Liu,et al.  Memcache and MongoDB Based GIS Web Service , 2012, 2012 Second International Conference on Cloud and Green Computing.

[15]  Yuanxi Yang,et al.  Performance assessment of single- and dual-frequency BeiDou/GPS single-epoch kinematic positioning , 2014, GPS Solutions.

[16]  Yi-Bing Lin,et al.  Effects of cache mechanism on wireless data access , 2003, IEEE Trans. Wirel. Commun..

[17]  Ce-Kuen Shieh,et al.  A topology-based scaling mechanism for Apache Storm , 2017, Int. J. Netw. Manag..

[18]  Sushmita Ruj,et al.  A Decentralized Security Framework for Data Aggregation and Access Control in Smart Grids , 2013, IEEE Transactions on Smart Grid.

[19]  Daniele Marioli,et al.  Application of an ANFIS Algorithm to Sensor Data Processing , 2005, IEEE Transactions on Instrumentation and Measurement.

[20]  Ryum-Duck Oh,et al.  Efficient Sensor Stream Data Processing System to use Cache Technique for Ubiquitous Sensor Network Application Service , 2012 .

[21]  Ramez Elmasri,et al.  Architectures for streaming data processing in sensor networks , 2005, The 3rd ACS/IEEE International Conference onComputer Systems and Applications, 2005..

[22]  Enrico Capobianco,et al.  Smart Cities, Big Data, and Communities: Reasoning From the Viewpoint of Attractors , 2016, IEEE Access.

[23]  Pari Delir Haghighi,et al.  An Evaluation of Data Stream Processing Systems for Data Driven Applications , 2016, ICCS.

[24]  Vlad Trifa,et al.  Interacting with the SOA-Based Internet of Things: Discovery, Query, Selection, and On-Demand Provisioning of Web Services , 2010, IEEE Transactions on Services Computing.

[25]  Chetan Kumar,et al.  A new approach for a proxy-level web caching mechanism , 2008, Decis. Support Syst..