An Architecture for Intelligent Data Processing on IoT Edge Devices

As the Internet of Things edges closer to mainstream adoption, with it comes an exponential rise in data transmission across the current Internet architecture. Capturing and analyzing this data will lead to a wealth of opportunities. However, this ungoverned, unstructured data has the potential to exhaust the resources of an already strained infrastructure. Analyzing data as close to the sources as possible would greatly enhance the success of the IoT. This paper proposes a distributed data processing architecture for edge devices in an IoT environment. Our approach focuses on a vehicular trucking use case. The goal is to recreate the traditionally centralized Storm processes on the edge devices using a combination of Apache MiNiFi and the user’s custombuilt programs. Our approach is shown to preserve computational accuracy while reducing by upwards of 90 percent the volume of data transferred from edge devices for centralized processing.

[1]  Hirozumi Yamaguchi,et al.  Middleware for Proximity Distributed Real-Time Processing of IoT Data Flows , 2016, 2016 IEEE 36th International Conference on Distributed Computing Systems (ICDCS).

[2]  Bin Cheng,et al.  Real-time data reduction at the network edge of Internet-of-Things systems , 2015, 2015 11th International Conference on Network and Service Management (CNSM).

[3]  K. Barraclough Eclipse , 2006, BMJ : British Medical Journal.

[4]  Florin Pop,et al.  Soft Real-Time Hadoop Scheduler for Big Data Processing in Smart Cities , 2016, 2016 IEEE 30th International Conference on Advanced Information Networking and Applications (AINA).

[5]  Ameet Talwalkar,et al.  MLlib: Machine Learning in Apache Spark , 2015, J. Mach. Learn. Res..

[6]  Robert Evans Apache Storm, a Hands on Tutorial , 2015, 2015 IEEE International Conference on Cloud Engineering.

[7]  Rachida Dssouli,et al.  Big Data Pre-processing: A Quality Framework , 2015, 2015 IEEE International Congress on Big Data.