Real-time data reduction at the network edge of Internet-of-Things systems

The expected huge increase in the number of IoT data sources (sensors, embedded systems, personal devices etc.) will give rise to network-edge computing, i.e., data pre-processing, local storage, and filtering close to the data sources. Specifically, data reduction at the network edge (e.g., on an IoT gateway device or a mini-server deployed locally at an IoT area network) can prevent I/O bottlenecks, as well as dramatically reduce storage, bandwidth, and energy costs. However, current solutions face two main obstacles towards achieving this benefits of network-edge computing. Firstly, the most efficient algorithms for data reduction of time series (which is one of the prevailing kinds of data in IoT) are developed to work a posteriori upon big datasets and they cannot take decisions per incoming data item. Secondly, the state of the art lacks systems that can apply any of many different possible data reduction methods without adding significant delays or heavyweight re-configurations. This paper presents a solution that automates the switching between different data handling algorithms at the network edge, including an analysis of adjusted data reduction methods, as well as three flavors of a new algorithm that is capable of performing real-time reduction of incoming time series items based on the concept of Perceptually Important Points. The potential benefits are evaluated upon real datasets from street, household, and robot sensors, showing that our solution achieves accuracies between 76,1 % and 93,8 % despite forwarding only 1/3 of the data items, without adding significant forwarding delays.

[1]  Badrinath Roysam,et al.  Image change detection algorithms: a systematic survey , 2005, IEEE Transactions on Image Processing.

[2]  Tak-chung Fu,et al.  Flexible time series pattern matching based on perceptually important points , 2001 .

[3]  Ralf Steinmetz,et al.  Scoresheet-based event relevance determination for energy efficiency in wireless sensor networks , 2011, 2011 IEEE 36th Conference on Local Computer Networks.

[4]  Apostolos Papageorgiou,et al.  Efficient filtering processes for machine-to-machine data based on automation modules and data-agnostic algorithms , 2014, Int. J. Bus. Process. Integr. Manag..

[5]  More than 50 billion connected devices , 2011 .

[6]  Eugene Fink,et al.  Search for Patterns in Compressed Time Series , 2002, Int. J. Image Graph..

[7]  Karl Johan Åström,et al.  On the choice of sampling rates in parametric identification of time series , 1969, Inf. Sci..

[8]  Olivier Hersent,et al.  M2M Communications: A Systems Approach , 2012 .

[9]  Man Hon Wong,et al.  Efficient and robust feature extraction and pattern matching of time series by a lattice structure , 2001, CIKM '01.

[10]  Wei Tang,et al.  FlexAnalytics: A Flexible Data Analytics Framework for Big Data Applications with I/O Performance Improvement , 2014, Big Data Res..

[11]  Christos Faloutsos,et al.  Fast Time Sequence Indexing for Arbitrary Lp Norms , 2000, VLDB.

[12]  Liaoruo Huang,et al.  Policy control and traffic aggregation for M2M services in mobile networks , 2013, Proceedings 2013 International Conference on Mechatronic Sciences, Electric Engineering and Computer (MEC).