Quality-aware aggregation & predictive analytics at the edge

We investigate the quality of aggregation and predictive analytics in edge computing environments. Edge analytics require pushing processing and inference to the edge of a network of sensing & actuator nodes, which enables huge amount of contextual data to be processed in real time that would be prohibitively complex and costly to transfer on centralized locations. We propose a quality-aware, time-optimized edge analytics model that supports communication efficient predictive modeling within the edge network. Our idea rests on the capability of edge nodes to intelligently decide when and which data to deliver and process in light of minimizing the communication overhead and maximizing the quality of analytics results. We provide mathematical modeling, performance and comparative assessment over real datasets showing its benefits in edge computing environments.

[1]  G. Simons Great Expectations: Theory of Optimal Stopping , 1973 .

[2]  David Siegmund,et al.  Great expectations: The theory of optimal stopping , 1971 .

[3]  Rangala Manasa Vehicle Assisted Device To Device Data Delivery for Smart Grid , 2017 .

[4]  D.P. Agrawal,et al.  APTEEN: a hybrid protocol for efficient routing and comprehensive information retrieval in wireless , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.

[5]  Chris Tofallis,et al.  A better measure of relative prediction accuracy for model selection and model estimation , 2014, J. Oper. Res. Soc..

[6]  Lídice García Ríos,et al.  Big Data Infrastructure for analyzing data generated by Wireless Sensor Networks , 2014, 2014 IEEE International Congress on Big Data.

[7]  Christos Anagnostopoulos,et al.  Predictive intelligence to the edge: impact on edge analytics , 2018, Evol. Syst..

[8]  C. D. Kemp,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[9]  Christos Anagnostopoulos,et al.  Time-optimized contextual information forwarding in mobile sensor networks , 2014, J. Parallel Distributed Comput..

[10]  Siem Jan Koopman,et al.  Time Series Analysis by State Space Methods , 2001 .

[11]  Kun-Lung Wu,et al.  Sliding windows over uncertain data streams , 2014, Knowledge and Information Systems.

[12]  Shudong Jin,et al.  Prediction or Not? An Energy-Efficient Framework for Clustering-Based Data Collection in Wireless Sensor Networks , 2011, IEEE Transactions on Parallel and Distributed Systems.

[13]  Goutham Kamath,et al.  Pushing Analytics to the Edge , 2016, 2016 IEEE Global Communications Conference (GLOBECOM).

[14]  E. Massera,et al.  On field calibration of an electronic nose for benzene estimation in an urban pollution monitoring scenario , 2008 .

[15]  Christos Anagnostopoulos Quality-optimized predictive analytics , 2016, Applied Intelligence.

[16]  Hamid Pirahesh,et al.  Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals , 1996, Data Mining and Knowledge Discovery.

[17]  A. Manjeshwar,et al.  TEEN: a routing protocol for enhanced efficiency in wireless sensor networks , 2001, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001.

[18]  Assaf Schuster,et al.  Monitoring Least Squares Models of Distributed Streams , 2015, KDD.

[19]  Zhuo Chen,et al.  Edge Analytics in the Internet of Things , 2015, IEEE Pervasive Computing.

[20]  Timos K. Sellis,et al.  Maintaining consistent results of continuous queries under diverse window specifications , 2011, Inf. Syst..