Distributed Decomposed Data Analytics in Fog Enabled IoT Deployments

The edge of the network plays a vital role in an Internet of Things (IoT) system, serving as an optimal site to perform an operation on data before transmitting it over the network. We present the fog-specific decomposition of multivariate linear regression as the predictive analytic model in our work using the statistical query model and summation form. The decomposition method used is not the contribution, but applying the decomposition method to the analytics model to run in a distributed manner in the fog-enabled IoT deployments is the contribution. What is novel is the decomposition made on a fog-based distributed setting. To test the performance, our proposed approach has been applied to a real-world dataset and evaluated using a fog computing testbed. The proposed method avoids sending raw data to the cloud and offers balanced computation in the infrastructure. The results show an 80% reduction in the amount of data transferred to the cloud using the proposed fog-based distributed data analytics approach compared with the conventional cloud-based approach. Furthermore, by adopting the proposed distributed approach, we observed a 98% drop in the time taken to arrive at the final result compared with the cloud-centric approach. We also present the results on the quality of analytics solution obtained in both approaches, and they suggest that the fog-based distributed analytics approach can serve as equally as the traditional cloud-centric approach.

[1]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, CACM.

[2]  Athanasios V. Vasilakos,et al.  Fog Computing for Sustainable Smart Cities , 2017, ArXiv.

[3]  Alan Davy,et al.  Resource aware placement of IoT application modules in Fog-Cloud Computing Paradigm , 2017, 2017 IFIP/IEEE Symposium on Integrated Network and Service Management (IM).

[4]  Amy L. Murphy,et al.  Practical Data Prediction for Real-World Wireless Sensor Networks , 2015, IEEE Transactions on Knowledge and Data Engineering.

[5]  M. Benzi Preconditioning techniques for large linear systems: a survey , 2002 .

[6]  Marimuthu Palaniswami,et al.  Internet of Things (IoT): A vision, architectural elements, and future directions , 2012, Future Gener. Comput. Syst..

[7]  Xuemin Shen,et al.  Vehicle-Assisted Device-to-Device Data Delivery for Smart Grid , 2016, IEEE Transactions on Vehicular Technology.

[8]  Soumya K. Ghosh,et al.  SPIN: A Fast and Scalable Matrix Inversion Method in Apache Spark , 2018, ICDCN.

[9]  Shudong Jin,et al.  Prediction or Not? An Energy-Efficient Framework for Clustering-Based Data Collection in Wireless Sensor Networks , 2011, IEEE Transactions on Parallel and Distributed Systems.

[10]  Alan Davy,et al.  Poster Abstract: Resource Aware Placement of Data Stream Analytics Operators on Fog Infrastructure for Internet of Things Applications , 2016, 2016 IEEE/ACM Symposium on Edge Computing (SEC).

[11]  Bo Tang,et al.  Incorporating Intelligence in Fog Computing for Big Data Analysis in Smart Cities , 2017, IEEE Transactions on Industrial Informatics.

[12]  Gianluigi Ferrari,et al.  A Graph-Based Cloud Architecture for Big Stream Real-Time Applications in the Internet of Things , 2014, ESOCC Workshops.

[13]  S. Rahimifard,et al.  Unlocking the Potential of the Internet of Things to Improve Resource Efficiency in Food Supply Chains , 2017, Innovative Approaches and Applications for Sustainable Rural Development.

[14]  Erdogan Dogdu,et al.  Context-Aware Computing, Learning, and Big Data in Internet of Things: A Survey , 2018, IEEE Internet of Things Journal.

[15]  Christos Anagnostopoulos,et al.  Time-optimized contextual information forwarding in mobile sensor networks , 2014, J. Parallel Distributed Comput..

[16]  Rajkumar Buyya,et al.  Fog Computing: A Taxonomy, Survey and Future Directions , 2016, Internet of Everything.

[17]  E. Massera,et al.  On field calibration of an electronic nose for benzene estimation in an urban pollution monitoring scenario , 2008 .

[18]  Goutham Kamath,et al.  Pushing Analytics to the Edge , 2016, 2016 IEEE Global Communications Conference (GLOBECOM).

[19]  Ganesh K. Venayagamoorthy,et al.  Computational Intelligence in Wireless Sensor Networks: A Survey , 2011, IEEE Communications Surveys & Tutorials.

[20]  Raja Lavanya,et al.  Fog Computing and Its Role in the Internet of Things , 2019, Advances in Computer and Electrical Engineering.

[21]  Sudip Misra,et al.  Theoretical modelling of fog computing: a green computing paradigm to support IoT applications , 2016, IET Networks.

[22]  Christos Anagnostopoulos,et al.  Quality-aware aggregation & predictive analytics at the edge , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[23]  Assaf Schuster,et al.  Lightweight Monitoring of Distributed Streams , 2018, ACM Trans. Database Syst..

[24]  Thanassis Tiropanis,et al.  Analytics for the Internet of Things , 2018, ACM Comput. Surv..

[25]  James Demmel,et al.  Direct QR factorizations for tall-and-skinny matrices in MapReduce architectures , 2013, 2013 IEEE International Conference on Big Data.

[26]  Michael J. McGrath,et al.  Sensor Network Topologies and Design Considerations , 2013 .

[27]  Assaf Schuster,et al.  Monitoring Least Squares Models of Distributed Streams , 2015, KDD.

[28]  Anurag Agarwal,et al.  The Internet of Things—A survey of topics and trends , 2014, Information Systems Frontiers.

[29]  Alan Davy,et al.  Resource Aware Placement of Data Analytics Platform in Fog Computing , 2016, Cloud Forward.

[30]  Léon Bottou,et al.  The Tradeoffs of Large Scale Learning , 2007, NIPS.

[31]  Lei Shi,et al.  Distributed Least-Squares Iterative Methods in Networks: A Survey , 2017, ArXiv.

[32]  Michael Kearns,et al.  Efficient noise-tolerant learning from statistical queries , 1993, STOC.

[33]  Prabal Dutta,et al.  The Internet of Things Has a Gateway Problem , 2015, HotMobile.

[34]  Hyeonwoo Kim,et al.  Correlation analysis of MQTT loss and delay according to QoS level , 2013, The International Conference on Information Networking 2013 (ICOIN).

[35]  Nikolai Helwig,et al.  Condition monitoring of a complex hydraulic system using multivariate statistics , 2015, 2015 IEEE International Instrumentation and Measurement Technology Conference (I2MTC) Proceedings.

[36]  Mung Chiang,et al.  Decomposing Data Analytics in Fog Networks , 2017, SenSys.

[37]  Heyu Wang,et al.  Distributed Quantile Regression Over Sensor Networks , 2018, IEEE Transactions on Signal and Information Processing over Networks.

[38]  Tao Zhang,et al.  Fog and IoT: An Overview of Research Opportunities , 2016, IEEE Internet of Things Journal.