A New Distributed and Decentralized Stochastic Optimization Algorithm with Applications in Big Data Analytics

The world is witnessing unprecedented growth in the demand for data analytics. Big Data is distinguished by three main characteristics: velocity, variety, and volume. An open challenge facing the data community is how to scale up analytic algorithms. To address it, optimization over large-scale data sets has attracted considerable research attention in recent years. In this paper, we first review recent advances in optimization for Big Data analytics. We then introduce a fully distributed stochastic optimization algorithm for decision making over large-scale data sets. We also propose an optimal weight design for the algorithm and study its performance in a practical application in cognitive networks. Experimental results confirm that the proposed method performs well and show that it is distributed, scalable, and robust to missing data and communication failures.
