Spark-Based Parallel Genetic Algorithm for Simulating a Solution of Optimal Deployment of an Underwater Sensor Network

Underwater sensor networks have wide application prospects, but the large-scale sensing node deployment is severely hindered by problems like energy constraints, long delays, local disconnections, and heavy energy consumption. These problems can be solved effectively by optimizing sensing node deployment with a genetic algorithm. However, the genetic algorithm (GA) needs many iterations in solving the best location of underwater sensor deployment, which results in long running time delays and limited practical application when dealing with large-scale data. The classical parallel framework Hadoop can improve the GA running efficiency to some extent while the state-of-the-art parallel framework Spark can release much more parallel potential of GA by realizing parallel crossover, mutation, and other operations on each computing node. Giving full allowance for the working environment of the underwater sensor network and the characteristics of sensors, this paper proposes a Spark-based parallel GA to calculate the extremum of the Shubert multi-peak function, through which the optimal deployment of the underwater sensor network can be obtained. Experimental results show that while faced with a large-scale underwater sensor network, compared with single node and Hadoop framework, the Spark-based implementation not only significantly reduces the running time but also effectively avoids the problem of premature convergence because of its powerful randomness.

[1]  Enrique Alba,et al.  Parallel Genetic Algorithms , 2011, Studies in Computational Intelligence.

[2]  Guangjie Han,et al.  SSL: Smart Street Lamp Based on Fog Computing for Smarter Cities , 2018, IEEE Transactions on Industrial Informatics.

[3]  Prasanta K. Jana,et al.  Genetic algorithm approach for k-coverage and m-connected node placement in target based wireless sensor networks , 2016, Comput. Electr. Eng..

[4]  Guangjie Han,et al.  Hybrid-LRU Caching for Optimizing Data Storage and Retrieval in Edge Computing-Based Wearable Sensors , 2019, IEEE Internet of Things Journal.

[5]  Yujie Zhang,et al.  Spark: A Big Data Processing Platform Based on Memory Computing , 2015, 2015 Seventh International Symposium on Parallel Architectures, Algorithms and Programming (PAAP).

[6]  Davide Anguita,et al.  Building an Underwater Wireless Sensor Network Based on Optical: Communication: Research Challenges and Current Results , 2009, 2009 Third International Conference on Sensor Technologies and Applications.

[7]  Xin Liu,et al.  Deployment optimization for 3D industrial wireless sensor networks based on particle swarm optimizers with distributed parallelism , 2018, J. Netw. Comput. Appl..

[8]  Guangjie Han,et al.  Edge Computing-Based Intelligent Manhole Cover Management System for Smart Cities , 2018, IEEE Internet of Things Journal.

[9]  Rajkumar Buyya,et al.  MRPGA: An Extension of MapReduce for Parallelizing Genetic Algorithms , 2008, 2008 IEEE Fourth International Conference on eScience.

[10]  Xuehai Zhou,et al.  Memory power optimization on different memory address mapping schemas , 2014, 2014 IEEE 20th International Conference on Embedded and Real-Time Computing Systems and Applications.

[11]  Linda Di Geronimo,et al.  A Parallel Genetic Algorithm Based on Hadoop MapReduce for the Automatic Generation of JUnit Test Suites , 2012, 2012 IEEE Fifth International Conference on Software Testing, Verification and Validation.

[12]  Leonardo Vanneschi,et al.  Introduction: special issue on parallel and distributed evolutionary algorithms, part I , 2009, Genetic Programming and Evolvable Machines.

[13]  Sanjay Ghemawat,et al.  MapReduce: simplified data processing on large clusters , 2008, CACM.

[14]  Guang Yang,et al.  Analysis of security and threat of underwater wireless sensor network topology , 2012, Digital Image Processing.

[15]  Guangjie Han,et al.  Localization Algorithms of Underwater Wireless Sensor Networks: A Survey , 2012, Sensors.

[16]  Guangjie Han,et al.  Coordinate Memory Deduplication and Partition for Improving Performance in Cloud Computing , 2019, IEEE Transactions on Cloud Computing.

[17]  Xiang Min,et al.  Energy efficient clustering algorithm for maximizing lifetime of wireless sensor networks , 2010 .

[18]  Roger L. Wainwright,et al.  A parallel island model genetic algorithm for the multiprocessor scheduling problem , 1994, SAC '94.

[19]  Guangjie Han,et al.  A Maximum Cache Value Policy in Hybrid Memory-Based Edge Computing for Mobile Devices , 2019, IEEE Internet of Things Journal.

[20]  Enrique Alba,et al.  A survey of parallel distributed genetic algorithms , 1999, Complex..

[21]  Zygmunt J. Haas,et al.  Coverage and connectivity in three-dimensional networks with random node deployment , 2015, Ad Hoc Networks.

[22]  Jing Cao,et al.  Impacts of Memory Address Mapping Scheme on Reducing DRAM Self-Refresh Power for Mobile Computing Devices , 2018, IEEE Access.

[23]  Fan Wu,et al.  A Thread-Oriented Memory Resource Management Framework for Mobile Edge Computing , 2019, IEEE Access.

[24]  Michael J. Franklin,et al.  Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing , 2012, NSDI.

[25]  Z. Q. Wang,et al.  Research on big data processing technology based on Hadoop , 2016 .

[26]  Nor-Syahidatul N. Ismail,et al.  Analyzing the Performance of Acoustic Channel in Underwater Wireless Sensor Network (UWSN) , 2010, 2010 Fourth Asia International Conference on Mathematical/Analytical Modelling and Computer Simulation.

[27]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[28]  Xavier Llorà,et al.  Scaling Genetic Algorithms Using MapReduce , 2009, 2009 Ninth International Conference on Intelligent Systems Design and Applications.

[29]  Lekha R. Nair,et al.  Applying spark based machine learning model on streaming big data for health status prediction , 2017, Comput. Electr. Eng..