RGCA: A Reliable GPU Cluster Architecture for Large-Scale Internet of Things Computing Based on Effective Performance-Energy Optimization

This paper aims to develop a low-cost, high-performance and high-reliability computing system to process large-scale data using common data mining algorithms in the Internet of Things (IoT) computing environment. Considering the characteristics of IoT data processing, similar to mainstream high performance computing, we use a GPU (Graphics Processing Unit) cluster to achieve better IoT services. Firstly, we present an energy consumption calculation method (ECCM) based on WSNs. Then, using the CUDA (Compute Unified Device Architecture) Programming model, we propose a Two-level Parallel Optimization Model (TLPOM) which exploits reasonable resource planning and common compiler optimization techniques to obtain the best blocks and threads configuration considering the resource constraints of each node. The key to this part is dynamic coupling Thread-Level Parallelism (TLP) and Instruction-Level Parallelism (ILP) to improve the performance of the algorithms without additional energy consumption. Finally, combining the ECCM and the TLPOM, we use the Reliable GPU Cluster Architecture (RGCA) to obtain a high-reliability computing system considering the nodes’ diversity, algorithm characteristics, etc. The results show that the performance of the algorithms significantly increased by 34.1%, 33.96% and 24.07% for Fermi, Kepler and Maxwell on average with TLPOM and the RGCA ensures that our IoT computing system provides low-cost and high-reliability services.

[1]  Naixue Xiong,et al.  Energy-Efficient Algorithm for Broadcasting in Ad Hoc Wireless Sensor Networks , 2013, Sensors.

[2]  Reza Malekian,et al.  Accurate Vehicle Location System Using RFID, an Internet of Things Approach , 2016, Sensors.

[3]  Daniel R. Johnson,et al.  A variable warp size architecture , 2015, 2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA).

[4]  Justyna Petke,et al.  Reducing Energy Consumption Using Genetic Improvement , 2015, GECCO.

[5]  Athanasios V. Vasilakos,et al.  Data Mining for the Internet of Things: Literature Review and Challenges , 2015, Int. J. Distributed Sens. Networks.

[6]  Andrea Zanella,et al.  Internet of Things for Smart Cities , 2014, IEEE Internet of Things Journal.

[7]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.

[8]  Wu He,et al.  Internet of Things in Industries: A Survey , 2014, IEEE Transactions on Industrial Informatics.

[9]  Tian Zhang,et al.  BIRCH: an efficient data clustering method for very large databases , 1996, SIGMOD '96.

[10]  Naixue Xiong,et al.  An Effective Massive Sensor Network Data Access Scheme Based on Topology Control for the Internet of Things , 2016, Sensors.

[11]  Jiawei Han,et al.  CLARANS: A Method for Clustering Objects for Spatial Data Mining , 2002, IEEE Trans. Knowl. Data Eng..

[12]  Athanasios V. Vasilakos,et al.  Information centric network: Research challenges and opportunities , 2015, J. Netw. Comput. Appl..

[13]  Behrad Bagheri,et al.  Application of data mining and feature extraction on intelligent fault diagnosis by Artificial Neural Network and k-nearest neighbor , 2010, The XIX International Conference on Electrical Machines - ICEM 2010.

[14]  Mark Harman,et al.  Review of "Genetic Programming and Evolvable Machines: Special Issue on Semantic Methods in Genetic Programming" , 2016, SEVO.

[15]  Dongsheng Ma,et al.  Enabling Power-Efficient DVFS Operations on Silicon , 2010, IEEE Circuits and Systems Magazine.

[16]  Xindong Wu,et al.  Data mining with big data , 2014, IEEE Transactions on Knowledge and Data Engineering.

[17]  Kevin Skadron,et al.  Rodinia: A benchmark suite for heterogeneous computing , 2009, 2009 IEEE International Symposium on Workload Characterization (IISWC).

[18]  Prafulla Kumar Choubey,et al.  Power efficient, bandwidth optimized and fault tolerant sensor management for IOT in Smart Home , 2015, 2015 IEEE International Advance Computing Conference (IACC).

[19]  Richard G Baraniuk,et al.  More Is Less: Signal Processing and the Data Deluge , 2011, Science.

[20]  Mohamed S. Kamel,et al.  Efficient phrase-based document indexing for Web document clustering , 2004, IEEE Transactions on Knowledge and Data Engineering.

[21]  Syed Hassan Ahmed,et al.  A Novel Scheme for an Energy Efficient Internet of Things Based on Wireless Sensor Networks , 2015, Sensors.

[22]  Rajkumar Buyya,et al.  Energy-aware resource allocation heuristics for efficient management of data centers for Cloud computing , 2012, Future Gener. Comput. Syst..

[23]  Andreas Moshovos,et al.  Demystifying GPU microarchitecture through microbenchmarking , 2010, 2010 IEEE International Symposium on Performance Analysis of Systems & Software (ISPASS).

[24]  Mahmut T. Kandemir,et al.  Exploiting Inter-Warp Heterogeneity to Improve GPGPU Performance , 2015, 2015 International Conference on Parallel Architecture and Compilation (PACT).

[25]  Han Xiao Towards Parallel and Distributed Computing in Large-Scale Data Mining : A Survey , 2010 .

[26]  Anil K. Jain Data clustering: 50 years beyond K-means , 2008, Pattern Recognit. Lett..

[27]  Lida Xu,et al.  The internet of things: a survey , 2014, Information Systems Frontiers.

[28]  Lida Xu,et al.  An Integrated System for Regional Environmental Monitoring and Management Based on Internet of Things , 2014, IEEE Transactions on Industrial Informatics.

[29]  Shen Bin,et al.  Research on data mining models for the internet of things , 2010, 2010 International Conference on Image Analysis and Signal Processing.

[30]  Charles A. Bouman,et al.  CLUSTER: An Unsupervised Algorithm for Modeling Gaussian Mixtures , 2014 .

[31]  Marimuthu Palaniswami,et al.  Internet of Things (IoT): A vision, architectural elements, and future directions , 2012, Future Gener. Comput. Syst..

[32]  Collin McCurdy,et al.  The Scalable Heterogeneous Computing (SHOC) benchmark suite , 2010, GPGPU-3.

[33]  Arkady B. Zaslavsky,et al.  Context Aware Computing for The Internet of Things: A Survey , 2013, IEEE Communications Surveys & Tutorials.

[34]  Marimuthu Palaniswami,et al.  An Information Framework for Creating a Smart City Through Internet of Things , 2014, IEEE Internet of Things Journal.

[35]  Yi Yang,et al.  Many-thread aware instruction-level parallelism: Architecting shader cores for GPU computing , 2012, 2012 21st International Conference on Parallel Architectures and Compilation Techniques (PACT).

[36]  Laurence T. Yang,et al.  Data Mining for Internet of Things: A Survey , 2014, IEEE Communications Surveys & Tutorials.

[37]  Samuel Madden,et al.  From Databases to Big Data , 2012, IEEE Internet Comput..

[38]  Won Woo Ro,et al.  Virtual Thread: Maximizing Thread-Level Parallelism beyond GPU Scheduling Limit , 2016, ISCA.

[39]  Mahmut T. Kandemir,et al.  Neither more nor less: Optimizing thread-level parallelism for GPGPUs , 2013, Proceedings of the 22nd International Conference on Parallel Architectures and Compilation Techniques.

[40]  Arkady B. Zaslavsky,et al.  Sensing as a service model for smart cities supported by Internet of Things , 2013, Trans. Emerg. Telecommun. Technol..

[41]  Won Woo Ro,et al.  Warped-preexecution: A GPU pre-execution approach for improving latency hiding , 2016, 2016 IEEE International Symposium on High Performance Computer Architecture (HPCA).

[42]  S. Hussain,et al.  Genetic Algorithm for Energy Efficient Clusters in Wireless Sensor Networks , 2007, Fourth International Conference on Information Technology (ITNG'07).

[43]  Davide Maltoni,et al.  Large-scale fingerprint identification on GPU , 2015, Inf. Sci..

[44]  Wonyong Yoon,et al.  A Survey on Energy Conserving Mechanisms for the Internet of Things: Wireless Networking Aspects , 2015, Sensors.

[45]  Keqin Li,et al.  Re-Stream: Real-time and energy-efficient resource scheduling in big data stream computing environments , 2015, Inf. Sci..