An Edge Computing Platform for Intelligent Operational Monitoring in Internet Data Centers

The increasing demand for cloud-based services, such as big data analytics and online e-commerce, leads to rapid growth of large-scale internet data centers. In order to provide highly reliable, cost effective, and high quality cloud services, data centers are equipped with sensors to monitor the operational states of infrastructure hardware, such as servers, storage arrays, networking devices, and computer room air conditioning systems. However, such coarse grained monitoring cannot provide fine grained real time information for resource multiplexing and job scheduling. Moreover, the monitoring of node level power consumption plays an important role in the optimization of workload placement and energy efficiency in data centers. In this paper, we propose an edge computing platform for intelligent operational monitoring in data centers. The platform integrates wireless sensors and on-board built-in sensors to collect data during the operation and maintenance of data centers. Using logical functions, we divide the data center clusters into grids, and then deploy wireless sensors and edge servers in each grid. As such, data processing on edge servers can reduce the latency in data transmission to central clouds and thereby enhance the real time resource mapping decisions in data centers. In addition, the proposed platform also provides predictions of resource utilization, workload characteristics, and hardware health trends in data centers.

[1]  Jie Wu,et al.  Scan-Based Movement-Assisted Sensor Deployment Methods in Wireless Sensor Networks , 2007, IEEE Transactions on Parallel and Distributed Systems.

[2]  Weisong Shi,et al.  Energy efficiency comparison of hypervisors , 2017, 2016 Seventh International Green and Sustainable Computing Conference (IGSC).

[3]  Sudipta Sahana,et al.  An Adaptive Cloud Service Observation using Billboard Manager Cloud Monitoring Tool , 2015 .

[4]  Wei Liu,et al.  A low redundancy data collection scheme to maximize lifetime using matrix completion technique , 2019, EURASIP J. Wirel. Commun. Netw..

[5]  Yi Jia,et al.  Wireless sensor network for data-center environmental monitoring , 2011, 2011 Fifth International Conference on Sensing Technology.

[6]  Zhiwen Zeng,et al.  Adaption Resizing Communication Buffer to Maximize Lifetime and Reduce Delay for WVSNs , 2019, IEEE Access.

[7]  Qun Li,et al.  Fog Computing: Platform and Applications , 2015, 2015 Third IEEE Workshop on Hot Topics in Web Systems and Technologies (HotWeb).

[8]  Yucong Duan,et al.  Toward service selection for workflow reconfiguration: An interface-based computing solution , 2018, Future Gener. Comput. Syst..

[9]  Guangjie Han,et al.  Characteristics of Co-Allocated Online Services and Batch Jobs in Internet Data Centers: A Case Study From Alibaba Cloud , 2019, IEEE Access.

[10]  Henry Hoffmann,et al.  JouleGuard: energy guarantees for approximate applications , 2015, SOSP.

[11]  Tao Zhang,et al.  MicroThings: A Generic IoT Architecture for Flexible Data Aggregation and Scalable Service Cooperation , 2017, IEEE Communications Magazine.

[12]  Yu-Wei Su,et al.  A Comparative Study of Wireless Protocols: Bluetooth, UWB, ZigBee, and Wi-Fi , 2007, IECON 2007 - 33rd Annual Conference of the IEEE Industrial Electronics Society.

[13]  Christina Delimitrou,et al.  Quasar: resource-efficient and QoS-aware cluster management , 2014, ASPLOS.

[14]  Xiao Liu,et al.  A statistical approach to participant selection in location-based social networks for offline event marketing , 2019, Inf. Sci..

[15]  Wei Liu,et al.  A Queuing Delay Utilization Scheme for On-Path Service Aggregation in Services-Oriented Computing Networks , 2019, IEEE Access.

[16]  Henry Hoffmann,et al.  MEANTIME: Achieving Both Minimal Energy and Timeliness with Approximate Computing , 2016, USENIX Annual Technical Conference.

[17]  Ke Cheng,et al.  CFHider: Control Flow Obfuscation with Intel SGX , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[18]  Henry Hoffmann,et al.  POET: a portable approach to minimizing energy under soft real-time constraints , 2015, 21st IEEE Real-Time and Embedded Technology and Applications Symposium.

[19]  Rajkumar Buyya,et al.  Fog Computing: Helping the Internet of Things Realize Its Potential , 2016, Computer.

[20]  Minglu Li,et al.  Target-oriented scheduling in directional sensor networks , 2007, IEEE INFOCOM 2007 - 26th IEEE International Conference on Computer Communications.

[21]  Guoqiang Peter Zhang,et al.  Time series forecasting using a hybrid ARIMA and neural network model , 2003, Neurocomputing.

[22]  Wanchun Dou,et al.  Dynamic Mobile Crowdsourcing Selection for Electricity Load Forecasting , 2018, IEEE Access.

[23]  Jeffrey O. Kephart,et al.  A robot as mobile sensor and agent in data center energy management , 2011, ICAC '11.

[24]  Jian Wan,et al.  Location-Aware Service Recommendation With Enhanced Probabilistic Matrix Factorization , 2018, IEEE Access.

[25]  Sudipta Sahana,et al.  An Energy Efficient Dynamic Schedule based Server Load Balancing Approach for Cloud Data Center , 2015 .

[26]  Xu Li,et al.  Performance Evaluation of Vehicle-Based Mobile Sensor Networks for Traffic Monitoring , 2009, IEEE Transactions on Vehicular Technology.

[27]  Yixin Chen,et al.  Towards Optimal Sensor Placement for Hot Server Detection in Data Centers , 2011, 2011 31st International Conference on Distributed Computing Systems.

[28]  Anil K. Jain Data clustering: 50 years beyond K-means , 2008, Pattern Recognit. Lett..

[29]  Lianyong Qi,et al.  Privacy-Aware Multidimensional Mobile Service Quality Prediction and Recommendation in Distributed Fog Environment , 2018, Wirel. Commun. Mob. Comput..

[30]  Weisong Shi,et al.  Energy Proportional Servers: Where Are We in 2016? , 2017, 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS).

[31]  Naixue Xiong,et al.  Interdomain I/O Optimization in Virtualized Sensor Networks , 2018, Sensors.

[32]  Lieven Eeckhout,et al.  Trends in Server Energy Proportionality , 2011, Computer.

[33]  Yumei Wang,et al.  Energy Aware Virtual Machine Scheduling in Data Centers , 2019, Energies.

[34]  Jürgen Schmidhuber,et al.  LSTM can Solve Hard Long Time Lag Problems , 1996, NIPS.

[35]  Xin Zhou,et al.  Toward Computation Offloading in Edge Computing: A Survey , 2019, IEEE Access.

[36]  Jianfeng Ma,et al.  Strongly Secure and Efficient Range Queries in Cloud Databases under Multiple Keys , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[37]  Henry Hoffmann,et al.  A Probabilistic Graphical Model-based Approach for Minimizing Energy Under Performance Constraints , 2015, ASPLOS.

[38]  F. Liu,et al.  Battery-Friendly Relay Selection Scheme for Prolonging the Lifetimes of Sensor Nodes in the Internet of Things , 2019, IEEE Access.

[39]  Congfeng Jiang,et al.  An Edge Computing Platform for Intelligent Internet Data Center Operational Monitoring , 2018, High-Performance Computing Applications in Numerical Simulation and Edge Computing.

[40]  Luiz André Barroso,et al.  The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines , 2009, The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines.

[41]  Xu Li,et al.  Performance Evaluation of SUVnet With Real-Time Traffic Data , 2007, IEEE Transactions on Vehicular Technology.

[42]  Byung-Gon Chun,et al.  CloneCloud: elastic execution between mobile device and cloud , 2011, EuroSys '11.

[43]  Jordi Guitart,et al.  A service framework for energy-aware monitoring and VM management in Clouds , 2013, Future Gener. Comput. Syst..

[44]  Yulong Shen,et al.  Transmission protocol for secure big data in two-hop wireless networks with cooperative jamming , 2014, Inf. Sci..

[45]  Richard E. Brown,et al.  United States Data Center Energy Usage Report , 2016 .

[46]  Xin Wang,et al.  Clipper: A Low-Latency Online Prediction Serving System , 2016, NSDI.

[47]  Bin Li,et al.  Dynamo: Facebook's Data Center-Wide Power Management System , 2016, 2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA).

[48]  Stefan A. Robila,et al.  Analyzing utilization rates in data centers for optimizing energy management , 2012, 2012 International Green Computing Conference (IGCC).

[49]  Xiaohong Jiang,et al.  Secure k-NN Query on Encrypted Cloud Data with Multiple Keys , 2017 .

[50]  Sebti Foufou,et al.  Towards bandwidth guaranteed energy efficient data center networking , 2015, Journal of Cloud Computing.

[51]  Xiaohong Jiang,et al.  Practical Verifiable Computation–A MapReduce Case Study , 2018, IEEE Transactions on Information Forensics and Security.

[52]  Raja Lavanya,et al.  Fog Computing and Its Role in the Internet of Things , 2019, Advances in Computer and Electrical Engineering.

[53]  Paramvir Bahl,et al.  The Case for VM-Based Cloudlets in Mobile Computing , 2009, IEEE Pervasive Computing.

[54]  Xiaohong Jiang,et al.  MtMR: Ensuring MapReduce Computation Integrity with Merkle Tree-Based Verifications , 2018, IEEE Transactions on Big Data.

[55]  Weisong Shi,et al.  Edge Computing: Vision and Challenges , 2016, IEEE Internet of Things Journal.