Power-Aware Allocation of Graph Jobs in Geo-Distributed Cloud Networks

In the era of big-data, the jobs submitted to the clouds exhibit complicated structures represented by graphs, where the nodes denote the sub-tasks each of which can be accommodated at a slot in a server, while the edges indicate the communication constraints among the sub-tasks. We develop a framework for efficient allocation of graph jobs in geo-distributed cloud networks (GDCNs), explicitly considering the power consumption of the datacenters (DCs). We address the following two challenges arising in graph job allocation: i) the allocation problem belongs to NP-hard nonlinear integer programming; ii) the allocation requires solving the NP-complete sub-graph isomorphism problem, which is particularly cumbersome in large-scale GDCNs. We develop a suite of efficient solutions for GDCNs of various scales. For small-scale GDCNs, we propose an analytical approach based on convex programming. For medium-scale GDCNs, we develop a distributed allocation algorithm exploiting the processing power of DCs in parallel. Afterward, we provide a novel low-complexity (decentralized) sub-graph extraction method, based on which we introduce cloud crawlers aiming to extract allocations of good potentials for large-scale GDCNs. Given these suggested strategies, we further investigate strategy selection under both fixed and adaptive DC pricing schemes, and propose an online learning algorithm for each.

[1]  Shaolei Ren,et al.  Provably-Efficient Job Scheduling for Energy and Fairness in Geographically Distributed Data Centers , 2012, 2012 IEEE 32nd International Conference on Distributed Computing Systems.

[2]  Dusit Niyato,et al.  A Framework for Cooperative Resource Management in Mobile Cloud Computing , 2013, IEEE Journal on Selected Areas in Communications.

[3]  Baochun Li,et al.  Joint request mapping and response routing for geo-distributed cloud services , 2013, 2013 Proceedings IEEE INFOCOM.

[4]  R. Srikant,et al.  Scheduling Storms and Streams in the Cloud , 2015, SIGMETRICS.

[5]  Liang-Jie Zhang,et al.  Editorial: Big Services Era: Global Trends of Cloud Computing and Big Data , 2012 .

[6]  Burkhard Stiller,et al.  A Survey of the State-of-the-Art in Fair Multi-Resource Allocations for Data Centers , 2018, IEEE Transactions on Network and Service Management.

[7]  N. B. Anuar,et al.  The rise of "big data" on cloud computing: Review and open research issues , 2015, Inf. Syst..

[8]  Ronald L. Rivest,et al.  Introduction to Algorithms, 3rd Edition , 2009 .

[9]  A. Barabasi,et al.  Scale-free characteristics of random networks: the topology of the world-wide web , 2000 .

[10]  Bingsheng He,et al.  Efficient Process Mapping in Geo-Distributed Cloud Data Centers , 2017, SC17: International Conference for High Performance Computing, Networking, Storage and Analysis.

[11]  S. Hart,et al.  A simple adaptive procedure leading to correlated equilibrium , 2000 .

[12]  Athanasios V. Vasilakos,et al.  IoT-Based Big Data Storage Systems in Cloud Computing: Perspectives and Challenges , 2017, IEEE Internet of Things Journal.

[13]  Xuejie Zhang,et al.  Swarm optimization algorithms applied to multi-resource fair allocation in heterogeneous cloud computing systems , 2017, Computing.

[14]  Kwang Mong Sim,et al.  Agent-Based Interactions and Economic Encounters in an Intelligent InterCloud , 2015, IEEE Transactions on Cloud Computing.

[15]  Javad Ghaderi,et al.  On Non-Preemptive VM Scheduling in the Cloud , 2017, Abstracts of the 2018 ACM International Conference on Measurement and Modeling of Computer Systems.

[16]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[17]  Yuan Yao,et al.  Data centers power reduction: A two time scale approach for delay tolerant workloads , 2012, 2012 Proceedings IEEE INFOCOM.

[18]  Wei Wang,et al.  Multi-Resource Fair Allocation in Heterogeneous Cloud Computing Systems , 2015, IEEE Transactions on Parallel and Distributed Systems.

[19]  Xin-She Yang,et al.  Introduction to Algorithms , 2021, Nature-Inspired Optimization Algorithms.

[20]  Li Shi,et al.  Energy-Aware Scheduling of Embarrassingly Parallel Jobs and Resource Allocation in Cloud , 2017, IEEE Transactions on Parallel and Distributed Systems.

[21]  Georgios B. Giannakis,et al.  DGLB: Distributed Stochastic Geographical Load Balancing over Cloud Networks , 2017, IEEE Transactions on Parallel and Distributed Systems.

[22]  Xuejie Zhang,et al.  A load balancing mechanism based on ant colony and complex network theory in open cloud computing federation , 2010, 2010 The 2nd International Conference on Industrial Mechatronics and Automation.

[23]  Abdul Majid Mazlina,et al.  Big Data Processing in Cloud Computing Environments , 2017 .

[24]  Sebastiano Vigna,et al.  UbiCrawler: a scalable fully distributed Web crawler , 2004, Softw. Pract. Exp..

[25]  Zhenlong Li,et al.  Big Data and cloud computing: innovation opportunities and challenges , 2017, Int. J. Digit. Earth.

[26]  Huaiyu Dai,et al.  Options-based sequential auctions for dynamic cloud resource allocation , 2017, 2017 IEEE International Conference on Communications (ICC).

[27]  Jon Louis Bentley,et al.  Multidimensional binary search trees used for associative searching , 1975, CACM.

[28]  Peter Auer,et al.  The Nonstochastic Multiarmed Bandit Problem , 2002, SIAM J. Comput..

[29]  Huaiyu Dai,et al.  A Two-Stage Auction Mechanism for Cloud Resource Allocation , 2018, IEEE Transactions on Cloud Computing.

[30]  László Gyarmati,et al.  Scafida: a scale-free network inspired data center architecture , 2010, CCRV.

[31]  Zongpeng Li,et al.  Dynamic pricing and profit maximization for the cloud with geo-distributed data centers , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[32]  Divyakant Agrawal,et al.  Big data and cloud computing: current state and future opportunities , 2011, EDBT/ICDT '11.

[33]  Derek G. Corneil,et al.  The graph isomorphism disease , 1977, J. Graph Theory.

[34]  Marc Najork,et al.  Mercator: A scalable, extensible Web crawler , 1999, World Wide Web.

[35]  Karl Henrik Johansson,et al.  Subgradient methods and consensus algorithms for solving convex optimization problems , 2008, 2008 47th IEEE Conference on Decision and Control.

[36]  Yonggang Wen,et al.  Data Center Energy Consumption Modeling: A Survey , 2016, IEEE Communications Surveys & Tutorials.

[37]  Ran Zhang,et al.  Reactive Pricing: An Adaptive Pricing Policy for Cloud Providers to Maximize Profit , 2016, IEEE Transactions on Network and Service Management.

[38]  Joseph M. Hellerstein,et al.  MapReduce Online , 2010, NSDI.

[39]  Hector Garcia-Molina,et al.  Efficient Crawling Through URL Ordering , 1998, Comput. Networks.

[40]  Longbo Huang,et al.  A Comment on “Power Cost Reduction in Distributed Data Centers: A Two Time Scale Approach for Delay Tolerant Workloads” , 2015, IEEE Transactions on Parallel and Distributed Systems.