Quality of Service (QoS)-driven resource provisioning for large-scale graph processing in cloud computing environments: Graph Processing-as-a-Service (GPaaS)

Abstract Large-scale graph data is being generated every day through applications and services such as social networks, Internet of Things (IoT) and mobile applications. Traditional processing approaches such as MapReduce are inefficient for processing graph datasets. To overcome this limitation, several exclusive graph processing frameworks have been developed since 2010. However, despite broad accessibility of cloud computing paradigm and its useful features namely as elasticity and pay-as-you-go pricing model, most frameworks are designed for high performance computing infrastructure (HPC). There are few graph processing systems that are developed for cloud environments but similar to their other counterparts, they also try to improve the performance by implementing new computation or communication techniques. In this paper, for the first time, we introduce the large-scale graph processing-as-a-service (GPaaS). GPaaS considers service level agreement (SLA) requirements and quality of service (QoS) for provisioning appropriate combination of resources in order to minimize the monetary cost of the operation. It also reduces the execution time compared to other graph processing frameworks such as Giraph up to 10%–15%. We show that our service significantly reduces the monetary cost by more than 40% compared to Giraph or other frameworks such as PowerGraph.

[1]  Tao Zhang,et al.  Efficient Graph Mining on Heterogeneous Platforms in the Cloud , 2016, CloudComp.

[2]  Haixun Wang,et al.  Trinity: a distributed graph engine on a memory cloud , 2013, SIGMOD '13.

[3]  Pankesh Patel,et al.  Service Level Agreement in Cloud Computing , 2009 .

[4]  Chongchong Xu,et al.  Evaluation and Trade-offs of Graph Processing for Cloud Services , 2017, 2017 IEEE International Conference on Web Services (ICWS).

[5]  Kenli Li,et al.  A Profit Maximization Scheme with Guaranteed Quality of Service in Cloud Computing , 2015, IEEE Transactions on Computers.

[6]  Rajkumar Buyya,et al.  Next generation cloud computing: New trends and research directions , 2017, Future Gener. Comput. Syst..

[7]  Rajkumar Buyya,et al.  iGiraph: A Cost-Efficient Framework for Processing Large-Scale Graphs on Public Clouds , 2016, 2016 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid).

[8]  Avery Ching,et al.  One Trillion Edges: Graph Processing at Facebook-Scale , 2015, Proc. VLDB Endow..

[9]  Pritam Roy,et al.  A new memetic algorithm with GA crossover technique to solve Single Source Shortest Path (SSSP) problem , 2014, 2014 Annual IEEE India Conference (INDICON).

[10]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[11]  Panos Kalnis,et al.  Mizan: a system for dynamic load balancing in large-scale graph processing , 2013, EuroSys '13.

[12]  Yogesh L. Simmhan,et al.  Scalable Graph Processing Frameworks , 2018, ACM Comput. Surv..

[13]  Paul D. Manuel,et al.  A trust model of cloud computing based on Quality of Service , 2015, Ann. Oper. Res..

[14]  Luke M. Leslie,et al.  Supporting On-demand Elasticity in Distributed Graph Processing , 2016, 2016 IEEE International Conference on Cloud Engineering (IC2E).

[15]  Willy Zwaenepoel,et al.  X-Stream: edge-centric graph processing using streaming partitions , 2013, SOSP.

[16]  Danilo Ardagna,et al.  Quality-of-service in cloud computing: modeling techniques and their applications , 2014, Journal of Internet Services and Applications.

[17]  Leslie G. Valiant,et al.  A bridging model for parallel computation , 1990, CACM.

[18]  Du Wan Cheun,et al.  A Quality Model for Evaluating Software-as-a-Service in Cloud Computing , 2009, 2009 Seventh ACIS International Conference on Software Engineering Research, Management and Applications.

[19]  Jeffrey D. Ullman,et al.  Vision Paper: Towards an Understanding of the Limits of Map-Reduce Computation , 2012, ArXiv.

[20]  Zengxiang Li,et al.  Performance and Monetary Cost of Large-Scale Distributed Graph Processing on Amazon Cloud , 2016, 2016 International Conference on Cloud Computing Research and Innovations (ICCCRI).

[21]  Rajkumar Buyya,et al.  Article in Press Future Generation Computer Systems ( ) – Future Generation Computer Systems Cloud Computing and Emerging It Platforms: Vision, Hype, and Reality for Delivering Computing as the 5th Utility , 2022 .

[22]  Harry G. Perros,et al.  Service Performance and Analysis in Cloud Computing , 2009, 2009 Congress on Services - I.

[23]  Yogesh L. Simmhan,et al.  Optimizations and Analysis of BSP Graph Processing Models on Public Clouds , 2013, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing.

[24]  Jérôme Kunegis,et al.  KONECT: the Koblenz network collection , 2013, WWW.

[25]  Jinha Kim,et al.  TurboGraph: a fast parallel graph engine handling billion-scale graphs in a single PC , 2013, KDD.

[26]  Vipin Kumar,et al.  Multilevel Graph Partitioning Schemes , 1995, ICPP.

[27]  Aart J. C. Bik,et al.  Pregel: a system for large-scale graph processing , 2010, SIGMOD Conference.

[28]  Indranil Gupta,et al.  LFGraph: simple and fast distributed graph analytics , 2013, TRIOS@SOSP.

[29]  Guy E. Blelloch,et al.  GraphChi: Large-Scale Graph Computation on Just a PC , 2012, OSDI.

[30]  George Pallis,et al.  Cloud Computing: The New Frontier of Internet Computing , 2010, IEEE Internet Computing.

[31]  Jennifer Widom,et al.  GPS: a graph processing system , 2013, SSDBM.

[32]  Joseph Gonzalez,et al.  PowerGraph: Distributed Graph-Parallel Computation on Natural Graphs , 2012, OSDI.

[33]  Johannes Gehrke,et al.  Asynchronous Large-Scale Graph Processing Made Easy , 2013, CIDR.

[34]  Amine Mhedhbi,et al.  The Ubiquity of Large Graphs and Surprising Challenges of Graph Processing , 2017 .

[35]  Murat Demirbas,et al.  Giraphx: Parallel Yet Serializable Large-Scale Graph Processing , 2013, Euro-Par.

[36]  Rajkumar Buyya,et al.  A Cost-Efficient Auto-Scaling Algorithm for Large-Scale Graph Processing in Cloud Environments with Heterogeneous Resources , 2021, IEEE Transactions on Software Engineering.

[37]  Huadong Dai,et al.  Challenges in large-graph processing: A vision , 2016, 2016 5th International Conference on Computer Science and Network Technology (ICCSNT).

[38]  Bingsheng He,et al.  Large graph processing in the cloud , 2010, SIGMOD Conference.

[39]  Rajkumar Buyya,et al.  Cost‐efficient and network‐aware dynamic repartitioning‐based algorithms for scheduling large‐scale graphs in cloud computing environments , 2018, Softw. Pract. Exp..