BSP-Based Strongly Connected Component Algorithm in Joint Cloud Computing

There is a trend that many applications are deployed among multiple public or private clouds. Due to the data asset protection and network communication cost, it is not feasible to gather all the data from different clouds to a single cloud. Inter-cloud graph data mining is becoming a new challenge. The strongly connected component algorithm is a basic graph algorithm that plays an important role in many important areas such as social network analysis, web search and even biomedical areas. However, only few distributed processing frameworks provide this algorithm, which is currently unavailable in joint cloud computing frameworks. To this end, this paper proposes a BSP (Bulk Synchronous Parallel) service over joint cloud computing and a BSP-based strongly connected component algorithm which can be easily realized on any distributed platform as long as it provides BSP service. Many graph algorithms can be easily developed in the cross-cloud computation environment by using this BSP service. Moreover, this BSP-based strongly connected component algorithm not only fills the gaps in the domain of distributed graph processing frameworks, but also extends its scalability to the level of cross-cloud computing by using the BSP service.

[1]  Aart J. C. Bik,et al.  Pregel: a system for large-scale graph processing , 2010, SIGMOD Conference.

[2]  Joseph M. Hellerstein,et al.  Distributed GraphLab: A Framework for Machine Learning in the Cloud , 2012, Proc. VLDB Endow..

[3]  Joseph Gonzalez,et al.  PowerGraph: Distributed Graph-Parallel Computation on Natural Graphs , 2012, OSDI.

[4]  Yong Zhang,et al.  A Scalable lnternet-of-Vehicles Service over Joint Clouds , 2018, 2018 IEEE Symposium on Service-Oriented System Engineering (SOSE).

[5]  Leslie G. Valiant,et al.  A bridging model for parallel computation , 1990, CACM.

[6]  Huaimin Wang,et al.  Growing construction and adaptive evolution of complex software systems , 2016, Science China Information Sciences.

[7]  Vladimir Vlassov,et al.  High-Level Programming Abstractions for Distributed Graph Processing , 2016, IEEE Transactions on Knowledge and Data Engineering.

[8]  Huaimin Wang,et al.  JointCloud: A Cross-Cloud Cooperation Architecture for Integrated Internet Service Customization , 2017, 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS).

[9]  Avery Ching,et al.  One Trillion Edges: Graph Processing at Facebook-Scale , 2015, Proc. VLDB Endow..

[10]  Reynold Xin,et al.  GraphX: Graph Processing in a Distributed Dataflow Framework , 2014, OSDI.