Scaling-up versus scaling-out networking in data centers: a comparative robustness analysis

The information and communication technology nowadays more than ever depends on the Internet and cloud computing, so that the data centers (DCs) have been converted to a constitutive unit of the cloud computing. A DC is composed of two primary parts: servers and Data Center Networks (DCNs). Robustness and scalability are two major challenges of the DCNs that are expanded based on two strategies, scale-out, and scale-up. This paper is distinctive from the related studies in two aspects. The first one is to simultaneously focus on both the scalability and the robustness challenges of the DCNs. For this purpose, we will concentrate on the comparison of robustness in the scalable models of these networks. The second one is, despite the previous work that only evaluated the DCN robustness under topological changes, we evaluated the robustness and fault tolerance against three types of unexpected changes in topology, traffic, and COI (community of interest) in the present work. Hence, we have chosen the network criticality (NC) as a graph-theoretic metric for analyzing DCN robustness. Afterward, we compare some structural and spectral graph metrics with NC among some well-known DCNs, and their scale-out and scale-up. Our results are useful to select the appropriate scaling strategy with the goal of maximizing the robustness of existing DCNs and provide a guideline for designing the new robust and scalable DCN.

[1]  Farshad Safaei,et al.  An efficient fault-tolerant routing algorithm in NoCs to tolerate permanent faults , 2016, The Journal of Supercomputing.

[2]  A. Leon-Garcia,et al.  Comparison of network criticality, algebraic connectivity, and other graph metrics , 2009, SIMPLEX '09.

[3]  Mark E. J. Newman A measure of betweenness centrality based on random walks , 2005, Soc. Networks.

[4]  Samee Ullah Khan,et al.  On the Connectivity of Data Center Networks , 2013, IEEE Communications Letters.

[5]  Rajiv Ranjan,et al.  Modeling and Simulation of Data Center Networks , 2015, Handbook on Data Centers.

[6]  Haibing Guan,et al.  A survey on data center networking for cloud computing , 2015, Comput. Networks.

[7]  Amin Vahdat,et al.  A scalable, commodity data center network architecture , 2008, SIGCOMM '08.

[8]  M. Fiedler Algebraic connectivity of graphs , 1973 .

[9]  Dong Lin,et al.  FlatNet: Towards a flatter data center network , 2012, 2012 IEEE Global Communications Conference (GLOBECOM).

[10]  Bran Selic,et al.  A survey of fault tolerance mechanisms and checkpoint/restart implementations for high performance computing systems , 2013, The Journal of Supercomputing.

[11]  Albert G. Greenberg,et al.  The cost of a cloud: research problems in data center networks , 2008, CCRV.

[12]  Abdul Hameed,et al.  Future Generation Computer Systems ( ) – Future Generation Computer Systems a Taxonomy and Survey on Green Data Center Networks Keywords: Data Center Data Center Networks Network Architectures Network Performance Network Management Network Experimentation , 2022 .

[13]  Lei Zhu,et al.  Optimizing the fault-tolerance overheads of HPC systems using prediction and multiple proactive actions , 2015, The Journal of Supercomputing.

[14]  Yonggang Wen,et al.  A Survey on Data Center Networking (DCN): Infrastructure and Operations , 2017, IEEE Communications Surveys & Tutorials.

[15]  Haitao Wu,et al.  FiConn: Using Backup Port for Server Interconnection in Data Centers , 2009, IEEE INFOCOM 2009.

[16]  Tao Chen,et al.  The features, hardware, and architectures of data center networks: A survey , 2016, J. Parallel Distributed Comput..

[17]  Lei Shi,et al.  Dcell: a scalable and fault-tolerant network structure for data centers , 2008, SIGCOMM '08.

[18]  Alberto Leon-Garcia,et al.  On Robust Traffic Engineering in Core Networks , 2008 .

[19]  Mounir Hamdi,et al.  SprintNet: A high performance server-centric network architecture for data centers , 2014, 2014 IEEE International Conference on Communications (ICC).

[20]  Yi Zhong,et al.  State-of-the-art research study for green cloud computing , 2011, The Journal of Supercomputing.

[21]  Amin Vahdat,et al.  PortLand: a scalable fault-tolerant layer 2 data center network fabric , 2009, SIGCOMM '09.

[22]  Robert E. Kooij,et al.  Graph measures and network robustness , 2013, ArXiv.

[23]  Leonard M. Freeman,et al.  A set of measures of centrality based upon betweenness , 1977 .

[24]  Gang Wang,et al.  SBEToolbox: A Matlab Toolbox for Biological Network Analysis , 2013, Evolutionary bioinformatics online.

[25]  Haitao Wu,et al.  BCube: a high performance, server-centric network architecture for modular data centers , 2009, SIGCOMM '09.

[26]  Almerima Jamakovic,et al.  Influence of the network structure on robustness , 2007, 2007 15th IEEE International Conference on Networks.

[27]  Ahmad Khonsari,et al.  HHS: an efficient network topology for large-scale data centers , 2015, The Journal of Supercomputing.

[28]  Guiran Chang,et al.  Analyzing, modeling and evaluating dynamic adaptive fault tolerance strategies in cloud computing environments , 2013, The Journal of Supercomputing.

[29]  M. Faloutsos The internet AS-level topology: three data sources and one definitive metric , 2006, CCRV.

[30]  Albert Y. Zomaya,et al.  On the Characterization of the Structural Robustness of Data Center Networks , 2013, IEEE Transactions on Cloud Computing.

[31]  Zhiyang Su,et al.  Rethinking the Data Center Networking: Architecture, Network Protocols, and Resource Sharing , 2014, IEEE Access.

[32]  Albert Y. Zomaya,et al.  Trends and challenges in cloud datacenters , 2014, IEEE Cloud Computing.

[33]  Ahmad Khonsari,et al.  A flexible and high-performance data center network topology , 2017, The Journal of Supercomputing.

[34]  Miguel Elias M. Campista,et al.  A reliability analysis of datacenter topologies , 2012, 2012 IEEE Global Communications Conference (GLOBECOM).

[35]  Jing Liu,et al.  A comparative study of network robustness measures , 2017, Frontiers of Computer Science.