Infrastructure of data centers for transferring big data traffic: A survey research

Big data make drastic changes in patterns of data center traffic. In traditional applications, which are based on client-server architectures, data center traffic are in northern-southern mode. While, as applications of big data developed, so-called eastern-western traffic patterns were widely used in data center networks. Moreover, applications of cloud computing technology in processing infrastructures and data center storage have made dynamic network traffic in such centers. Large volume of eastern-western data, changes in traffic patterns in a dynamic manner, and immediate nature are among importance characteristics of application of big data in data center networks that have posed several challenges to traditional architecture and data center technologies. This is, therefore, essential to make drastic changes in network infrastructure of data centers to execute several applications of big data. In this article, attempts are made to focus on network issues and requirements for big data, addressing recent methods and technologies in order to solve network problems. We, in addition, analyzed application and impact of modern technologies such as software-defined networks in applications relevant to big data.

[1]  Jong-Suk Ruth Lee,et al.  Study on network architecture of big data center for the efficient control of huge data traffic , 2014, Comput. Sci. Inf. Syst..

[2]  Hai Jin,et al.  Building a network highway for big data: architecture and challenges , 2014, IEEE Network.

[3]  Anees Shaikh,et al.  Programming your network at run-time for big data applications , 2012, HotSDN '12.

[4]  Amin Vahdat,et al.  Hedera: Dynamic Flow Scheduling for Data Center Networks , 2010, NSDI.

[5]  Yi Lu,et al.  ElephantTrap: A low cost device for identifying large flows , 2007, 15th Annual IEEE Symposium on High-Performance Interconnects (HOTI 2007).

[6]  Mona M At T I A N,et al.  Applying big-data technologies to network architecture , 2012 .

[7]  Yves Deville,et al.  Traffic engineering for multiple spanning tree protocol in large data centers , 2011, 2011 23rd International Teletraffic Congress (ITC).

[8]  Sujata Banerjee,et al.  ElasticTree: Saving Energy in Data Center Networks , 2010, NSDI.

[9]  Anupam Das,et al.  Transparent and Flexible Network Management for Big Data Processing in the Cloud , 2013, HotCloud.

[10]  A. Tajima,et al.  A 2.56-Tb/s multiwavelength and scalable switch-fabric for fast packet-switching networks , 1998, IEEE Photonics Technology Letters.

[11]  Tom White,et al.  Hadoop: The Definitive Guide , 2009 .

[12]  Radia Perlman,et al.  An algorithm for distributed computation of a spanningtree in an extended LAN , 1985, SIGCOMM '85.

[13]  Chita R. Das,et al.  Design of a Dynamic Priority-Based Fast Path Architecture for On-Chip Interconnects , 2007 .

[14]  Sampath Rangarajan,et al.  MuVi: a multicast video delivery scheme for 4g cellular networks , 2012, Mobicom '12.

[15]  Radia J. Perlman,et al.  An algorithm for distributed computation of a spanningtree in an extended LAN , 1985, SIGCOMM '85.

[16]  Feng Wu,et al.  ParCast: soft video delivery in MIMO-OFDM WLANs , 2012, Mobicom '12.

[17]  Kang Xi,et al.  Enabling flow-based routing control in data center networks using Probe and ECMP , 2011, 2011 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[18]  Hairong Kuang,et al.  The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).

[19]  Jim Esch,et al.  Software-Defined Networking: A Comprehensive Survey , 2015, Proc. IEEE.

[20]  Yin Xu,et al.  mPath: High-Bandwidth Data Transfers with Massively Multipath Source Routing , 2013, IEEE Transactions on Parallel and Distributed Systems.

[21]  Praveen Yalagandula,et al.  Mahout: Low-overhead datacenter traffic management using end-host-based elephant detection , 2011, 2011 Proceedings IEEE INFOCOM.