SDN-aware federation of distributed data

The introduction of software defined networking (SDN) has created an opportunity for file access services to get a view of the underlying network and to further optimize large data transfers. This opportunity is still unexplored while the amount of data that needs to be transferred is growing. Data transfers are also becoming more frequent as a result of interdisciplinary collaborations and the nature of research infrastructures. To address the needs for larger and more frequent data transfers, we propose an approach which enables file access services to use SDN. We extend the file access services developed in our earlier work by including network resources in the provisioning for large data transfers. A novel SDN-aware file transfer mechanism is prototyped for improving the performance and reliability of large data transfers on research infrastructure equipped with programmable network switches. Our results show that I/O and data-intensive scientific workflows benefit from SDN-aware file access services. Software defined networking (SDN) has created unprecedented opportunities for efficient data transfers.We present an observable and controllable network model to approach data transfers from multiple sources.We develop and implement two algorithms that take advantage of the programmability of the network.We introduce an architecture for transparent & adaptive data transfers and enable full exploitation of research infrastructures.We assess the improvement of data transfer rates resulting from SDN-enabling Distributed File Access Services (DFAS).

[1]  Aniruddha S. Gokhale,et al.  Software-Defined Networking: Challenges and research opportunities for Future Internet , 2014, Comput. Networks.

[2]  Victor I. Chang,et al.  Towards a Big Data system disaster recovery in a Private Cloud , 2015, Ad Hoc Networks.

[3]  Fernando M. V. Ramos,et al.  On the Feasibility of a Consistent and Fault-Tolerant Data Store for SDNs , 2013, 2013 Second European Workshop on Software Defined Networks.

[4]  Nagiza F. Samatova,et al.  Theory-Guided Data Science for Climate Change , 2014, Computer.

[5]  Cees T. A. M. de Laat,et al.  A user-centric execution environment for CineGrid workloads , 2015, Future Gener. Comput. Syst..

[6]  Stavros Konstantaras,et al.  PIRE ExoGENI - ENVRI Preparation for Big Data Science , 2014 .

[7]  Cees T. A. M. de Laat,et al.  Network Resource Control for Data Intensive Applications in Heterogeneous Infrastructures , 2012, 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum.

[8]  Xin Yang,et al.  Palantir: Reseizing Network Proximity in Large-Scale Distributed Computing Frameworks Using SDN , 2014, 2014 IEEE 7th International Conference on Cloud Computing.

[9]  Jim Esch,et al.  Software-Defined Networking: A Comprehensive Survey , 2015, Proc. IEEE.

[10]  Gilles Fedak,et al.  Active Data: A programming model to manage data life cycle across heterogeneous systems and infrastructures , 2015, Future Gener. Comput. Syst..

[11]  Ewa Deelman,et al.  Peer-to-Peer Data Sharing for Scientific Workflows on Amazon EC2 , 2012, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis.

[12]  Victor I. Chang,et al.  Corrigendum to "The business intelligence as a service in the cloud" [Future Gener. Comput. Systems 37C (2014) 512-534] , 2014, Future Gener. Comput. Syst..

[13]  Phuoc Tran-Gia,et al.  SDN-Based Application-Aware Networking on the Example of YouTube Video Streaming , 2013, 2013 Second European Workshop on Software Defined Networks.

[14]  Shawn McKee,et al.  TeraPaths: End-to-End Network Path QoS Configuration Using Cross-Domain Reservation Negotiation , 2006, 2006 3rd International Conference on Broadband Communications, Networks and Systems.

[15]  Saverio Mascolo,et al.  A Control Architecture for Massive Adaptive Video Streaming Delivery , 2014, VideoNext '14.

[16]  Daniel S. Katz,et al.  Montage: a grid portal and software toolkit for science-grade astronomical image mosaicking , 2009, Int. J. Comput. Sci. Eng..

[17]  Marian Bubak,et al.  A Cloud-Based Framework for Collaborative Data Management in the VPH-Share Project , 2013, 2013 27th International Conference on Advanced Information Networking and Applications Workshops.

[18]  Hamid Farhadi,et al.  Software-Defined Networking: A survey , 2015, Comput. Networks.

[19]  Rajiv Ranjan,et al.  G-Hadoop: MapReduce across distributed data centers for data-intensive computing , 2013, Future Gener. Comput. Syst..

[20]  Martín Casado,et al.  Extending Networking into the Virtualization Layer , 2009, HotNets.

[21]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[22]  Panagiotis Georgopoulos,et al.  Towards network-wide QoE fairness using openflow-assisted adaptive video streaming , 2013, FhMN@SIGCOMM.

[23]  Michael Barbehenn,et al.  A Note on the Complexity of Dijkstra's Algorithm for Graphs with Weighted Vertices , 1998, IEEE Trans. Computers.

[24]  Shiping Chen,et al.  A platform for secure monitoring and sharing of generic health data in the Cloud , 2014, Future Gener. Comput. Syst..

[25]  Ivan Janciak,et al.  Modeling and optimizing large-scale data flows , 2014, Future Gener. Comput. Syst..

[26]  Jeffrey S. Chase,et al.  ExoGENI: A Multi-Domain Infrastructure-as-a-Service Testbed , 2012, The GENI Book.

[27]  Mengxia Zhu,et al.  Concurrent Bandwidth Reservation Strategies for Big Data Transfers in High-Performance Networks , 2015, IEEE Transactions on Network and Service Management.

[28]  James M. Lucas,et al.  Exponentially weighted moving average control schemes: Properties and enhancements , 1990 .

[29]  K. M. Annervaz,et al.  Multi-site data distribution for disaster recovery - A planning framework , 2014, Future Gener. Comput. Syst..

[30]  Cees T. A. M. de Laat,et al.  Using ontologies for resource description in the CineGrid Exchange , 2011, Future Gener. Comput. Syst..

[31]  Maarten Litmaath The Storage Resource Manager Interface Specification Version 2.2 , 2013 .

[32]  Marian Bubak,et al.  Cloud Data Federation for Scientific Applications , 2013, Euro-Par Workshops.

[33]  Serge Fdida,et al.  LERU Roadmap for Research Data , 2013 .

[34]  Meral Shirazipour,et al.  OpenFlow and Multi-layer Extensions: Overview and Next Steps , 2012, 2012 European Workshop on Software Defined Networking.

[35]  Jiangchuan Liu,et al.  Statistics and Social Network of YouTube Videos , 2008, 2008 16th Interntional Workshop on Quality of Service.

[36]  Dario Barberis,et al.  The ATLAS Computing Model , 2010 .

[37]  Gabriel Antoniu,et al.  JetStream: Enabling high throughput live event streaming on multi-site clouds , 2016, Future Gener. Comput. Syst..

[38]  Victor I. Chang,et al.  The Business Intelligence as a Service in the Cloud , 2014, Future Gener. Comput. Syst..

[39]  Arie Shoshani,et al.  StorNet: Co-scheduling of end-to-end bandwidth reservation on storage and network systems for high-performance data transfers , 2011, 2011 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[40]  Kazuya Suzuki,et al.  A Design and Implementation of OpenFlow Controller Handling IP Multicast with Fast Tree Switching , 2012, 2012 IEEE/IPSJ 12th International Symposium on Applications and the Internet.

[41]  Li Shi,et al.  Design and implementation of an intelligent end-to-end network QoS system , 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.

[42]  Matthias S. Müller,et al.  Performance and quality of service of data and video movement over a 100 Gbps testbed , 2013, Future Gener. Comput. Syst..