VNRE: Flexible and Efficient Acceleration for Network Redundancy Elimination

Network Redundancy Elimination (NRE) aims to improve network performance by identifying and removing repeated transmission of duplicate content from remote servers. Using a Content-Defined Chunking (CDC) policy, an inline NRE process can obtain a higher Redundancy Elimination (RE) ratio but may suffer from a considerably higher computational requirement than fixed-size chunking. Additionally, the existing work on NRE is either based on IP packet level redundancy elimination or rigidly adopting a CDC policy with a static empirically-decided expected chunk size. These approaches make it difficult for conventional NRE MiddleBoxes to achieve both high network throughput to match the increasing line speeds and a high RE ratio at the same time. In this paper we present a design and implementation of an inline NRE appliance which incorporates an improved FPGA-based scheme to speed up CDC processing to match the ever increasing network line speeds while simultaneously obtaining a high RE ratio. The overhead of Rabin fingerprinting, which is a key component of CDC, is greatly reduced through the use of a record table and registers in the FPGA. To efficiently utilize the hardware resources, the whole NRE process is handled by a Virtualized NRE (VNRE) controller. The uniqueness of this VNRE that we developed lies in its ability to exploit the redundancy patterns of different TCP flows and customize the chunking process to achieve a higher RE ratio. VNRE will first decide if the chunking policy should be either fixed-size chunking or CDC. Then VNRE decides the expected chunk size for the corresponding chunking policy based on the TCP flow patterns. Implemented in a partially reconfigurable FPGA card, our trace driven evaluation demonstrates that the chunking throughput for CDC in one FPGA processing unit outperforms chunking running in a virtual CPU by nearly 3X. Moreover, through the differentiation of chunking policies for each flow, the overall throughput of the VNRE appliance outperforms one with static NRE configurations by 6X to 57X while still guaranteeing a high RE ratio.

[1]  Haoyu Song,et al.  Efficient packet classification for network intrusion detection using FPGA , 2005, FPGA '05.

[2]  Vyas Sekar,et al.  SmartRE: an architecture for coordinated network-wide redundancy elimination , 2009, SIGCOMM '09.

[3]  Eric A. Brewer,et al.  Value-based web caching , 2003, WWW '03.

[4]  Youjip Won,et al.  Efficient Deduplication Techniques for Modern Backup Operation , 2011, IEEE Transactions on Computers.

[5]  Taesang Choi,et al.  CO-REDUCE: Collaborative Redundancy Reduction Service in Software-Defined Networks , 2015, HotMiddlebox '15.

[6]  David Wetherall,et al.  A protocol-independent technique for eliminating redundant network traffic , 2000, SIGCOMM 2000.

[7]  Ethan L. Miller,et al.  The effectiveness of deduplication on virtual machine disk images , 2009, SYSTOR '09.

[8]  Lixin Gao,et al.  Customizing virtual networks with partial FPGA reconfiguration , 2011, CCRV.

[9]  Aditya Akella,et al.  Redundancy in network traffic: findings and implications , 2009, SIGMETRICS '09.

[10]  Li Fan,et al.  Web caching and Zipf-like distributions: evidence and implications , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[11]  Lucas Chaufournier,et al.  CloudNet: Dynamic Pooling of Cloud Resources by Live WAN Migration of Virtual Machines , 2011, IEEE/ACM Transactions on Networking.

[12]  Yuping Zhao,et al.  OpenANFV: accelerating network function virtualization with a consolidated framework in openstack , 2015, SIGCOMM 2015.

[13]  Nick Feamster,et al.  Building a fast, virtualized data plane with programmable hardware , 2009, CCRV.

[14]  Peng Zhang,et al.  QoS constraints-based energy-efficient model in cloud computing networks for multimedia clinical issues , 2016, Multimedia Tools and Applications.

[15]  George Varghese,et al.  EndRE: An End-System Redundancy Elimination Service for Enterprises , 2010, NSDI.

[16]  David Wetherall,et al.  A protocol-independent technique for eliminating redundant network traffic , 2000, SIGCOMM.

[17]  Glen Gibb,et al.  NetFPGA: reusable router architecture for experimental research , 2008, PRESTO '08.

[18]  Israel Cidon,et al.  The power of prediction: cloud bandwidth and cost reduction , 2011, SIGCOMM.

[19]  Eunyoung Jeong,et al.  Comparison of caching strategies in modern cellular backhaul networks , 2013, MobiSys '13.

[20]  Carey L. Williamson,et al.  Enhancing redundant network traffic elimination , 2012, Comput. Networks.

[21]  Lixin Gao,et al.  Scalable network virtualization using FPGAs , 2010, FPGA '10.

[22]  Nick Feamster,et al.  SwitchBlade: a platform for rapid deployment of network protocols on programmable hardware , 2010, SIGCOMM '10.

[23]  Wenhui Zhao,et al.  An optimization-based robust routing algorithm to energy-efficient networks for cloud computing , 2015, Telecommunication Systems.

[24]  Udi Manber,et al.  Finding Similar Files in a Large File System , 1994, USENIX Winter.

[25]  Philip Shilane,et al.  WAN-optimized replication of backup datasets using stream-informed delta compression , 2012, TOS.