FlowTracer: An Effective Flow Trajectory Detection Solution Based on Probabilistic Packet Tagging in SDN-Enabled Networks

Currently, parallel data transmissions in large-scale datacenter networks are becoming increasingly crucial to application performance. Despite fine-grained control by SDN-enabled networks, some transmission errors, such as misconfigurations, will inevitably occur, resulting in high-level forwarding policies that cannot be conformed to at the data plane. Therefore, flow trajectory detection is very important for allowing datacenter network operators to troubleshoot problems and ensure that all traffic flows are running on the correct paths. However, existing solutions detect flow trajectories by recording the entire path of each packet. These methods are prone to imposing significant overheads in terms of both the number of switch entries and the amount of packet header space required. To considerably reduce this overhead, we present FlowTracer, an efficient flow trajectory detection solution, which can sample a path one link at a time instead of recording the entire path. FlowTracer consists of a method of probabilistic packet tagging and a method of trajectory reconstruction. In this paper, we first introduce the method of probabilistic packet tagging, which is performed in OpenFlow-enabled switches with very few switch entries and limited packet header space by means of double VLAN tags. Then, we explore the topological structure of datacenter networks and propose our method of trajectory reconstruction, which is performed at end hosts and achieves rapid convergence. Finally, we evaluate FlowTracer on a 48-ary fat-tree topology. The results show that FlowTracer can detect trajectories quickly while placing far smaller demands on both switch entries and packet header space than state-of-the-art techniques.

[1]  Samuel T. King,et al.  Debugging the data plane with anteater , 2011, SIGCOMM 2011.

[2]  Qiang Xu,et al.  Enabling layer 2 pathlet tracing through context encoding in software-defined networking , 2014, HotSDN.

[3]  Antonio Capone,et al.  Detour planning for fast and reliable failure recovery in SDN with OpenState , 2014, 2015 11th International Conference on the Design of Reliable Communication Networks (DRCN).

[4]  Thierry Turletti,et al.  A Survey of Software-Defined Networking: Past, Present, and Future of Programmable Networks , 2014, IEEE Communications Surveys & Tutorials.

[5]  Petr Kuznetsov,et al.  In-Band Synchronization for Distributed SDN Control Planes , 2016, CCRV.

[6]  Ramana Rao Kompella,et al.  The TCP Outcast Problem: Exposing Unfairness in Data Center Networks , 2012, NSDI.

[7]  Junda Liu,et al.  Keep Forwarding: Towards k-link failure resilient routing , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[8]  Junda Liu,et al.  Libra: Divide and Conquer to Verify Forwarding Tables in Huge Networks , 2014, NSDI.

[9]  Shengru Li,et al.  Protocol Oblivious Forwarding (POF): Software-Defined Networking with Enhanced Programmability , 2017, IEEE Network.

[10]  Nick McKeown,et al.  OpenFlow: enabling innovation in campus networks , 2008, CCRV.

[11]  Nick McKeown,et al.  I Know What Your Packet Did Last Hop: Using Packet Histories to Troubleshoot Networks , 2014, NSDI.

[12]  Brighten Godfrey,et al.  VeriFlow: verifying network-wide invariants in real time , 2012, HotSDN '12.

[13]  Rong Pan,et al.  Let It Flow: Resilient Asymmetric Load Balancing with Flowlet Switching , 2017, NSDI.

[14]  Minlan Yu,et al.  DIBS: just-in-time congestion mitigation for data centers , 2014, EuroSys '14.

[15]  George Varghese,et al.  Automatic Test Packet Generation , 2012, IEEE/ACM Transactions on Networking.

[16]  Fung Po Tso,et al.  Longer Is Better: Exploiting Path Diversity in Data Center Networks , 2013, 2013 IEEE 33rd International Conference on Distributed Computing Systems.

[17]  David Walker,et al.  Infinite CacheFlow in software-defined networks , 2014, HotSDN.

[18]  Brian O'Connor,et al.  A Mininet-based Virtual Testbed for Distributed SDN Development , 2015, Computer communication review.

[19]  M. Hofri,et al.  The coupon-collector problem revisited — a survey of engineering problems and computational methods , 1997 .

[20]  David Walker,et al.  Compiling path queries in software-defined networks , 2014, HotSDN.

[21]  Nick McKeown,et al.  Leveraging SDN layering to systematically troubleshoot networks , 2013, HotSDN '13.

[22]  George Varghese,et al.  New directions in traffic measurement and accounting: Focusing on the elephants, ignoring the mice , 2003, TOCS.

[23]  Myungjin Lee,et al.  CherryPick: tracing packet trajectory in software-defined datacenter networks , 2015, SOSR.

[24]  Ramana Rao Kompella,et al.  On the impact of packet spraying in data center networks , 2013, 2013 Proceedings IEEE INFOCOM.

[25]  Paulo César da Rocha Fonseca,et al.  A Survey on Fault Management in Software-Defined Networks , 2017, IEEE Communications Surveys & Tutorials.

[26]  Teemu Koponen,et al.  Flow caching for high entropy packet fields , 2015, SIGCOMM 2015.

[27]  Chen-Nee Chuah,et al.  Fast Local Rerouting for Handling Transient Link Failures , 2007, IEEE/ACM Transactions on Networking.

[28]  David A. Maltz,et al.  Network traffic characteristics of data centers in the wild , 2010, IMC '10.

[29]  John B. Carter,et al.  SDN traceroute: tracing SDN forwarding without changing network behavior , 2014, HotSDN.

[30]  George Varghese,et al.  P4: programming protocol-independent packet processors , 2013, CCRV.

[31]  Amin Vahdat,et al.  A scalable, commodity data center network architecture , 2008, SIGCOMM '08.

[32]  Myungjin Lee,et al.  Simplifying Datacenter Network Debugging with PathDump , 2016, OSDI.

[33]  Peng Liu,et al.  Elastic sketch: adaptive and fast network-wide measurements , 2018, SIGCOMM.

[34]  Mark Handley,et al.  Improving datacenter performance and robustness with multipath TCP , 2011, SIGCOMM 2011.

[35]  Martín Casado,et al.  The Design and Implementation of Open vSwitch , 2015, NSDI.

[36]  Katherine Barabash,et al.  NoEncap: overlay network virtualization with no encapsulation overheads , 2015, SOSR.

[37]  Jim Esch,et al.  Software-Defined Networking: A Comprehensive Survey , 2015, Proc. IEEE.

[38]  Praveen Yalagandula,et al.  Mahout: Low-overhead datacenter traffic management using end-host-based elephant detection , 2011, 2011 Proceedings IEEE INFOCOM.

[39]  David Walker,et al.  Abstractions for network update , 2012, SIGCOMM '12.

[40]  Hua Chen,et al.  Pingmesh: A Large-Scale System for Data Center Network Latency Measurement and Analysis , 2015, SIGCOMM.

[41]  Ming Zhang,et al.  Understanding data center traffic characteristics , 2010, CCRV.

[42]  Minlan Yu,et al.  FlowTags: enforcing network-wide policies in the presence of dynamic middlebox actions , 2013, HotSDN '13.

[43]  Randy H. Katz,et al.  X-Trace: A Pervasive Network Tracing Framework , 2007, NSDI.

[44]  Amin Vahdat,et al.  Hedera: Dynamic Flow Scheduling for Data Center Networks , 2010, NSDI.

[45]  Byrav Ramamurthy,et al.  Network Innovation using OpenFlow: A Survey , 2014, IEEE Communications Surveys & Tutorials.