JITeR: Just-in-time application-layer routing

Abstract: This paper addresses the problem of providing message latency and reliability assurances for control traffic in wide-area IP networks. This is an important problem for cloud services and other geo-distributed information infrastructures that require real-time communication between datacenters. We present the design and validation of JITeR (Just-In-Time Routing), an algorithm that routes messages in a timely manner at the application layer using overlay networking and multihoming, leveraging the natural redundancy of wide-area IP networks. We implemented a prototype of JITeR and evaluated it experimentally by placing nodes in several regions of Amazon EC2. We also present a scenario-based evaluation (a geo-distributed utility network) comparing JITeR with alternative overlay/multihoming routing algorithms, which shows that JITeR provides better timeliness and reliability guarantees.
