SyncProbe: Providing Assurance of Message Latency through Predictive Monitoring of Internet Paths

SyncProbe improves the end-to-end predictability of distributed systems by providing applications with a real-time estimate of the maximum expected message delay (upper bound on communication latency) for network paths. The upper bound is adjusted over time in response to the monitored network latency and serves as a real-time assurance of synchrony. We deployed SyncProbe on PlanetLab and assessed its performance with respect to violations, duration of synchrony, upper bound cost and recoverability. Experiments revealed that SyncProbe is successful in providing upper bound estimate in real-time for a variety of paths. The estimated upper bound can be utilized for more formal assessment of information related to timeouts, ordering of events or knowledge about global states. We describe the design and methodology of SyncProbe and discuss various issues related to its performance.

[1]  Michele C. Weigle,et al.  Delay-based early congestion detection and adaptation in TCP: impact on web performance , 2005, Comput. Commun..

[2]  Sujata Banerjee,et al.  S3: a scalable sensing service for monitoring large networked systems , 2006, INM '06.

[3]  Paul Francis,et al.  IDMaps: a global internet host distance estimation service , 2001, TNET.

[4]  Larry L. Peterson,et al.  Using PlanetLab for network research: myths, realities, and best practices , 2005, OPSR.

[5]  Emin Gün Sirer,et al.  Meridian: a lightweight network location service without virtual coordinates , 2005, SIGCOMM '05.

[6]  Darryl Veitch,et al.  A Precision Infrastructure for Active Probing , 2001 .

[7]  Rachid Guerraoui,et al.  Indulgent algorithms (preliminary version) , 2000, PODC '00.

[8]  David L. Mills,et al.  Internet time synchronization: the network time protocol , 1991, IEEE Trans. Commun..

[9]  Flaviu Cristian,et al.  Fail-aware datagram service , 1999, IEE Proc. Softw..

[10]  Nancy A. Lynch,et al.  Consensus in the presence of partial synchrony , 1988, JACM.

[11]  Robert Tappan Morris,et al.  Vivaldi: a decentralized network coordinate system , 2004, SIGCOMM '04.

[12]  Flaviu Cristian,et al.  The Timed Asynchronous Distributed System Model , 1999, IEEE Trans. Parallel Distributed Syst..

[13]  Arun Venkataramani,et al.  iPlane: an information plane for distributed services , 2006, OSDI '06.

[14]  Hari Balakrishnan,et al.  Resilient overlay networks , 2001, SOSP.

[15]  Jon Postel,et al.  Transmission Control Protocol , 1981, RFC.

[16]  D. Veitch,et al.  A User Friendly TSC Clock , 2006 .

[17]  Brian D. Noble,et al.  Mobile network estimation , 2001, MobiCom '01.

[18]  Monica Brockmeyer,et al.  PSON: predictable service overlay networks , 2007, QSHINE.

[19]  Hui Zhang,et al.  Predicting Internet network distance with coordinates-based approaches , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[20]  Ravi Prakash,et al.  Issues in message passing for a tele-immersive experience , 2001, Proceedings Sixth International Workshop on Object-Oriented Real-Time Dependable Systems.