Towards a Bayesian Statistical Model for the Classification of the Causes of Data Loss

Given the critical nature of communications in computational Grids, it is important to develop efficient, intelligent, and adaptive communication mechanisms. An important milestone on this path is the development of classification mechanisms that can distinguish between the various causes of data loss in cluster and Grid environments. The idea is to use the classification mechanism to determine whether data loss is caused by contention within the network or whether the cause lies outside the network domain. If the cause lies outside the network domain, there is no need to trigger aggressive congestion-control mechanisms. The goal is thus to operate the data transfer at the highest possible rate, backing off aggressively only when the data loss is classified as network related. In this paper, we investigate one promising approach to developing such classification mechanisms, based on the analysis of packet-loss patterns and the application of Bayesian statistics.
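The classify-then-back-off idea can be illustrated with a minimal sketch of Bayes' rule applied to two candidate causes of loss. This is not the model developed in the paper: the function names, the prior, the likelihood values, the decision threshold, and the back-off factors are all hypothetical and chosen only for illustration.

```python
def posterior_network(prior_network, lik_network, lik_other):
    """Bayes' rule for two causes: P(network contention | observed loss pattern).

    prior_network: prior probability that a loss is network related (assumed).
    lik_network:   P(observed pattern | network contention) (assumed).
    lik_other:     P(observed pattern | cause outside the network) (assumed).
    """
    evidence = lik_network * prior_network + lik_other * (1.0 - prior_network)
    if evidence == 0.0:
        raise ValueError("observed pattern has zero probability under both causes")
    return lik_network * prior_network / evidence


def next_rate(rate, p_network, threshold=0.5, aggressive=0.5, mild=0.9):
    """Back off aggressively only when the loss is classified as network related.

    The threshold and both multiplicative factors are illustrative values.
    """
    factor = aggressive if p_network >= threshold else mild
    return rate * factor


if __name__ == "__main__":
    # Hypothetical numbers: equal priors, and a loss pattern that is four times
    # more likely under network contention than under any other cause.
    p = posterior_network(prior_network=0.5, lik_network=0.8, lik_other=0.2)
    print(f"P(network | pattern) = {p:.2f}")
    print(f"next sending rate    = {next_rate(100.0, p):.1f}")
```

In this sketch, a posterior at or above the threshold triggers the aggressive back-off, and any other outcome reduces the rate only mildly, so the transfer stays close to its highest sustainable rate.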
