Reliability Calculus: A Theoretical Framework to Analyze Communication Reliability

Communication reliability is one of the most important concerns and fundamental issues in network systems, such as cyber-physical systems, where network components, sensors, actuators, controllers are interconnected with each other. These systems are prevalent in many safety-critical areas, including aerospace, automotive, civil infrastructure, energy, healthcare, manufacturing, and transportation, etc. In such systems, a single link failure, or communication delay could lead to catastrophic consequences. Hence, there is an urgent demand on efficient methodologies to model and analyze the delay distribution of control messages or feedback signals, especially when networks grow more complex and more heterogenous. In this paper, a calculus based on frequency domain analysis is developed to address this goal, so we can model and analyze the reliability of communication in large-scale compositional networked systems. Several network structures (e.g. serial, parallel, circular and backup) are defined as building blocks to model a wide variety of connections in networked systems. The advantages of the proposed theoretical framework over the traditional time domain approaches include the capability to capture higher order moments of system characteristics, calability to analyze the reliability of complex systems, efficiency in calculation and practicability in simulation.

[1]  Abhay Parekh,et al.  A generalized processor sharing approach to flow control in integrated services networks-the multiple node case , 1993, IEEE INFOCOM '93 The Conference on Computer Communications, Proceedings.

[2]  Péter Benkö,et al.  A large-scale, passive analysis of end-to-end TCP performance over GPRS , 2004, IEEE INFOCOM 2004.

[3]  U. Montanari,et al.  A Boolean algebra method for computing the terminal reliability in a communication network , 1973 .

[4]  Alhussein A. Abouzeid,et al.  Queuing network models for delay analysis of multihop wireless ad hoc networks , 2006, IWCMC '06.

[5]  Abhay Parekh,et al.  A generalized processor sharing approach to flow control in integrated services networks-the single node case , 1992, [Proceedings] IEEE INFOCOM '92: The Conference on Computer Communications.

[6]  Luca Podofillini,et al.  Optimal design of reliable network systems in presence of uncertainty , 2005, IEEE Transactions on Reliability.

[7]  Prasad Calyam,et al.  TBI: end-to-end network performance measurement testbed for empirical bottleneck detection , 2005, First International Conference on Testbeds and Research Infrastructures for the DEvelopment of NeTworks and COMmunities.

[8]  J. Scott Provan,et al.  Calculating bounds on reachability and connectedness in stochastic networks , 1983, Networks.

[9]  Deborah Estrin,et al.  RAP: An end-to-end rate-based congestion control mechanism for realtime streams in the Internet , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[10]  John P. Lehoczky,et al.  Real-time queueing theory , 1996, 17th IEEE Real-Time Systems Symposium.

[11]  Chenyang Lu,et al.  SPEED: a stateless protocol for real-time communication in sensor networks , 2003, 23rd International Conference on Distributed Computing Systems, 2003. Proceedings..

[12]  Patrick Thiran,et al.  Network loss inference with second order statistics of end-to-end flows , 2007, IMC '07.

[13]  Rene L. Cruz,et al.  A calculus for network delay, Part II: Network analysis , 1991, IEEE Trans. Inf. Theory.

[14]  Xue Liu,et al.  On Non-Utilization Bounds for Arbitrary Fixed Priority Policies , 2006, 12th IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS'06).

[15]  Michael O. Ball,et al.  Computational Complexity of Network Reliability Analysis: An Overview , 1986, IEEE Transactions on Reliability.

[16]  Almut Burchard,et al.  A Min-Plus Calculus for End-to-End Statistical Service Guarantees , 2006, IEEE Transactions on Information Theory.

[17]  Martin Haenggi,et al.  Towards an end-to-end delay analysis of wireless multihop networks , 2009, Ad Hoc Networks.

[18]  Tai C Yang,et al.  Networked control system: a brief survey , 2006 .

[19]  Izhak Rubin,et al.  Throughput and Delay Analysis of Multihop IEEE 802.11 Networks with Capture , 2007, 2007 IEEE International Conference on Communications.

[20]  C. Colbourn,et al.  Computing 2-terminal reliability for radio-broadcast networks , 1989 .

[21]  T. Feo,et al.  Partial factoring: an efficient algorithm for approximating two-terminal reliability on complete graphs , 1990 .

[22]  Eduardo Tovar,et al.  Modeling and Worst-Case Dimensioning of Cluster-Tree Wireless Sensor Networks , 2006, 2006 27th IEEE International Real-Time Systems Symposium (RTSS'06).

[23]  Yasunori Iida,et al.  BASIC CONCEPTS AND FUTURE DIRECTIONS OF ROAD NETWORK RELIABILITY ANALYSIS , 1999 .

[24]  Paolo Giacomazzi,et al.  Closed-form analysis of end-to-end network delay with Markov-modulated Poisson and fluid traffic , 2009, Comput. Commun..

[25]  D. De Vleeschauwer,et al.  End-to-end queuing delay assessment in multi-service ip networks , 2002 .

[26]  Insup Lee,et al.  Compositional Analysis Framework Using EDP Resource Models , 2007, RTSS 2007.

[27]  Ness B. Shroff,et al.  Delay Analysis for Multi-Hop Wireless Networks , 2009, IEEE INFOCOM 2009.

[28]  Edward W. Knightly,et al.  End-to-end performance and fairness in multihop wireless backhaul networks , 2004, MobiCom '04.

[29]  Jean-Yves Le Boudec,et al.  Application of Network Calculus to Guaranteed Service Networks , 1998, IEEE Trans. Inf. Theory.

[30]  Qian Zhang,et al.  End-to-End QoS for Video Delivery Over Wireless Internet , 2005, Proc. IEEE.

[31]  Lothar Thiele,et al.  A Comprehensive Worst-Case Calculus for Wireless Sensor Networks with In-Network Processing , 2007, 28th IEEE International Real-Time Systems Symposium (RTSS 2007).

[32]  Jean-Yves Le Boudec,et al.  Network Calculus: A Theory of Deterministic Queuing Systems for the Internet , 2001 .

[33]  Tarek F. Abdelzaher,et al.  On real-time capacity limits of multihop wireless sensor networks , 2004, 25th IEEE International Real-Time Systems Symposium.

[34]  Hai Yang,et al.  Effect of Route Choice Models on Estimating Network Capacity Reliability , 2000 .

[35]  John A. Schormans,et al.  Measurement-based end to end latency performance prediction for SLA verification , 2005, J. Syst. Softw..

[36]  Abhay Parekh,et al.  A generalized processor sharing approach to flow control in integrated services networks: the single-node case , 1993, TNET.

[37]  Edward A. Lee Cyber Physical Systems: Design Challenges , 2008, 2008 11th IEEE International Symposium on Object and Component-Oriented Real-Time Distributed Computing (ISORC).

[38]  Rene L. Cruz,et al.  A calculus for network delay, Part I: Network elements in isolation , 1991, IEEE Trans. Inf. Theory.

[39]  George S. Fishman,et al.  A Monte Carlo Sampling Plan for Estimating Network Reliability , 1984, Oper. Res..

[40]  S.A. Kotsopoulos,et al.  On the End-to-End Delay Analysis of the IEEE 802.11 Distributed Coordination Function , 2007, Second International Conference on Internet Monitoring and Protection (ICIMP 2007).

[41]  Chengzhi Li,et al.  A Network Calculus With Effective Bandwidth , 2007, IEEE/ACM Transactions on Networking.

[42]  L. Breuer Introduction to Stochastic Processes , 2022, Statistical Methods for Climate Scientists.