Fault-Tolerant Architecture for Peer to Peer Network Management Systems

In this paper we propose a three-tier hierarchical architecture, based on the peer-to-peer model, for network management. The main focus of the proposed architecture is providing fault tolerance, which in turn increases the availability of the Network Management System (NMS). We use redundancy in each tier of the architecture to achieve this goal. However, we do not add redundant peers, so no peer redundancy is imposed on the system. Instead, we assign several roles to selected peers and therefore add only some software redundancy, which the advanced processors of the NMS's peers can easily tolerate. Because of the hierarchical structure, the failure of nodes in different tiers may affect the NMS's availability differently. We therefore examined, by means of an extensive simulation study, how the failure of peers playing different roles in the architecture affects the availability of the system. The results show that the proposed architecture offers higher availability than previously proposed peer-to-peer NMSs. It also shows lower sensitivity to node failures.
