Strategies for fault tolerance in optical grid networks

The need for powerful computing resources as well as capabilities for storage and transmission of large amounts of data in a number of application areas have led to the emergence of optical grids as a natural, cost-effective platform for supporting such applications. As a result there is also an increasing need for strategies and techniques designed to achieve fault tolerance in optical grid networks. Design for fault tolerance in both grid computing and optical networks are mature, well-researched fields in their own right. However, survivability in optical grids should not be treated merely as a concatenation of techniques developed separately in these two disciplines. Rather, it would be beneficial, in terms of resource availability as well as cost-effectiveness, to develop an integrated approach that takes into consideration the allocation of both computing and networking resources jointly. In this paper, we review the state-of-the-art techniques and approaches that have been proposed in the literature, for designing survivable optical grid networks. We also discuss some challenges, identify some open problems and outline future research directions for developing an integrated approach to fault tolerance in optical grids.

[1]  Emmanuel Dotaro,et al.  Routing and wavelength assignment of scheduled lightpath demands , 2003, IEEE J. Sel. Areas Commun..

[2]  Chris Develder,et al.  On the impact of relocation on network dimensions in resilient optical Grids. , 2010, 2010 14th Conference on Optical Network Design and Modeling (ONDM).

[3]  Christopher E. Dabrowski,et al.  Reliability in grid computing systems , 2009, Concurr. Comput. Pract. Exp..

[4]  Wei Guo,et al.  Joint scheduling for optical grid applications , 2007 .

[5]  Ting Wang,et al.  Survivable logical topology design for distributed computing in WDM networks , 2009, 2009 Conference on Optical Fiber Communication - incudes post deadline papers.

[6]  Ying Chen,et al.  Resource provisioning for survivable WDM networks under a sliding scheduled traffic model , 2009, Opt. Switch. Netw..

[7]  Franco Travostino,et al.  Grid networks : enabling grids with advanced communication technology , 2006 .

[8]  Vinod Vokkarane,et al.  Burst cloning: a proactive scheme to reduce data loss in optical burst-switched networks , 2005, IEEE International Conference on Communications, 2005. ICC 2005. 2005.

[9]  Chris Develder,et al.  Column Generation for Dimensioning Resilient Optical Grid Networks with Relocation , 2010, 2010 IEEE Global Telecommunications Conference GLOBECOM 2010.

[10]  Luying Zhou,et al.  Scheduling Network and Computing Resources for Sliding Demands in Optical Grids , 2009, Journal of Lightwave Technology.

[11]  Neal Charbonneau,et al.  Dynamic circuits with lightpath switching over wavelength routed networks , 2010, 2010 IEEE 4th International Symposium on Advanced Networks and Telecommunication Systems.

[12]  Ting Wang,et al.  Survivable Optical Grids , 2008, OFC/NFOEC 2008 - 2008 Conference on Optical Fiber Communication/National Fiber Optic Engineers Conference.

[13]  Lei Liu,et al.  A resilient OBS/GMPLS network for survival optical grids , 2009, 2009 15th Asia-Pacific Conference on Communications.

[14]  Bin Wang,et al.  Path-protection-based routing and wavelength assignment in wavelength-division multiplexing optical networks under a scheduled traffic model , 2006 .

[15]  Wei Guo,et al.  Task scheduling considering fault probability for distributed computing applications over an optical network , 2008 .

[16]  Bin Wang,et al.  On service provisioning under a scheduled traffic model in reconfigurable WDM optical networks , 2005, 2nd International Conference on Broadband Networks, 2005..

[17]  R. Nejabati,et al.  A Fully Functional Application-aware Optical Burst Switched Network Test-bed , 2007, OFC/NFOEC 2007 - 2007 Conference on Optical Fiber Communication and the National Fiber Optic Engineers Conference.

[18]  Lei Liu,et al.  Experimental Demonstration of P2P-based Optical Grid on LOBS Testbed , 2008, OFC/NFOEC 2008 - 2008 Conference on Optical Fiber Communication/National Fiber Optic Engineers Conference.

[19]  Chris Develder,et al.  Exploiting relocation to reduce network dimensions of resilient optical grids , 2009, 2009 7th International Workshop on Design of Reliable Communication Networks.

[20]  Gigi Karmous-Edwards,et al.  Dynamic scheduling of network resources with advance reservations in optical grids , 2008, Int. J. Netw. Manag..

[21]  Wei Guo,et al.  Fault-Tolerant Policy for Optical Network Based Distributed Computing System , 2008, 2008 Eighth IEEE International Symposium on Cluster Computing and the Grid (CCGRID).

[22]  Vinod Vokkarane,et al.  Analysis of TCP over optical burst-switched networks with burst retransmission , 2005, GLOBECOM '05. IEEE Global Telecommunications Conference, 2005..

[23]  Reza Nejabati Grid Optical Burst Switched Networks ( GOBS ) , 2006 .

[24]  Qiong Zhang,et al.  Reliable optical burst switching for next-generation grid networks , 2005, 2nd International Conference on Broadband Networks, 2005..

[25]  Biswanath Mukherjee,et al.  Survivable WDM mesh networks , 2003 .

[26]  Wei Guo,et al.  Availability-Driven Scheduling for Real-Time Directed Acyclic Graph Applications in Optical Grids , 2010, IEEE/OSA Journal of Optical Communications and Networking.

[27]  Biswanath Mukherjee,et al.  On dimensioning optical grids and the impact of scheduling , 2008, Photonic Network Communications.

[28]  Ying Chen,et al.  A new model for allocating resources to scheduled lightpath demands , 2011, Comput. Networks.

[29]  Bruno Volckaert,et al.  Grid computing: the next network challenge! , 2004 .

[30]  Bruno Volckaert,et al.  Scalable dimensioning of resilient Lambda Grids , 2008, Future Gener. Comput. Syst..

[32]  Piet Demeester,et al.  Design and control of optical grid networks , 2007, 2007 Fourth International Conference on Broadband Communications, Networks and Systems (BROADNETS '07).

[33]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.