Dimensioning optical clouds with shared-path shared-computing (SPSC) protection

Service relocation is a promising strategy for providing flexible and resource-efficient resiliency against link failures in the optical cloud environment. However, when a failure affects a node hosting a datacenter (DC), service relocation from the affected DC is not possible. One alternative for protecting against DC failures is to use design strategies that duplicate the IT (i.e., storage and processing) resources in a backup DC, at the expense of increasing the resource overbuild (i.e., cost) of the network. This work proposes a dimensioning strategy based on the shared-path shared-computing (SPSC) concept that is able to protect against any single link, server, or DC failure scenario with minimal resource overbuild for the network and IT infrastructures. SPSC is based on the intuition that only storage units need complete replication in a backup DC, while processing units can be instantiated only after the occurrence of a failure, leaving the design strategy some leeway to minimize their number. As a result, the proposed SPSC design shows a considerable reduction in the amount of backup resources when compared to dedicated protection strategies.
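The intuition behind shared computing can be illustrated with a small counting sketch. It is not the dimensioning method of this work: the function `backup_units`, the DC names, the unit counts, and the fixed primary-to-backup assignment are all hypothetical, and the network (path) side is ignored. It assumes at most one DC fails at a time, so processing units at a backup DC only need to cover the largest single primary DC assigned to it, whereas storage is always fully replicated.

```python
from collections import defaultdict

def backup_units(primary, backup_of):
    """Count backup IT units under a single-DC-failure assumption.

    primary:   {dc: (storage_units, processing_units)}  (hypothetical demand)
    backup_of: {dc: backup_dc}                          (hypothetical assignment)
    Returns (storage, dedicated_processing, shared_processing) totals.
    """
    storage = defaultdict(int)
    dedicated_cpu = defaultdict(int)
    shared_cpu = defaultdict(int)
    for dc, (s, p) in primary.items():
        b = backup_of[dc]
        # Storage is replicated in full in both schemes.
        storage[b] += s
        # Dedicated protection reserves processing for every primary DC.
        dedicated_cpu[b] += p
        # Shared computing: only one DC fails at a time, so the backup DC
        # needs enough processing for the largest primary it protects.
        shared_cpu[b] = max(shared_cpu[b], p)
    return (sum(storage.values()),
            sum(dedicated_cpu.values()),
            sum(shared_cpu.values()))

# Illustrative numbers only.
primary = {"A": (10, 8), "B": (6, 5), "C": (4, 3)}
backup_of = {"A": "C", "B": "C", "C": "A"}
storage, dedicated, shared = backup_units(primary, backup_of)
print(storage, dedicated, shared)
```

In this toy instance the backup storage is the same in both schemes, while the backup processing units drop because DC C protects both A and B but never has to serve both at once.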