Entirely protecting operating systems against transient errors in space environment

In this article, we propose a mainly-software hardening technique to totally protect unmodified running operating systems on COTS hardware against transient errors in heavily radiation - flooded environment like high altitude space. The technique is currently being implemented in a hypervisor and allows to control the upper layers of the software stack (operating system and applications). The rest of the system, the hypervisor, will be protected by other means, thus resulting in a completely protected system against transient errors. The induced overhead turns around 200% but this is expected to decrease with future improvements.

[1]  Marc Lobelle,et al.  A software based approach to eliminate all SEU effects from mission critical programs , 2011, 2011 12th European Conference on Radiation and Its Effects on Components and Systems.

[2]  Udo Steinberg,et al.  NOVA: a microhypervisor-based secure virtualization architecture , 2010, EuroSys '10.

[3]  Björn Döbel,et al.  Operating system support for redundant multithreading , 2012, EMSOFT '12.

[4]  T. Goka,et al.  The on-orbit measurements of single event phenomena by ETA-V spacecraft , 1991 .

[5]  Raphael R. Some,et al.  Experimental evaluation of a COTS system for space applications , 2002, Proceedings International Conference on Dependable Systems and Networks.