Adaptable and Autonomic Mission Manager for Dependable Aerospace Computing

As NASA and other agencies continue to undertake ever challenging remote sensing missions, the ability of satellites and space probes to diagnose and autonomously recover from faults will be paramount. In addition, a more pronounced use of radiation-susceptible components in order to reduce cost makes the challenge of ensuring system dependability even more difficult. To meet these and other needs, a processing platform for space is currently under development at Honeywell Inc. and the University of Florida for an upcoming NASA New Millennium Program mission. Among other features, the platform deploys an autonomic software management system to increase system dependability. In addition, a mission manager has been investigated and developed to provide an autonomous means to adapt to environmental conditions and system failures. This paper provides a detailed analysis of the management system philosophy with a focus on the adaptable mission manager. A case study is presented that highlights the dependability and performance improvement provided by the mission manager and autonomic health monitoring scheme

[1]  High-performance, Dependable Multiprocessor , 2006, 2006 IEEE Aerospace Conference.

[2]  A. George,et al.  Scheduling tradeoffs for heterogeneous computing on an advanced space processing platform , 2006, 12th International Conference on Parallel and Distributed Systems - (ICPADS'06).

[3]  Eui-Nam Huh,et al.  Adaptive Management of Computing and Network Resources for Spacecraft Systems , 2000 .

[4]  Huaglory Tianfield,et al.  A concise introduction to autonomic computing , 2005, Adv. Eng. Informatics.

[5]  Alan D. George,et al.  Reliable Management Services for COTS-based Space Systems and Applications , 2006, ESA.

[6]  Roy Sterritt,et al.  Autonomic Computing - a means of achieving dependability? , 2003, 10th IEEE International Conference and Workshop on the Engineering of Computer-Based Systems, 2003. Proceedings..

[7]  Roy Sterritt,et al.  Self*- Properties in NASA Mission , 2005, Software Engineering Research and Practice.

[8]  Pamela Elizabeth Clark,et al.  ANTS for Human Exploration and Development of Space , 2003, 2003 IEEE Aerospace Conference Proceedings (Cat. No.03TH8652).

[9]  Daniel S. Katz,et al.  Demonstration of the remote exploration and experimentation (REE) fault-tolerant parallel-processing supercomputer for spacecraft onboard scientific data processing , 2000, Proceeding International Conference on Dependable Systems and Networks. DSN 2000.

[10]  M. Patel,et al.  High Performance Dependable Multiprocessor II , 2007, 2007 IEEE Aerospace Conference.

[11]  David Finkel,et al.  Autonomic cluster management system (ACMS): a demonstration of autonomic principles at work , 2005, 12th IEEE International Conference and Workshops on the Engineering of Computer-Based Systems (ECBS'05).

[12]  Roy Sterritt,et al.  Self-* Properties in NASA Missions , 2005 .

[13]  Roy Sterritt,et al.  Towards Autonomic Management of NASA Missions , 2005, 11th International Conference on Parallel and Distributed Systems (ICPADS'05).

[14]  Salim Hariri,et al.  Autonomic Computing: An Overview , 2004, UPP.

[15]  Alan D. George,et al.  FEMPI: A Lightweight Fault-tolerant MPI for Embedded Cluster Systems , 2006, ESA.

[16]  Ravishankar K. Iyer,et al.  The Effects of an ARMOR-based SIFT environment on the performance and dependability of user applications , 2004, IEEE Transactions on Software Engineering.

[17]  A.D. George,et al.  Hardware/software interface for high-performance space computing with FPGA coprocessors , 2006, 2006 IEEE Aerospace Conference.