Optimal reconfiguration strategy for a degradable multimodule computing system

A new quantitative approach to the problem of reconfiguring a degradable multimodule system is presented. The approach is concerned with both assigning some modules for computation and arranging others for reliability. Conventionally, a fault-tolerant system performs reconfiguration only upon a subsystem failure. Since there exists an inherent trade-off between the computation capacity and fault tolerance of a multimodule computing system, the conventional approach is a passive action and does not yield a configuration that provides an optimal compromise for the trade-off. By using the expected total reward as the optimal criterion, the need and existence of an active reconfiguration strategy, in which the system reconfigures itself on the basis of not only the occurrence of a failure but also the progression of the mission, are shown. Following the problem formulation, some important properties of an optimal reconfiguration strategy, which specify (i) the times at which the system should undergo reconfiguration and (ii) the configurations to which the system should change, are investigated. Then, the optimal reconfiguration problem is converted to integer nonlinear knapsack and fractional programming problems. The algorithms for solving these problems and a demonstrative example are given. Extensions of the optimal reconfiguration problem are also discussed.

[1]  Howard Jay Siegel,et al.  The Extra Stage Cube: A Fault-Tolerant Interconnection Network for Supersystems , 1982, IEEE Transactions on Computers.

[2]  John F. Meyer,et al.  On Evaluating the Performability of Degradable Computing Systems , 1980, IEEE Transactions on Computers.

[3]  Kang G. Shin,et al.  Error Detection Process - Model, Design, and Its Impact on Computer Performance , 1984, IEEE Trans. Computers.

[4]  Abraham Charnes,et al.  Management Sciences and Management—Some Requirements for Further Development , 1966 .

[5]  Journal of the Association for Computing Machinery , 1961, Nature.

[6]  Wesley W. Chu,et al.  Task Allocation in Distributed Data Processing , 1980, Computer.

[7]  Mary W. Cooper,et al.  The use of dynamic programming methodology for the solution of a class of nonlinear programming problems , 1980 .

[8]  Svetlana P. Kartashev,et al.  A Multicomputer System with Dynamic Architecture , 1979, IEEE Transactions on Computers.

[9]  S. Chandra,et al.  A Branch and Bound Method for Integer Nonlinear Fractional Programs , 1980 .

[10]  B. Craven,et al.  On fractional programming and equivalence , 1975 .

[11]  S.H. Fuller,et al.  Multi-microprocessors: An overview and working example , 1978, Proceedings of the IEEE.

[12]  Dileep Bhandarkar Some Performance Issues in Multiprocessor System Design , 1977, IEEE Transactions on Computers.

[13]  R. E. Marsten,et al.  An Algorithm for Nonlinear Knapsack Problems , 1976 .

[14]  Gordon Bell,et al.  C.mmp: a multi-mini-processor , 1972, AFIPS '72 (Fall, part II).

[15]  Lorenzo Donatiello,et al.  Analysis of a composite performance reliability measure for fault-tolerant systems , 1987, JACM.

[16]  Bennett Fox,et al.  Discrete Optimization Via Marginal Analysis , 1966 .

[17]  Hanan Luss,et al.  Technical Note - Allocation of Effort Resources among Competing Activities , 1975, Oper. Res..

[18]  Alan M. Frieze Shortest path algorithms for knapsack type problems , 1976, Math. Program..

[19]  M. A. Pollatschek,et al.  On Some Nonlinear Knapsack Problems , 1977 .

[20]  S. Schaible Minimization of ratios , 1976 .

[21]  Kang G. Shin,et al.  Optimal design and use of retry in fault tolerant real-time computer systems , 1983 .

[22]  Paul H. Zipkin,et al.  Simple Ranking Methods for Allocation of One Resource , 1980 .

[23]  Robert J. McMillen,et al.  A survey of interconnection methods for reconfigurable parallel processing systems* , 1899, 1979 International Workshop on Managing Requirements Knowledge (MARK).

[24]  Kang G. Shin,et al.  Performance Measures for Multiprocessor Controllers , 1983, Performance.

[25]  Arthur D. Friedman,et al.  A survey and methodology of reconfigurable multi-module systems , 1978, COMPSAC.

[26]  M. Fisher Worst-Case Analysis of Heuristic Algorithms , 1980 .

[27]  Nimrod Megiddo,et al.  Combinatorial optimization with rational objective functions , 1978, Math. Oper. Res..

[28]  Arthur D. Friedman,et al.  Diagnostic And Computational Reconfiguration In Multiprocessor Systems , 1978, ACM Annual Conference.

[29]  Lorenzo Strigini,et al.  Reconfiguration procedure in a distributed multiprocessor system , 1982 .

[30]  Nimrod Megiddo Combinatorial Optimization with Rational Objective Functions , 1979, Math. Oper. Res..

[31]  T. B. Smith,et al.  Development and evaluation of a fault-tolerant multiprocessor (FTMP) computer. Volume 4: FTMP executive summary , 1984 .

[32]  Svetlana P. Kartashev,et al.  Supersystems for the 80's , 1980, Computer.

[33]  David Grover Furchtgott Performability models and solutions , 1983 .