Osculant: a multiprocessor self-organizing task scheduler

The world of computing is rapidly changing. One model states that in the future a variety of mobile and/or relocatable computational assets will be rapidly configured and synergistically linked together to achieve a desired outcome. From a logistical viewpoint, these assets will be a collection of interconnected heterogeneous resources ranging possibly high-end platforms to simple man portable systems. This articulated environment must be more mobile, reconfigurable, and fault-tolerant than those currently found in common use. A new bottom-up resource scheduling paradigm, called Osculant, is being studied as the facilitating technology. The paper develops Osculant both from a conceptual and well as experimental standpoint. Simulated results are presented which indicates that Osculant can perform as well, and is generally better in balancing system resources than traditional task schedulers. Issues regarding network communication requirements, processor capabilities, logistics, scheduling details are developed and discussed. Because of self-organized framework provided by Osculant, it will also be shown that it has superior fault recovery capabilities in comparison to traditional top-down schedulers.

[1]  S. T. Chanson,et al.  Adaptive threshold-based scheduling for real-time and non-real-time traffic , 1992, [1992] Proceedings Real-Time Systems Symposium.

[2]  Charles L. Seitz,et al.  Multicomputers: message-passing concurrent computers , 1988, Computer.

[3]  K. Gunther,et al.  Prevention of Deadlocks in Packet-Switched Data Transport Systems , 1981 .

[4]  Howard P. Katseff,et al.  Incomplete Hypercubes , 1988, IEEE Trans. Computers.

[5]  Karsten Schwan,et al.  Experimental Evaluation of a Real-Time Scheduler for a Multiprocessor System , 1991, IEEE Trans. Software Eng..

[6]  Anoop Gupta,et al.  Design of scalable shared-memory multiprocessors: the DASH approach , 1990, Digest of Papers Compcon Spring '90. Thirty-Fifth IEEE Computer Society International Conference on Intellectual Leverage.

[7]  P. Messina,et al.  Architectural requirements of parallel scientific applications with explicit communication , 1993, ISCA '93.

[8]  José Duato,et al.  A New Theory of Deadlock-Free Adaptive Routing in Wormhole Networks , 1993, IEEE Trans. Parallel Distributed Syst..

[9]  Kevin P. McAuliffe,et al.  The IBM Research Parallel Processor Prototype (RP3): Introduction and Architecture , 1985, ICPP.

[10]  William J. Dally Virtual-Channel Flow Control , 1992, IEEE Trans. Parallel Distributed Syst..

[11]  Richard M. Fujimoto,et al.  Multicomputer Networks: Message-Based Parallel Processing , 1987 .

[12]  William J. Dally,et al.  The message-driven processor: a multicomputer processing node with efficient mechanisms , 1992, IEEE Micro.

[13]  Sam Toueg,et al.  Deadlock- and livelock-free packet switching networks , 1980, STOC '80.

[14]  Ross A. Gagliano,et al.  Auction allocation of computing resources , 1995, CACM.

[15]  Ralph Grishman,et al.  The NYU Ultracomputer—Designing an MIMD Shared Memory Parallel Computer , 1983, IEEE Transactions on Computers.

[16]  J. Y. Ngai,et al.  A framework for adaptive routing in multicomputer networks , 1989, CARN.

[17]  Rob A. Rutenbar,et al.  Simulated annealing algorithms: an overview , 1989, IEEE Circuits and Devices Magazine.

[18]  Allan Porterfield,et al.  Exploiting heterogeneous parallelism on a multithreaded multiprocessor , 1992, ICS '92.

[19]  Krithi Ramamritham,et al.  Distributed Scheduling of Tasks with Deadlines and Resource Requirements , 1989, IEEE Trans. Computers.

[20]  Gregory V. Wilson,et al.  A glossary of parallel computing terminology , 1993, IEEE Parallel & Distributed Technology: Systems & Applications.

[21]  Tatsuya Suda,et al.  Survey of traffic control schemes and protocols in ATM networks , 1991, Proc. IEEE.

[22]  Andrew A. Chien,et al.  Do Faster Routers Imply Faster Communication? , 1994, PCRCW.

[23]  James R. Armstrong,et al.  Fault Diagnosis in a Boolean n Cube Array of Microprocessors , 1981, IEEE Transactions on Computers.

[24]  William J. Dally,et al.  A VLSI Architecture for Concurrent Data Structures , 1987 .

[25]  Chung Laung Liu,et al.  Scheduling Algorithms for Multiprogramming in a Hard-Real-Time Environment , 1989, JACM.

[26]  Raymond M. Bryant,et al.  Operating system support for parallel programming on RP3 , 1991, IBM J. Res. Dev..

[27]  Herb Schwetman,et al.  Cost-Performance Bounds for Multimicrocomputer Networks , 1983, IEEE Transactions on Computers.

[28]  Anant Agarwal,et al.  Limits on Interconnection Network Performance , 1991, IEEE Trans. Parallel Distributed Syst..

[29]  Charles L. Seitz,et al.  The cosmic cube , 1985, CACM.

[30]  C. H. C. Leung Quantitative Analysis of Computer Systems , 1990, SIGMETRICS Perform. Evaluation Rev..

[31]  William J. Dally,et al.  Deadlock-Free Adaptive Routing in Multicomputer Networks Using Virtual Channels , 1993, IEEE Trans. Parallel Distributed Syst..

[32]  Leonard Kleinrock,et al.  Virtual Cut-Through: A New Computer Communication Switching Technique , 1979, Comput. Networks.

[33]  Suresh Chalasani,et al.  A comparison of adaptive wormhole routing algorithms , 1993, ISCA '93.

[34]  Xiaodong Zhang,et al.  System effects of interprocessor communication latency in multicomputers , 1991, IEEE Micro.

[35]  Donald F. Ferguson,et al.  Microeconomic algorithms for load balancing in distributed computer systems , 1988, [1988] Proceedings. The 8th International Conference on Distributed.

[36]  Ronald Fernandes,et al.  Recursive Interconnection Networks for Multicomputer Networks , 1992, International Conference on Parallel Processing.