Performance Analysis of Parallel Systems: A Hybrid Approach


[1]  Steven A. Przybylski,et al.  Cache and memory hierarchy design: a performance-directed approach , 1990 .

[2]  G. Amdahl,et al.  Validity of the single processor approach to achieving large scale computing capabilities , 1967, AFIPS '67 (Spring).

[3]  C. G. Bell Multis: A New Class of Multiprocessor Computers , 1985, Science.

[4]  Vivek Sarkar,et al.  Partitioning and Scheduling Parallel Programs for Multiprocessing , 1989 .

[5]  Stephen Gilmore,et al.  The PEPA Workbench: A Tool to Support a Process Algebra-based Approach to Performance Modelling , 1994, Computer Performance Evaluation.

[6]  M. Pernice,et al.  PVM: Parallel Virtual Machine - A User's Guide and Tutorial for Networked Parallel Computing [Book Review] , 1996, IEEE Parallel & Distributed Technology: Systems & Applications.

[7]  Christoph Lindemann,et al.  DSPNexpress: A Software Package for the Efficient Solution of Deterministic and Stochastic Petri Nets , 1993, Perform. Evaluation.

[8]  Constantine D. Polychronopoulos,et al.  Speedup Bounds and Processor Allocation for Parallel Programs on Multiprocessors , 1986, ICPP.

[9]  Raymond E. Miller,et al.  A Comparison of Some Theoretical Models of Parallel Computation , 1973, IEEE Transactions on Computers.

[10]  Guenter Klas,et al.  Net level aggregation using nonlinear optimization for the solution of hierarchical generalized stochastic Petri nets , 1992, Proceedings of the 12th International Conference on Distributed Computing Systems.

[11]  S. Natkin,et al.  Generalization of Queueing Network Product Form Solutions to Stochastic Petri Nets , 1991, IEEE Trans. Software Eng..

[12]  John Zahorjan,et al.  The approximate solution of large queueing network models , 1980 .

[13]  Jean-Marc Vincent,et al.  Stochastic Bounds on Execution Times of Parallel Programs , 1991, IEEE Trans. Software Eng..

[14]  Milos D. Ercegovac,et al.  A methodology for performance analysis of parallel computations with looping constructs , 1992 .

[15]  Kenneth E. Batcher,et al.  Design of a Massively Parallel Processor , 1980, IEEE Transactions on Computers.

[16]  Jeffrey P. Buzen,et al.  The aggregate server method for analyzing serialization delays in computer systems , 1983, TOCS.

[17]  L. Goddard,et al.  Operations Research (OR) , 2007 .

[18]  Alex Kapelnikov,et al.  A Modeling Methodology for the Analysis of Concurrent Systems and Computations , 1989, J. Parallel Distributed Comput..

[19]  Kishor S. Trivedi,et al.  Analytic Queueing Models for Programs with Internal Concurrency , 1983, IEEE Transactions on Computers.

[20]  Brigitte Plateau On the stochastic structure of parallelism and synchronization models for distributed algorithms , 1985, SIGMETRICS '85.

[21]  William L. Maxwell,et al.  Theory of scheduling , 1967 .

[22]  Kishor S. Trivedi,et al.  Techniques and Tools for Reliability and Performance Evaluation: Problems and Perspectives , 1994, Computer Performance Evaluation.

[23]  Edward D. Lazowska,et al.  Analyzing queueing networks with simultaneous resource possession , 1982, CACM.

[24]  Raymond A. Marie An Approximate Analytical Method for General Queueing Networks , 1979, IEEE Transactions on Software Engineering.

[25]  Bruce P. Lester A System for Computing the Speedup of Parallel Programs , 1986, ICPP.

[26]  D. J. Kuck,et al.  Parallel Supercomputing Today and the Cedar Approach , 1986, Science.

[27]  Mary K. Vernon,et al.  The influence of random delays on parallel execution times , 1993, SIGMETRICS '93.

[28]  Armand M. Makowski,et al.  Queueing models for systems with synchronization constraints , 1989, Proc. IEEE.

[29]  Nicolas D. Georganas,et al.  Queueing networks - exact computational algorithms: a unified theory based on decomposition and aggregation , 1989, Computer systems.

[30]  François Baccelli,et al.  On the execution of parallel programs on multiprocessor systems—a queuing theory approach , 1990, JACM.

[31]  Ken Kennedy,et al.  A static performance estimator to guide data partitioning decisions , 1991, PPOPP '91.

[32]  Tadao Murata,et al.  Petri nets: Properties, analysis and applications , 1989, Proc. IEEE.

[33]  Anthony E. Krzesinski,et al.  Improved lineariser methods for queueing networks with queue dependent centres , 1984, SIGMETRICS '84.

[34]  Daniel Gajski,et al.  CEDAR: a large scale multiprocessor , 1983, CARN.

[35]  Günter Haring,et al.  PAPS - The Parallel Program Performance Prediction Toolset , 1994, Computer Performance Evaluation.

[37]  Jeffrey P. Buzen,et al.  Computational algorithms for closed queueing networks with exponential servers , 1973, Commun. ACM.

[38]  Krishnan Padmanabhan Cube structures for multiprocessors , 1990, CACM.

[39]  M. Malik,et al.  Operating Systems , 1992, Lecture Notes in Computer Science.

[40]  Arjan J. C. van Gemund,et al.  Performance prediction of parallel processing systems: the PAMELA methodology , 1993, ICS '93.

[41]  Subrata Dasgupta,et al.  A hierarchical taxonomic system for computer architectures , 1990, Computer.

[42]  Jack B. Dennis,et al.  Programming semantics for multiprogrammed computations , 1966, CACM.

[43]  Edward D. Lazowska,et al.  Quantitative system performance - computer system analysis using queueing network models , 1983, Int. CMG Conference.

[44]  Heinz Mühlenbein,et al.  Network Programming with MUPPET , 1988, J. Parallel Distributed Comput..

[45]  Derek L. Eager,et al.  Bound hierarchies for multiple-class queuing networks , 1986, JACM.

[46]  Kishor S. Trivedi,et al.  Reliability Modeling Using SHARPE , 1987, IEEE Transactions on Reliability.

[47]  James W. Moore,et al.  A Benchmark Comparison of Three Supercomputers: Fujitsu VP-200, Hitachi S810/20, and Cray X-MP/2 , 1985, Computer.

[48]  Dharma P. Agrawal,et al.  Design and Performance of Generalized Interconnection Networks , 1983, IEEE Transactions on Computers.

[49]  Markus Siegle Using Structured Modelling for Efficient Performance Prediction of Parallel Systems , 1993, PARCO.

[50]  E. de Jong,et al.  High-level specification tools for parallel application development , 1992, CompEuro 1992 Proceedings Computer Systems and Software Engineering.

[51]  Allen D. Malony,et al.  Vector Processing on the Alliant FX/8 Multiprocessor , 1986, ICPP.

[52]  Alan H. Karp,et al.  A comparison of 12 parallel FORTRAN dialects , 1988, IEEE Software.

[53]  Marco Ajmone Marsan,et al.  A class of generalized stochastic Petri nets for the performance evaluation of multiprocessor systems , 1984 .

[54]  Sten F. Andler,et al.  Predicate path expressions , 1979, POPL.

[55]  Henk Jonkers,et al.  Queueing Models of Parallel Applications: The Glamis Methodology , 1994, Computer Performance Evaluation.

[56]  Wolfgang Kreutzer,et al.  System simulation , 1986 .

[57]  Arjan J. C. van Gemund The PAMELA Approach to Performance Modeling of Parallel and Distributed Systems , 1993, PARCO.

[58]  C. A. R. Hoare,et al.  Monitors: an operating system structuring concept , 1974, CACM.

[59]  Subhash C. Agrawal Metamodeling: A Study of Approximations in Queueing Models , 1984 .

[60]  Jane Hillston A Tool to Enhance Model Exploitation , 1995, Perform. Evaluation.

[61]  Simon S. Lam,et al.  PAM-a noniterative approximate solution method for closed multichain queueing networks , 1988, SIGMETRICS '88.

[62]  B. J. Lageweg,et al.  Multiprocessor scheduling with communication delays , 1990, Parallel Comput..

[63]  Wlodzimierz M. Zuberek,et al.  Timed Petri nets and preliminary performance evaluation , 1980, ISCA '80.

[64]  Arjan J. C. van Gemund,et al.  Compiling performance models from parallel programs , 1994, ICS '94.

[65]  Teemu Kerola The Composite Bound Method for Computing Throughput Bounds in Multiple Class Environments , 1986, Perform. Evaluation.

[66]  Erwin M. Thurner,et al.  TOMSPIN - A Tool for Modeling with Stochastic Petri Nets , 1995, EUROCAST.

[67]  Albrecht Sieber,et al.  Approximate Analysis of Load Dependent General Queueing Networks , 1988, IEEE Trans. Software Eng..

[68]  Matteo Sereno,et al.  On the Product Form Solution for Stochastic Petri Nets , 1992, Application and Theory of Petri Nets.

[69]  Gregory R. Andrews,et al.  Concepts and Notations for Concurrent Programming , 1983, CSUR.

[70]  Vassilis Mertsiotakis,et al.  PEPP: Performance Evaluation of Parallel Programs , 1992 .

[71]  Melvin E. Conway,et al.  A multiprocessor system design , 1963, AFIPS '63 (Fall).

[72]  Victor W. K. Mak,et al.  Performance prediction of concurrent systems , 1987 .

[73]  C. Petri Kommunikation mit Automaten , 1962 .

[74]  Harry F. Jordan,et al.  The Force: A Highly Portable Parallel Programming Language , 1989, ICPP.

[75]  Henk Jonkers,et al.  A probabilistic approach to parallel system performance modelling , 1995, Proceedings of the Twenty-Eighth Annual Hawaii International Conference on System Sciences.

[76]  Raj Jain,et al.  The art of computer systems performance analysis - techniques for experimental design, measurement, simulation, and modeling , 1991, Wiley professional computing.

[77]  Satish K. Tripathi,et al.  Approximate Solution to Multichain Queueing Networks with State Dependent Service Rates , 1985, Perform. Evaluation.

[78]  John Zahorjan,et al.  Balanced job bound analysis of queueing networks , 1982, CACM.

[79]  P. Brinch-Hansen,et al.  The programming language Concurrent Pascal , 1975 .

[80]  Frederica Darema,et al.  A Speedup Analyzer for Parallel Programs , 1987, ICPP.

[81]  John L. Gustafson,et al.  Reevaluating Amdahl's law , 1988, CACM.

[82]  Kurt Jensen,et al.  Coloured Petri nets: A high level language for system design and analysis , 1991, Applications and Theory of Petri Nets.

[83]  Yonathan Bard,et al.  Some Extensions to Multiclass Queueing Network Analysis , 1979, Performance.

[84]  Dharma P. Agrawal,et al.  Performance of multiprocessor interconnection networks , 1989, Computer.

[85]  Allen D. Malony,et al.  Automatic Scalability Analysis of Parallel Programs Based on Modeling Techniques , 1994, Computer Performance Evaluation.

[86]  Marco Ajmone Marsan,et al.  On Petri nets with deterministic and exponentially distributed firing times , 1986, European Workshop on Applications and Theory of Petri Nets.

[87]  David B. Skillicorn A taxonomy for computer architectures , 1988, Computer.

[88]  Michael J. Flynn,et al.  Some Computer Organizations and Their Effectiveness , 1972, IEEE Transactions on Computers.

[89]  Dennis Gannon,et al.  Building analytical models into an interactive performance prediction tool , 1989, Proceedings of the 1989 ACM/IEEE Conference on Supercomputing (Supercomputing '89).

[90]  Craig J. Mundie,et al.  The Architecture of the Alliant FX/8 Computer , 1986, COMPCON.

[91]  Lionel M. Ni,et al.  A survey of wormhole routing techniques in direct networks , 1993, Computer.

[92]  W. Daniel Hillis,et al.  The connection machine , 1985 .

[93]  Norbert Götz,et al.  Multiprocessor and Distributed System Design: The Integration of Functional Specification and Performance Analysis Using Stochastic Process Algebras , 1993, Performance/SIGMETRICS Tutorials.

[94]  James Lyle Peterson,et al.  Petri net theory and the modeling of systems , 1981 .

[95]  D. Cox A use of complex probabilities in the theory of stochastic processes , 1955, Mathematical Proceedings of the Cambridge Philosophical Society.

[96]  Kishor S. Trivedi,et al.  Performance and Reliability Analysis Using Directed Acyclic Graphs , 1987, IEEE Transactions on Software Engineering.

[97]  J. C. M. Baeten,et al.  Process Algebra: Bibliography , 1990 .

[98]  Raymond A. Marie,et al.  Extensions and computational aspects of an iterative method , 1982, SIGMETRICS '82.

[99]  Steve Wallach Integrated scalar and vector processing on the affordable CONVEX C-1 supercomputer , 1989 .

[100]  C. A. R. Hoare,et al.  Communicating sequential processes , 1978, CACM.

[101]  Yves Robert The Impact of Vector and Parallel Architectures on the Gaussian Elimination Algorithm , 1991 .

[102]  R. J. Wijbrands,et al.  Queuing network models and performance analysis of computer systems , 1988 .

[103]  Philip J. Hatcher,et al.  Data-parallel programming on multicomputers , 1990, IEEE Software.

[104]  Anand Sivasubramaniam,et al.  Machine Abstractions and Locality Issues in Studying Parallel Systems , 1993 .

[105]  Peter G. Taylor,et al.  Embedded Processes in Stochastic Petri Nets , 1991, IEEE Trans. Software Eng..

[106]  Gianfranco Balbo,et al.  Combining queueing network and Generalized Stochastic Petri Net models for the analysis of some software blocking phenomena , 1985, IEEE Transactions on Software Engineering.

[107]  Peter Buchholz,et al.  Hierarchical Markovian Models: Symmetries and Reduction , 1995, Perform. Evaluation.

[108]  Alois Ferscha,et al.  A Petri Net Approach for Performance Oriented Parallel Program Design , 1992, J. Parallel Distributed Comput..

[109]  M. E. Woodward,et al.  Queueing Models of Shared-memory Parallel Applications , 1994 .

[110]  Wil M.P. van der Aalst Putting High-level Petri Nets to Work in Industry , 1994 .

[111]  Erol Gelenbe,et al.  Multiprocessor Performance , 1990, SIGMETRICS Perform. Evaluation Rev..

[112]  Thierry Priol,et al.  KOAN: A Shared Virtual Memory for the iPSC/2 Hypercube , 1992, CONPAR.

[113]  Gianfranco Balbo,et al.  Combining Queueing Networks and Generalized Stochastic Petri Nets for the Solution of Complex Models of System Behavior , 1988, IEEE Trans. Computers.

[114]  Henk Sips,et al.  Booster : a high-level language for portable parallel algorithms , 1991 .

[115]  K. Mani Chandy,et al.  Open, Closed, and Mixed Networks of Queues with Different Classes of Customers , 1975, JACM.

[116]  Vikram S. Adve,et al.  Analyzing the behavior and performance of parallel programs , 1993 .

[117]  Dirk Roose,et al.  Benchmarking the iPSC/2 Hypercube Multiprocessor , 1989, Concurr. Pract. Exp..

[118]  C. A. Petri Communication with automata , 1966 .

[119]  A. V. Gemund,et al.  Predicting Contention in Distributed-Memory Machines , 1995 .

[120]  Stephen F. Lundstrom,et al.  Predicting Performance of Parallel Computations , 1990, IEEE Trans. Parallel Distributed Syst..

[121]  K. Mani Chandy,et al.  Linearizer: a heuristic algorithm for queueing network models of computing systems , 1982, CACM.

[122]  David K. Smith,et al.  Queueing Networks — Exact Computational Algorithms: A Unified Theory Based on Decomposition and Aggregation , 1991 .

[123]  Arjan J. C. van Gemund The PAMELA Run-Time Library Version 1.0 , 1994 .

[124]  Jack Dongarra,et al.  Comparison of the CRAY X-MP-4, Fujitsu VP-200, and Hitachi S-810/20 , 1986, Simul..

[125]  Peter J. Denning,et al.  The Operational Analysis of Queueing Network Models , 1978, CSUR.