Performance modeling and analysis of correlated parallel computations

A performance analysis methodology for correlated parallel computations based on statistical theory is proposed. Divide-and-conquer strategy is widely used in solving problems in parallel by partitioning and allocating a number of given tasks to available computing resources. When the tasks exhibit run-time-dependent behaviors during execution and share a universal distribution function in their execution times, analysis of parallel execution time can be performed with the assistance of probabilistic and statistical models. Correlation (dependence) in execution times among tasks has posed a significant factor in influencing the analysis accuracy which is unmanageable by any known analysis methodologies. We establish a relation between a task's or a processor's execution time and the parallel execution time, in terms of expected value as well as variance when each task's execution time can be closely modeled by a normal distribution, for either uncorrelated or correlated tasks. This relation is then applied to the modeling and analysis of various parallel computation paradigms in which different communication and synchronization patterns along the processing are present. The method proposed has a wider application scope and gives more accurate prediction results than previously known approaches. We also show that, as an extended application of the analysis method to a large scope of problems, load balance among processors can be vastly improved with some novel static task allocation technique in manipulating the correlation among tasks. Experimental results in analyzing a parallel tree search algorithm and two parallel sorting algorithms show very accurate analysis and prediction with the proposed method.

[1]  R. F. Brown,et al.  PERFORMANCE EVALUATION , 2019, ISO 22301:2019 and business continuity management – Understand how to plan, implement and enhance a business continuity management system (BCMS).

[2]  Satish K. Tripathi,et al.  On Performance Prediction of Parallel Computations with Precedent Constraints , 2000, IEEE Trans. Parallel Distributed Syst..

[3]  J. T. Robinson,et al.  Some Analysis Techniques for Asynchronous Multiprocessor Algorithms , 1979, IEEE Transactions on Software Engineering.

[4]  Daniel A. Menascé,et al.  A Methodology for Performance Evaluation of Parallel Applications on Multiprocessors , 1992, J. Parallel Distributed Comput..

[5]  Jon G. Kuhl,et al.  Stochastic Performance Models of Parallel Task Systems , 1994, SIGMETRICS.

[6]  Mary K. Vernon,et al.  The influence of random delays on parallel execution times , 1993, SIGMETRICS '93.

[7]  John G. Proakis,et al.  Probability, random variables and stochastic processes , 1985, IEEE Trans. Acoust. Speech Signal Process..

[8]  Xingfu Wu,et al.  Performance Evaluation, Prediction and Visualization of Parallel Systems , 1999, The Kluwer International Series on Asian Studies in Computer and Information Science.

[9]  Bajis M. Dodin,et al.  Bounding the Project Completion Time Distribution in PERT Networks , 1985, Oper. Res..

[10]  Jean-Marc Vincent,et al.  Stochastic Bounds on Execution Times of Parallel Programs , 1991, IEEE Trans. Software Eng..

[11]  Ranette Halverson,et al.  Performance Evaluation of Parallel Implementation of Nested Loop Control Structures , 2022 .

[12]  Roland R. Mielke,et al.  Prediction of Performance and Processor Requirements in Real-Time Data Flow Architectures , 1993, IEEE Trans. Parallel Distributed Syst..

[13]  Michel Dubois,et al.  Performance of Synchronized Iterative Processes in Multiprocessor Systems , 1982, IEEE Transactions on Software Engineering.

[14]  Bo Yang,et al.  Load balancing technique for parallel search with statistical model , 1995, Proceedings International Phoenix Conference on Computers and Communications.

[15]  Jens Mache,et al.  Performance evaluation of parallel file systems for PC clusters and ASCI red , 2001, Proceedings 42nd IEEE Symposium on Foundations of Computer Science.

[16]  Kin K. Leung,et al.  Task Response Time For Real-Time Distributed Systems With Resource Contentions , 1991, IEEE Trans. Software Eng..

[17]  Randolph D. Nelson,et al.  A performance evaluation of a general parallel processing model , 1990, SIGMETRICS '90.

[18]  Anthony J. G. Hey,et al.  Realistic Parallel Performance Estimation , 1997, Parallel Comput..

[19]  Vishwani D. Agrawal,et al.  Performance Analysis of Synchronized Iterative Algorithms on Multiprocessor Systems , 1992, IEEE Trans. Parallel Distributed Syst..

[20]  Dan C. Marinescu,et al.  Synchronization and load imbalance effects in distributed memory multi-processor systems , 1991, Concurr. Pract. Exp..

[21]  Naohiro Ishii,et al.  Multithreaded Parallel Computer Model with Performance Evaluation , 2000, IPDPS Workshops.

[22]  B. Arnold,et al.  A first course in order statistics , 1994 .

[23]  S. Madala,et al.  Performance of Synchronous Parallel Algorithms with Regular Structures , 1991, IEEE Trans. Parallel Distributed Syst..

[24]  Arjan J. C. van Gemund,et al.  Low-cost static performance prediction of parallel stochastic task compositions , 2006, IEEE Transactions on Parallel and Distributed Systems.

[25]  John Gaschnig,et al.  A General Backtrack Algorithm That Eliminates Most Redundant Tests , 1977, IJCAI.

[26]  Alan Weiss,et al.  Allocating Independent Subtasks on Parallel Processors , 1985, IEEE Transactions on Software Engineering.

[27]  Arjan J. C. van Gemund,et al.  Symbolic Performance Modeling of Parallel Systems , 2003, IEEE Trans. Parallel Distributed Syst..

[28]  A. E. Sarhan,et al.  Contributions to order statistics , 1964 .

[29]  Mark J. Clement,et al.  Analytical performance prediction on multicomputers , 1993, Supercomputing '93. Proceedings.

[30]  Vivek Sarkar,et al.  Determining average program execution times and their variance , 1989, PLDI '89.

[31]  Bo Yang,et al.  Probabilistic performance analysis for parallel search techniques , 1995, International Journal of Parallel Programming.

[32]  Stephen F. Lundstrom,et al.  Predicting Performance of Parallel Computations , 1990, IEEE Trans. Parallel Distributed Syst..

[33]  Linda F. Wilson,et al.  Synchronization and communication in algorithmic structures , 1994, Proceedings of 1994 6th IEEE Symposium on Parallel and Distributed Processing.

[34]  Robert M. Haralick,et al.  Increasing Tree Search Efficiency for Constraint Satisfaction Problems , 1979, Artif. Intell..

[35]  Theodore Brown,et al.  Parallel Median Splitting and k-Splitting with Application to Merging and Sorting , 1993, IEEE Trans. Parallel Distributed Syst..

[36]  Hong Linh Truong,et al.  SCALEA: A Performance Analysis Tool for Distributed and Parallel Programs , 2002, Euro-Par.