Paper Information - Simple Performance Bounds for Multicore and Parallel Channel Systems

A simple modification of existing divisible load scheduling algorithms, boosting link speed by a factor of M for M parallel channels per link, allows time-optimal load scheduling and performance prediction for parallel channel systems. The situation for multicore models is more complex but can be handled by a substitution involving an equivalent processor speed. These modifications yield upper bounds on the performance of such parallel systems. The concept is illustrated for ideal single-level (star) tree networks under a variety of scheduling policies. Less-than-ideal parallelism can also be modeled, though the mechanisms of inefficiency require further research.
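The link-speed substitution described in the abstract can be sketched for one common textbook policy: a single-level tree whose root distributes load sequentially, does not compute itself, and whose children start computing only after receiving their whole fraction. This is a minimal illustration, not the paper's own formulation. The function name `star_makespan` and the parameters `w`, `z`, `t_cp`, `t_cm`, `channels`, and `cores` are hypothetical. Treating a multicore node as one processor that is `cores` times faster is a simplifying assumption standing in for the equivalent-speed substitution, which the abstract notes is more complex in general.

```python
def star_makespan(w, z, t_cp=1.0, t_cm=1.0, channels=1, cores=1):
    """Load fractions and finish time for a single-level (star) tree.

    w[i]: inverse computing speed of child i (assumed notation).
    z[i]: inverse speed of the link to child i (assumed notation).
    channels: M parallel channels per link, modeled as z[i] / M.
    cores: ideal cores per child, modeled as w[i] / cores (assumption).
    """
    if not w or len(w) != len(z):
        raise ValueError("w and z must be non-empty and of equal length")

    # Ideal parallelism: M channels make each link M times faster.
    z_eff = [zi / channels for zi in z]
    # Simplified equivalent processor speed for an ideal multicore node.
    w_eff = [wi / cores for wi in w]

    # All children finish at the same instant, which gives the recursion
    # alpha[i-1] * w[i-1] * t_cp = alpha[i] * (z[i] * t_cm + w[i] * t_cp).
    alpha = [1.0]
    for i in range(1, len(w_eff)):
        ratio = (w_eff[i - 1] * t_cp) / (z_eff[i] * t_cm + w_eff[i] * t_cp)
        alpha.append(alpha[-1] * ratio)

    # Normalize so that the fractions sum to one unit of load.
    total = sum(alpha)
    alpha = [a / total for a in alpha]

    # The first child receives its fraction, then computes it.
    makespan = alpha[0] * (z_eff[0] * t_cm + w_eff[0] * t_cp)
    return alpha, makespan


if __name__ == "__main__":
    base = star_makespan([1.0, 1.0], [1.0, 1.0])
    boosted = star_makespan([1.0, 1.0], [1.0, 1.0], channels=2)
    print(base, boosted)
```

Because the substitution assumes perfectly efficient channels and cores, the resulting finish time is best read as an optimistic bound under this model rather than a prediction for a real system.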

Thomas G. Robertazzi | Carlos Fernando Gamboa
