Design of storage hierarchy in multithreaded architectures

Multithreaded execution models attempt to combine some aspects of dataflow-like execution with von Neumann model execution. Their main objective is to mask the latency of inter-processor communications and remote memory accesses in large scale multiprocessors. An important issue in the analysis and evaluation of multithreaded execution is the design and performance of the storage hierarchy. Because of the sequential execution of threads, the locality of access within an executing thread can be exploited using registers and cache. At the inter-thread level, however, the locality of accesses to memory and its effect on the cache is not yet well understood. A storage model which can exploit this locality is developed and evaluated. The results indicate there is a large amount of inter-thread locality that can be exploited and that we can get an efficient storage system by exploiting the characteristics of nonblocking threads.

[1]  Walid A. Najjar,et al.  Generation and quantitative evaluation of dataflow clusters , 1993, FPCA '93.

[2]  Manoj Franklin,et al.  Block-level prediction for wide-issue superscalar processors , 1995, Proceedings 1st International Conference on Algorithms and Architectures for Parallel Processing.

[3]  David E. Culler,et al.  Two Fundamental Limits on Dataflow Multiprocessing , 1993, Architectures and Compilation Techniques for Fine and Medium Grain Parallelism.

[4]  Masaru Takesue A unified resource management and execution control mechanism for data flow machines , 1987, ISCA '87.

[5]  David E. Culler,et al.  Managing parallelism and resources in scientific dataflow programs , 1989 .

[6]  Walid A. Najjar,et al.  An Evaluation of Optimized Threaded Code Generation , 1994, IFIP PACT.

[7]  Lawrence Rauchwerger,et al.  Measuring limits of parallelism and characterizing its vulnerability to resource constraints , 1993, Proceedings of the 26th Annual International Symposium on Microarchitecture.

[8]  Masaru Takesue Cache Memories for Data Flow Machines , 1992, IEEE Trans. Computers.