Recently, CPUs with an identical ISA tend to have different microarchitectures, different computation resources, and special instructions. To achieve efficient program execution on such hardware, compilers have machine-dependent code optimization. However, software vendors cannot adopt this optimization for software production, since the software would be widely distributed and therefore it must be executable on any machine with the same ISA. On the other hand, there is a significant gap between processorpsilas operational speed and memory access speed, and currently the gap is increasing. In this paper, we introduce several special prefetch instructions that are suited for memory access patterns that frequently appear in program execution. However, such special instructions are utilized only by compilerpsilas machine-dependent code optimization, and therefore software vendors do not utilize such instructions. To increase opportunities for effectively exploiting the instructions for optimization, we propose dynamic optimization techniques that consist of dynamic code modification and analysis methods of memory references. We evaluate the techniques by using SPEC2000 benchmarks.
[1]
Michel Dubois,et al.
International Conference on Parallel Processing Fixed and Adaptive Sequential Prefetching in Shared Memory Multiprocessors
,
2006
.
[2]
Wei-Chung Hsu,et al.
Data Prefetching On The HP PA-8000
,
1997,
Conference Proceedings. The 24th Annual International Symposium on Computer Architecture.
[3]
Philippe Clauss,et al.
Performance driven data cache prefetching in a dynamic software optimization system
,
2007,
ICS '07.
[4]
Kenneth C. Yeager.
The Mips R10000 superscalar microprocessor
,
1996,
IEEE Micro.
[5]
Janak H. Patel,et al.
Stride directed prefetching in scalar processors
,
1992,
MICRO 1992.
[6]
Doug Burger,et al.
Evaluating Future Microprocessors: the SimpleScalar Tool Set
,
1996
.
[7]
J.W.C. Fu,et al.
Stride Directed Prefetching In Scalar Processors
,
1992,
[1992] Proceedings the 25th Annual International Symposium on Microarchitecture MICRO 25.
[8]
Jean-Loup Baer,et al.
Effective Hardware Based Data Prefetching for High-Performance Processors
,
1995,
IEEE Trans. Computers.