Improving software pipelining with hardware support for self-spatial loads
暂无分享,去创建一个
[1] M. Rajagopalan,et al. Software Pipelining: Petri Net Pacemaker , 1993, Architectures and Compilation Techniques for Fine and Medium Grain Parallelism.
[2] Monica S. Lam,et al. A data locality optimizing algorithm , 1991, PLDI '91.
[3] Krishna Subramanian,et al. Enhanced modulo scheduling for loops with conditional branches , 1992, MICRO 25.
[4] A. Gonzalez,et al. Cache sensitive module scheduling , 1997, Proceedings of 30th Annual International Symposium on Microarchitecture.
[5] Anoop Gupta,et al. Design and evaluation of a compiler algorithm for prefetching , 1992, ASPLOS V.
[6] D.A. Reed,et al. An Integrated Compilation and Performance Analysis Environment for Data Parallel Programs , 1995, Proceedings of the IEEE/ACM SC95 Conference.
[7] F. Jesús Sánchez,et al. Cache Sensitive Modulo Scheduling , 1997, MICRO.
[8] Keith D. Cooper,et al. Effective partial redundancy elimination , 1994, PLDI '94.
[9] Keith D. Cooper,et al. Operator strength reduction , 2001, TOPL.
[10] Vicki H. Allan,et al. Software pipelining , 1995, CSUR.
[11] Keith D. Cooper,et al. Value Numbering , 1997, Softw. Pract. Exp..
[12] Philip H. Sweany,et al. Modulo Scheduling with Cache Reuse Information , 1997, Euro-Par.
[13] Ken Kennedy,et al. Scalar replacement in the presence of conditional control flow , 1994, Softw. Pract. Exp..
[14] Alexander Aiken,et al. Optimal loop parallelization , 1988, PLDI '88.
[15] Scott A. Mahlke,et al. Reverse If-Conversion , 1993, PLDI '93.
[16] Jean-Loup Baer,et al. Reducing memory latency via non-blocking and prefetching caches , 1992, ASPLOS V.
[17] Ken Kennedy,et al. Software prefetching , 1991, ASPLOS IV.