Hardware Support for Control Transfers in Code Caches
暂无分享,去创建一个
[1] D.R. Kaeli,et al. Branch history table prediction of moving target branches due to subroutine returns , 1991, [1991] Proceedings. The 18th Annual International Symposium on Computer Architecture.
[2] Kemal Ebcioglu,et al. An architectural framework for supporting heterogeneous instruction-set architectures , 1993, Computer.
[3] Eric Rotenberg,et al. Trace cache: a low latency approach to high bandwidth instruction fetching , 1996, Proceedings of the 29th Annual IEEE/ACM International Symposium on Microarchitecture. MICRO 29.
[4] Mendel Rosenblum,et al. Embra: fast and flexible machine simulation , 1996, SIGMETRICS '96.
[5] Raymond J. Hookway,et al. DIGITAL FX!32: Combining Emulation and Binary Translation , 1997, Digit. Tech. J..
[6] Todd M. Austin,et al. The SimpleScalar tool set, version 2.0 , 1997, CARN.
[7] Bich C. Le,et al. An out-of-order execution technique for runtime binary translators , 1998, ASPLOS VIII.
[8] M. K. Gschwind,et al. Method and apparatus for determining branch addresses in programs generated by binary translation , 1998 .
[9] Erik R. Altman,et al. BOA: The Architecture of a Binary Translation Processor , 1999 .
[10] Evelyn Duesterwald,et al. Design and implementation of a dynamic optimization framework for windows , 2000 .
[11] Daniel A. Jiménez,et al. The impact of delay on the design of branch predictors , 2000, Proceedings 33rd Annual IEEE/ACM International Symposium on Microarchitecture. MICRO-33 2000.
[12] Mateo Valero,et al. The effect of code reordering on branch prediction , 2000, Proceedings 2000 International Conference on Parallel Architectures and Compilation Techniques (Cat. No.PR00622).
[13] Sorin Lerner,et al. Mojo: A Dynamic Optimization System , 2000 .
[14] Wen-mei W. Hwu,et al. A hardware mechanism for dynamic extraction and relayout of program hot spots , 2000, Proceedings of 27th International Symposium on Computer Architecture (IEEE Cat. No.RS00201).
[15] Cindy Zheng,et al. PA-RISC to IA-64: Transparent Execution, No Recompilation , 2000, Computer.
[16] Cristina Cifuentes,et al. Dynamic Binary Translation , 2000 .
[17] John Whaley. Partial method compilation using dynamic profile information , 2001, OOPSLA '01.
[18] Michael Gschwind,et al. Dynamic Binary Translation and Optimization , 2001, IEEE Trans. Computers.
[19] David J. Sager,et al. The microarchitecture of the Pentium 4 processor , 2001 .
[20] Sanjay J. Patel,et al. Performance characterization of a hardware mechanism for dynamic optimization , 2001, Proceedings. 34th ACM/IEEE International Symposium on Microarchitecture. MICRO-34.
[21] David Gregg,et al. The Behavior of Efficient Virtual Machine Interpreters on Modern Architectures , 2001, Euro-Par.
[22] Balaram Sinharoy,et al. POWER4 system microarchitecture , 2002, IBM J. Res. Dev..
[23] DELI: a new run-time control point , 2002, MICRO 35.
[24] Pat Conway,et al. The AMD Opteron Processor for Multiprocessor Servers , 2003, IEEE Micro.
[25] Laurie J. Hendren,et al. Dynamic profiling and trace cache generation , 2003, International Symposium on Code Generation and Optimization, 2003. CGO 2003..
[26] Mary Lou Soffa,et al. Retargetable and reconfigurable software dynamic translation , 2003, International Symposium on Code Generation and Optimization, 2003. CGO 2003..
[27] James E. Smith,et al. Dynamic binary translation for accumulator-oriented architectures , 2003, International Symposium on Code Generation and Optimization, 2003. CGO 2003..
[28] Derek Bruening,et al. An infrastructure for adaptive dynamic optimization , 2003, International Symposium on Code Generation and Optimization, 2003. CGO 2003..
[29] Scott A. Mahlke,et al. The superblock: An effective technique for VLIW and superscalar compilation , 1993, The Journal of Supercomputing.
[30] Woody Lichtenstein,et al. The multiflow trace scheduling compiler , 1993, The Journal of Supercomputing.