SLICC: Self-Assembly of Instruction Cache Collectives for OLTP Workloads
暂无分享,去创建一个
[1] Sarita V. Adve,et al. Performance of database workloads on shared-memory systems with out-of-order processors , 1998, ASPLOS VIII.
[2] Anastasia Ailamaki,et al. Reducing OLTP instruction misses with thread migration , 2012, DaMoN '12.
[3] Babak Falsafi,et al. Proactive instruction fetch , 2011, 2011 44th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).
[4] David A. Patterson,et al. Performance characterization of a Quad Pentium Pro SMP using OLTP workloads , 1998, ISCA.
[5] Ippokratis Pandis,et al. From A to E: analyzing TPC's OLTP benchmarks: the obsolete, the ubiquitous, the unexplored , 2013, EDBT '13.
[6] Gabriel H. Loh,et al. Zesto: A cycle-level simulator for highly detailed microarchitecture exploration , 2009, 2009 IEEE International Symposium on Performance Analysis of Systems and Software.
[7] Babak Falsafi,et al. Reactive NUCA: near-optimal block placement and replication in distributed caches , 2009, ISCA '09.
[8] Babak Falsafi,et al. Shore-MT: a scalable storage manager for the multicore era , 2009, EDBT '09.
[9] Pierre Michaud. Exploiting the cache capacity of a single-chip multi-core processor with execution migration , 2004, 10th International Symposium on High Performance Computer Architecture (HPCA'04).
[10] Srinivas Devadas,et al. DIRECTORYLESS SHARED MEMORY COHERENCE USING EXECUTION MIGRATION , 2011 .
[11] Gil Neiger,et al. Intel virtualization technology , 2005, Computer.
[12] Anastasia Ailamaki,et al. Improving instruction cache performance in OLTP , 2006, TODS.
[13] Harish Patil,et al. Pin: building customized program analysis tools with dynamic instrumentation , 2005, PLDI '05.
[14] Koushik Chakraborty,et al. Computation spreading: employing hardware migration to specialize CMP cores on-the-fly , 2006, ASPLOS XII.
[15] Babak Falsafi,et al. Clearing the clouds: a study of emerging scale-out workloads on modern hardware , 2012, ASPLOS XVII.
[16] Norman P. Jouppi,et al. Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers , 1990, [1990] Proceedings. The 17th Annual International Symposium on Computer Architecture.
[17] Srinivas Devadas,et al. Judicious Thread Migration When Accessing Distributed Shared Caches , 2012 .
[18] Aamer Jaleel,et al. Adaptive insertion policies for high performance caching , 2007, ISCA '07.
[19] Gu-Yeon Wei,et al. Thread motion: fine-grained power management for multi-core systems , 2009, ISCA '09.
[20] Thomas F. Wenisch,et al. Temporal instruction fetch streaming , 2008, 2008 41st IEEE/ACM International Symposium on Microarchitecture.
[21] Ippokratis Pandis,et al. Data-oriented transaction execution , 2010, Proc. VLDB Endow..
[22] Norman P. Jouppi,et al. CACTI 6.0: A Tool to Model Large Caches , 2009 .
[23] Aamer Jaleel,et al. High performance cache replacement using re-reference interval prediction (RRIP) , 2010, ISCA.
[24] Shih-Lien Lu,et al. Bloom filtering cache misses for accurate data speculation and prefetching , 2014, ICS 25th Anniversary.
[25] Alan Jay Smith,et al. Evaluating Associativity in CPU Caches , 1989, IEEE Trans. Computers.