CoolSim: Eliminating traditional cache warming with fast, virtualized profiling

Sampling (e.g., SMARTS and SimPoint) improves simulation performance by an order of magnitude or more through the reduction of large workloads into a small but representative sample. Virtualized fast-forwarding (e.g., FSA) speeds up simulation further by advancing execution at near-native speed between simulation points, making cache warming the critical limiting factor for simulation performance. CoolSim is an efficient simulation framework that eliminates cache warming. It collects sparse memory reuse information (MRI) while advancing between simulation points using virtualized fast-forwarding. During detailed simulation, a statistical cache model uses the previously acquired MRI to estimate the performance of the caches. CoolSim builds upon KVM and gem5 and runs 19x faster than the state-of-the-art sampled simulation. It estimates the CPI of the SPEC CPU2006 benchmarks with 3.62% error on average, across a wide range of cache sizes.

[1]  David Eklov,et al.  Extending statistical cache models to support detailed pipeline simulators , 2014, 2014 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS).

[2]  Nikos Nikoleris,et al.  Implementing Fast , Virtualized Profiling to Eliminate Cache Warming , 2016 .

[3]  Nikos Nikoleris,et al.  Full Speed Ahead: Detailed Architectural Simulation at Near-Native Speed , 2015, 2015 IEEE International Symposium on Workload Characterization.