Design Space Exploration Using Cycle Accurate Simulator

Multi-Processor System on Chip (MPSoC) architectures have become a mainstream technology for obtaining performance improvements in computing platforms. With the increase in the number of cores, the role of cache memory has become pivotal. An ideal memory configuration is always desired to be fast and large; but, in fact, striking to balance between the size and access time of the memory hierarchy is considered by processor architect. Design space exploration is used for performance analysis of systems and helps to find the optimal solution for obtaining the desired objectives. In this chapter, we explore two design space parameters, i.e., cache size and number of cores, for obtaining the desired energy consumption. Moreover, previously presented energy models for multilevel cache are evaluated by using cycle accurate full system simulator. Our results show that with the increase in cache sizes, the number of cycles required for application execution decreases, and by increasing number of cores, the throughput improve.

[1]  Hellen Adams,et al.  Patent and Trademark Office , 2017 .

[2]  Jung Ho Ahn,et al.  McPAT: An integrated power, area, and timing modeling framework for multicore and manycore architectures , 2009, 2009 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).

[3]  Fredrik Larsson,et al.  Simics: A Full System Simulation Platform , 2002, Computer.

[4]  Anoop Gupta,et al.  The SPLASH-2 programs: characterization and methodological considerations , 1995, ISCA.

[5]  William H. Mangione-Smith,et al.  The filter cache: an energy efficient memory structure , 1997, Proceedings of 30th Annual International Symposium on Microarchitecture.

[6]  Alvin M. Despain,et al.  Cache design trade-offs for power and performance optimization: a case study , 1995, ISLPED '95.

[7]  D. Geer,et al.  Chip makers turn to multicore processors , 2005, Computer.

[8]  Laurence Pierre,et al.  Runtime Verification of Typical Requirements for a Space Critical SoC Platform , 2011, FMICS.

[9]  Kanad Ghose,et al.  Analytical energy dissipation models for low-power caches , 1997, ISLPED '97.

[10]  J. L. Hennessy,et al.  A parallel adaptive fast multipole method , 1993, Supercomputing '93.

[11]  Alan Jay Smith,et al.  Cache Memories , 1982, CSUR.

[12]  Alice Wang,et al.  Adaptive Techniques for Dynamic Processor Optimization , 2008 .

[13]  Robert I. Davis,et al.  Explicit Reservation of Local Memory in a Predictable, Preemptive Multitasking Real-Time System , 2012, 2012 IEEE 18th Real Time and Embedded Technology and Applications Symposium.

[14]  Klaus D. McDonald-Maier,et al.  Analytical Evaluation of Energy and Throughput for Multilevel Caches , 2010, 2010 12th International Conference on Computer Modelling and Simulation.

[15]  Antonio Gonzalez,et al.  A data cache with multiple caching strategies tuned to different types of locality , 1995, International Conference on Supercomputing.

[16]  Matt T. Yourst PTLsim: A Cycle Accurate Full System x86-64 Microarchitectural Simulator , 2007, 2007 IEEE International Symposium on Performance Analysis of Systems & Software.

[17]  Jindong Tan,et al.  RT-ROS: A real-time ROS architecture on multi-core processors , 2016, Future Gener. Comput. Syst..

[18]  Jörg Henkel,et al.  A framework for estimation and minimizing energy dissipation of embedded HW/SW systems , 1998, DAC.

[19]  Klaus D. McDonald-Maier,et al.  Data Cache-Energy and Throughput Models: Design Exploration for Embedded Processors , 2009, EURASIP J. Embed. Syst..