Performance estimation of streaming applications for hierarchical MPSoCs

Parallel programming and effective partitioning of applications for embedded many-core architectures require optimization algorithms. However, these algorithms have to evaluate thousands of different partitions quickly. We present a fast performance estimator embedded in a parallelizing compiler for streaming applications. The estimator combines a single execution-based simulation with an analytic approach. Experimental results demonstrate that the estimator has a mean error of 2.6% and computes its estimate 2848 times faster than a cycle-accurate simulator.
