Exploiting loop-level parallelism on multi-core architectures for the wimax physical layer

Multiprocessor system-on-chip (MPSoC) architectures are increasingly being adopted in the design of emerging complex embedded systems. This paper introduces a new application mapping technique for MPSoC architectures using a case study. The technique incorporates profiling-driven task partitioning, task transformations, loop level partitioning and memory architecture aware data mapping to reduce system execution time. Experiments are conducted to evaluate the technique by implementing the WiMAX physical layer on several multi-core architectures based on newly emerging dynamically reconfigurable processor cores. The results demonstrate that the proposed technique is able to generate high-quality mappings of realistic applications on the target architecture, achieving up to 4.92x speedup by employing only five dynamically reconfigurable processor cores.

[1]  Pierre G. Paulin Automatic mapping of parallel applications onto multi-processor platforms: a multimedia application , 2004, Euromicro Symposium on Digital System Design, 2004. DSD 2004..

[2]  T. N. Vijaykumar,et al.  Reducing register ports for higher speed and lower energy , 2002, MICRO.

[3]  L. Carro,et al.  Time and energy efficient mapping of embedded applications onto NoCs , 2005, Proceedings of the ASP-DAC 2005. Asia and South Pacific Design Automation Conference, 2005..

[4]  Karam S. Chatha,et al.  An ILP Formulation for System-Level Application Mapping on Network Processor Architectures , 2007, 2007 Design, Automation & Test in Europe Conference & Exhibition.

[5]  Victor V. Zyuban,et al.  The energy complexity of register files , 1998, Proceedings. 1998 International Symposium on Low Power Electronics and Design (IEEE Cat. No.98TH8379).

[6]  P. G. Paulin Automatic mapping of parallel applications onto multi-processor platforms: a multimedia application , 2004 .

[7]  Victor V. Zyuban,et al.  Inherently Lower-Power High-Performance Superscalar Architectures , 2001, IEEE Trans. Computers.

[8]  Mateo Valero,et al.  Multiple-banked register file architectures , 2000, Proceedings of 27th International Symposium on Computer Architecture (IEEE Cat. No.RS00201).

[9]  Tughrul Arslan,et al.  Multi-core Architectures with Dynamically Reconfigurable Array Processors for the WiMAX Physical Layer , 2008, 2008 Symposium on Application Specific Processors.

[10]  Rainer Leupers,et al.  A modular simulation framework for spatial and temporal task mapping onto multi-processor SoC platforms , 2005, Design, Automation and Test in Europe.

[11]  Tughrul Arslan,et al.  The Reconfigurable Instruction Cell Array , 2008, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.