An Automatic Design Flow for Data Parallel and Pipelined Signal Processing Applications on Embedded Multiprocessor with NoC: Application to Cryptography

Embedded system design is increasingly based on single chip multiprocessors because of the high performance and flexibility requirements. Embedded multiprocessors on FPGA provide the additional flexibility by allowing customization through addition of hardware accelerators on FPGA when parallel software implementation does not provide the expected performance. And the overall multiprocessor architecture is still kept for additional applications. This provides a transition to software only parallel implementation while avoiding pure hardware implementation. An automatic design flow is proposed well suited for data flow signal processing exhibiting both pipelining and data parallel mode of execution. Fork-Join model-based software parallelization is explored to find out the best parallelization configuration. C-based synthesis coprocessor is added to improve performance with more hardware resource usage. The Triple Data Encryption Standard (TDES) cryptographic algorithm on a 48-PE single-chip distributed memory multiprocessor is selected as an application example of the flow.

[1]  Vikram Pasham,et al.  High-Speed DES and Triple DES Encryptor/Decryptor , 2001 .

[2]  Simon Heron,et al.  Encryption: Advanced Encryption Standard (AES) , 2009 .

[3]  Toshihiro Hattori,et al.  SH-X3 xible SuperH multi-core for high-performance and low-power embedded systems , 2007, 2007 IEEE Hot Chips 19 Symposium (HCS).

[4]  Luca Benini,et al.  Networks on chips - technology and tools , 2006, The Morgan Kaufmann series in systems on silicon.

[5]  Odysseas G. Koufopavlou,et al.  VLSI implementations of the triple-DES block cipher , 2003, 10th IEEE International Conference on Electronics, Circuits and Systems, 2003. ICECS 2003. Proceedings of the 2003.

[6]  Jun Shirako,et al.  An 8640 MIPS SoC with Independent Power-Off Control of 8 CPUs and 8 RAMs by An Automatic Parallelizing Compiler , 2008, 2008 IEEE International Solid-State Circuits Conference - Digest of Technical Papers.

[7]  Dominique Houzet,et al.  A Case Study: Quantitative Evaluation of C-Based High-Level Synthesis Systems , 2008, EURASIP J. Embed. Syst..

[8]  Luca Benini,et al.  On-Chip Communication Architectures: System on Chip Interconnect , 2008 .

[9]  Omar Hammami,et al.  C-based hardware-accelerator coprocessing for SOC an quantitative area-performance evaluation , 2008, 2008 15th IEEE International Conference on Electronics, Circuits and Systems.

[10]  David A. Patterson,et al.  Computer Architecture, Fifth Edition: A Quantitative Approach , 2011 .

[11]  David A. Patterson,et al.  Computer Architecture: A Quantitative Approach , 1969 .

[12]  Lars R. Knudsen,et al.  Advanced Encryption Standard (AES) - An Update , 1999, IMACC.

[13]  Hugh Garraway Parallel Computer Architecture: A Hardware/Software Approach , 1999, IEEE Concurrency.

[14]  Omar Hammami,et al.  MOCDEX: Multiprocessor on Chip Multiobjective Design Space Exploration with Direct Execution , 2006, EURASIP J. Embed. Syst..

[15]  Christian Haubelt,et al.  SystemCoDesigner: Automatic design space exploration and rapid prototyping from behavioral models , 2008, 2008 45th ACM/IEEE Design Automation Conference.

[16]  Ed F. Deprettere,et al.  Daedalus: Toward composable multimedia MP-SoC design , 2008, 2008 45th ACM/IEEE Design Automation Conference.

[17]  Alasdair McAndrew Data Encryption Standard (DES) for Sage , 2009 .

[18]  Lionel Lacassagne,et al.  System-Platforms-Based SystemC TLM Design of Image Processing Chains for Embedded Applications , 2007, EURASIP J. Embed. Syst..

[19]  Kiyoung Choi,et al.  SoCDAL: System-on-chip design AcceLerator , 2008, TODE.

[20]  Morris J. Dworkin,et al.  Recommendation for Block Cipher Modes of Operation: Methods and Techniques , 2001 .

[21]  Wayne H. Wolf,et al.  Multiprocessor Systems-on-Chips , 2004, ISVLSI.