Multicore SoCs Design Methods

Systems-on-chip designs have evolved from fairly simple uni-core, single memory designs to complex multicore SoCs consisting of tens or hundreds of cores in a single chip. As more and more cores are integrated into these chips to share the ever increasing processing load, the main challenges lie in how to efficiently and quickly integrate these cores together into a single system capable of leveraging their individual flexibility. Moreover, for better inter-core communication, the multicore system requires high performance communication architectures and efficient communication protocols, such as hierarchical bus (Diefendorff 1997; Liu 2005), point-to-point connection (Loghi 2004), Time Division Multiplexed Access (TDMA) based bus (Kulkarani 2002), or packet-switching networks (Ben-Abdallah 2006).

[1]  Masahiro Sowa,et al.  Basic Network-on-Chip Interconnection for Future Gigascale MCSoCs Applications: Communication and Computation Orthogonalization , 2006 .

[2]  Edwin Hsing-Mean Sha,et al.  Hardware/Software co-design with the HMS framework , 1996, J. VLSI Signal Process..

[3]  Arquimedes Canedo,et al.  Compiling for Reduced Bit-Width Queue Processors , 2010, J. Signal Process. Syst..

[4]  Giovanni De Micheli,et al.  Introduction to Introduction , 2002 .

[5]  Jörg Henkel,et al.  Hardware-software cosynthesis for microcontrollers , 1993, IEEE Design & Test of Computers.

[6]  Ansi Ieee,et al.  IEEE Standard for Binary Floating Point Arithmetic , 1985 .

[7]  Andrew D. Booth,et al.  A SIGNED BINARY MULTIPLICATION TECHNIQUE , 1951 .

[8]  Kathryn M. O'Brien,et al.  Supporting OpenMP on cell , 2008 .

[9]  Arquimedes Canedo,et al.  Design and implementation of a queue compiler , 2009, Microprocess. Microsystems.

[10]  Arquimedes Canedo,et al.  Efficient compilation for queue size constrained queue processors , 2009, Parallel Comput..

[11]  David Stevenson A Proposed Standard for Binary Floating-Point Arithmetic , 1981 .

[12]  T. Yoshinaga,et al.  Queue processor architecture for novel queue computing paradigm based on produced order scheme , 2004, Proceedings. Seventh International Conference on High Performance Computing and Grid in Asia Pacific Region, 2004..

[13]  Jürgen Becker,et al.  Architecture and application of a dynamically reconfigurable hardware array for future mobile communication systems , 2000, Proceedings 2000 IEEE Symposium on Field-Programmable Custom Computing Machines (Cat. No.PR00871).

[14]  Nikil D. Dutt,et al.  Constraint-driven bus matrix synthesis for MPSoC , 2006, Asia and South Pacific Conference on Design Automation, 2006..

[15]  Alice C. Parker,et al.  SOS: Synthesis of application-specific heterogeneous multiprocessor systems , 2001, J. Parallel Distributed Comput..

[16]  Jürgen Becker,et al.  Runtime adaptive multi-processor system-on-chip: RAMPSoC , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[17]  Tsutomu Yoshinaga,et al.  Modular Design Structure and High-Level Prototyping for Novel Embedded Processor Core , 2005, EUC.

[18]  Patrick Schaumont,et al.  Standards for system-level design: practical reality or solution in search of a question? , 2000, Proceedings Design, Automation and Test in Europe Conference and Exhibition 2000 (Cat. No. PR00537).

[19]  Jürgen Becker,et al.  Multiprocessor System-on-Chip - Hardware Design and Tool Integration , 2011, Multiprocessor System-on-Chip.

[20]  Niraj K. Jha,et al.  COFTA: hardware-software co-synthesis of heterogeneous distributed embedded system architectures for low overhead fault tolerance , 1997, Proceedings of IEEE 27th International Symposium on Fault Tolerant Computing.

[21]  Tsutomu Yoshinaga,et al.  The QC-2 parallel Queue processor architecture , 2008, J. Parallel Distributed Comput..

[22]  Arquimedes Canedo,et al.  Natural instruction level parallelism-aware compiler for high-performance QueueCore processor architecture , 2011, The Journal of Supercomputing.

[23]  Luca Benini,et al.  Analyzing on-chip communication in a MPSoC environment , 2004, Proceedings Design, Automation and Test in Europe Conference and Exhibition.

[24]  Wei Tsang Ooi,et al.  Workload characterization and cost-quality tradeoffs in MPEG-4 decoding on resource-constrained devices , 2005, 3rd Workshop on Embedded Systems for Real-Time Multimedia, 2005..

[25]  Pradeep K. Dubey,et al.  How Multimedia Workloads Will Change Processor Design , 1997, Computer.

[26]  Arquimedes Canedo,et al.  Compiler Support for Code Size Reduction Using a Queue-Based Processor , 2009, Trans. High Perform. Embed. Archit. Compil..