Evaluating Celerity: A 16-nm 695 Giga-RISC-V Instructions/s Manycore Processor With Synthesizable PLL

This letter presents a 16-nm 496-core RISC-V network-on-chip (NoC). The mesh achieves 1.4 GHz at 0.98 V, yielding a peak throughput of 695 Giga RISC-V instructions/s (GRVIS), a peak energy efficiency of 314.89 GRVIS/W, and a record 825 320 CoreMark benchmark score. Unlike previously reported <xref ref-type="bibr" rid="ref1">[1]</xref>, this new score was obtained without modifying the core benchmark code. The main feature is the NoC architecture, which uses only <inline-formula> <tex-math notation="LaTeX">$1881~\mu \text{m}^{2}$ </tex-math></inline-formula> per router node, enables highly scalable and dense compute, and provides up to 361 Tb/s of aggregate bandwidth.

[1]  Ching-Che Chung,et al.  A portable digitally controlled oscillator using novel varactors , 2005, IEEE Trans. Circuits Syst. II Express Briefs.

[2]  David Wentzlaff,et al.  Power and Energy Characterization of an Open Source 25-Core Manycore Processor , 2018, 2018 IEEE International Symposium on High Performance Computer Architecture (HPCA).

[3]  Henry Hoffmann,et al.  Remote Store Programming , 2010, HiPEAC.

[4]  William Thies,et al.  StreamIt: A Language for Streaming Applications , 2002, CC.

[5]  M. Taylor,et al.  Experiences Using the RISC-V Ecosystem to Design an Accelerator-Centric SoC in TSMC 16nm , 2017 .

[6]  Yunsup Lee,et al.  A 45nm 1.3GHz 16.7 double-precision GFLOPS/W RISC-V processor with vector accelerators , 2014, ESSCIRC 2014 - 40th European Solid State Circuits Conference (ESSCIRC).

[7]  Visvesh Sathe,et al.  A 1–2 GHz Computational-Locking ADPLL With Sub-20-Cycle Locktime Across PVT Variation , 2019, IEEE Journal of Solid-State Circuits.

[8]  M. A. Copeland,et al.  An oversampling delta-sigma frequency discriminator , 1994 .

[9]  Bin Liu,et al.  KiloCore: A 32-nm 1000-Processor Computational Array , 2017, IEEE Journal of Solid-State Circuits.

[10]  Christoforos E. Kozyrakis,et al.  SCD: A scalable coherence directory with flexible sharer set encoding , 2012, IEEE International Symposium on High-Performance Comp Architecture.

[11]  Moein Khazraee,et al.  Specializing a Planet's Computation: ASIC Clouds , 2017, IEEE Micro.

[12]  Christopher Torng,et al.  The Celerity Open-Source 511-Core RISC-V Tiered Accelerator Fabric: Fast Architectures and Design Methodologies for Fast Chips , 2018, IEEE Micro.

[13]  David Wentzlaff,et al.  Processor: A 64-Core SoC with Mesh Interconnect , 2010 .

[14]  Christopher Torng,et al.  A 1.4 GHz 695 Giga Risc-V Inst/s 496-Core Manycore Processor With Mesh On-Chip Network and an All-Digital Synthesized PLL in 16nm CMOS , 2019, 2019 Symposium on VLSI Circuits.