An Asymmetric, Energy Efficient One-to-Many Traffic-Aware Wireless Network-in-Package Interconnection Architecture for Multichip Systems

High Performance Computing (HPC) platforms like blade servers consist of multiple processor chips which may be multicore CPUs, GPUs, memory modules and other subsystems. These high performance and memory intensive multichip systems require efficient support for one-to-many traffic patterns which originates from cache coherency, system-level synchronization mechanisms and other control signals. Small portions of such traffic can introduce congestion which significantly reduce overall performance and cause energy bottleneck unless low latency transmission is ensured by a one-to-many traffic aware interconnection architecture. Traditional metal based Network-on-Chip (NoC) interconnection architecture is not suitable for such traffic as it provides high-latency, power hungry multi-hop paths. To address this issue, we propose the design of a one-to-many traffic-aware Wireless Network-in-Package (WiNiP) architecture by introducing a novel asymmetric wireless interconnection topology and flow control. The proposed asymmetric topology provides low latency communication for one-to-many traffic and increase system bandwidth with lower energy consumption. Through cycle accurate simulator we show that the proposed topology reduce energy by 55.92% and outperforms other interconnection architecture for synthetic as well as application specific traffics.

[1]  Kaushik Roy,et al.  Process Variations and Process-Tolerant Design , 2007, 20th International Conference on VLSI Design held jointly with 6th International Conference on Embedded Systems (VLSID'07).

[2]  Eduard Alarcón,et al.  Scalability of Broadcast Performance in Wireless Network-on-Chip , 2016, IEEE Transactions on Parallel and Distributed Systems.

[3]  Natalie D. Enright Jerger,et al.  Interconnect-Memory Challenges for Multi-chip, Silicon Interposer Systems , 2015, MEMSYS.

[4]  Joonyoung Kim,et al.  HBM: Memory solution for bandwidth-hungry processors , 2014, 2014 IEEE Hot Chips 26 Symposium (HCS).

[5]  Partha Pratim Pande,et al.  Performance evaluation and design trade-offs for network-on-chip interconnect architectures , 2005, IEEE Transactions on Computers.

[6]  Arvind Kumar,et al.  Three-dimensional integrated circuits , 2006, IBM J. Res. Dev..

[7]  Hong Wang,et al.  Programming Spiking Neural Networks on Intel’s Loihi , 2018, Computer.

[8]  Amlan Ganguly,et al.  A Wireless Interconnection Framework for Seamless Inter and Intra-Chip Communication in Multichip Systems , 2017, IEEE Transactions on Computers.

[9]  Natalie D. Enright Jerger,et al.  Achieving predictable performance through better memory controller placement in many-core CMPs , 2009, ISCA '09.

[10]  Amlan Ganguly,et al.  Energy-efficient wireless interconnection framework for multichip systems with in-package memory stacks , 2017, 2017 30th IEEE International System-on-Chip Conference (SOCC).

[11]  Shahriar Mirabbasi,et al.  An 18.7-Gb/s 60-GHz OOK Demodulator in 65-nm CMOS for Wireless Network-on-Chip , 2015, IEEE Transactions on Circuits and Systems I: Regular Papers.

[12]  Amlan Ganguly,et al.  The Advances, Challenges and Future Possibilities of Millimeter-Wave Chip-to-Chip Interconnections for Multi-Chip Systems , 2018 .

[13]  Shahriar Mirabbasi,et al.  A 1.2-pJ/bit 16-Gb/s 60-GHz OOK Transmitter in 65-nm CMOS for Wireless Network-On-Chip , 2014, IEEE Transactions on Microwave Theory and Techniques.

[14]  Natalie D. Enright Jerger,et al.  NoC Architectures for Silicon Interposer Systems: Why Pay for more Wires when you Can Get them (from your interposer) for Free? , 2014, 2014 47th Annual IEEE/ACM International Symposium on Microarchitecture.

[15]  Mauricio Hanzich,et al.  Broadcast-Enabled Massive Multicore Architectures: A Wireless RF Approach , 2015, IEEE Micro.

[16]  Terrence Mak,et al.  A Survey of Emerging Interconnects for On-Chip Efficient Multicast and Broadcast in Many-Cores , 2016, IEEE Circuits and Systems Magazine.

[17]  Yu Su,et al.  Communication Using Antennas Fabricated in Silicon Integrated Circuits , 2007, IEEE Journal of Solid-State Circuits.

[18]  Yi Wang,et al.  SD-MAC: Design and Synthesis of a Hardware-Efficient Collision-Free QoS-Aware MAC Protocol for Wireless Network-on-Chip , 2008, IEEE Transactions on Computers.

[19]  Amlan Ganguly,et al.  Energy-efficiency in interconnection fabrics for inter and intra-chip communication using Graphene-based THz-band antennas , 2017, 2017 Eighth International Green and Sustainable Computing Conference (IGSC).

[20]  Partha Pratim Pande,et al.  Multicast-Aware High-Performance Wireless Network-on-Chip Architectures , 2017, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[21]  James E. Jaussi,et al.  A Scalable 5–15 Gbps, 14–75 mW Low-Power I/O Transceiver in 65 nm CMOS , 2008, IEEE Journal of Solid-State Circuits.

[22]  Mario Badr,et al.  SynFull: Synthetic traffic models capturing cache coherent behaviour , 2014, 2014 ACM/IEEE 41st International Symposium on Computer Architecture (ISCA).

[23]  Amlan Ganguly,et al.  Increasing interposer utilization: A scalable, energy efficient and high bandwidth multicore-multichip integration solution , 2017, 2017 Eighth International Green and Sustainable Computing Conference (IGSC).

[24]  David W. Matolak,et al.  A New Frontier in Ultralow Power Wireless Links: Network-on-Chip and Chip-to-Chip Interconnects , 2015, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[25]  J. He,et al.  Advances and Challenges in Flip-Chip Packaging , 2006, IEEE Custom Integrated Circuits Conference 2006.