Energy Efficient And Low Latency Interconnection Network For Multicast Invalidates In Shared Memory Systems

Optical network-on-chip (NoC) are being investigated to reduce the latency and power consumption of networks for multicore processors. Our previous work has shown that switched optical networks can achieve lower latency for a given power consumption and component count in shared memory processors compared with arbitration-free networks such as single writer multiple reader. We have also shown the advantage of leaving optical circuits open after being generated to capture multiple memory transactions. However invalidation processes, where numerous cores are sharing a memory block, need to establish a large number of very short lived circuits and this increases the average message latency and overall on-chip contention. In this paper, a low power broadcast architecture is proposed which deals specifically with multicast messages. Separating multicast messages from unicast ones shows an improvement in average arbitration latency of up to 88.2% for the Vips benchmark while the Swaptions benchmark shows the highest improvement in average memory access time (up to 21.1%). Vips also sees an increase of 147% in the average number of messages passing through an open optical circuit. Obtaining these advantages requires an additional broadcast network which consumes only 66.1mW power.

[1]  Philip M. Watts,et al.  Low Latency Scheduling Algorithm for Shared Memory Communications over Optical Networks , 2013, 2013 IEEE 21st Annual Symposium on High-Performance Interconnects.

[2]  Yuliya Semenova,et al.  Design of an extra-low-loss broadband Y-branch waveguide splitter based on a tapered MMI structure , 2011 .

[3]  Hui Chen,et al.  Cascaded Microresonator-Based Matrix Switch for Silicon On-Chip Optical Interconnection , 2009, Proceedings of the IEEE.

[4]  Timothy M. Jones,et al.  Towards zero latency photonic switching in shared memory networks , 2014, Concurr. Comput. Pract. Exp..

[5]  Luca P. Carloni,et al.  Physical-Layer Modeling and System-Level Design of Chip-Scale Photonic Interconnection Networks , 2011, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[6]  R. Ho,et al.  Ultra-low power arrayed CMOS silicon photonic transceivers for an 80 Gbps WDM optical link , 2011, 2011 Optical Fiber Communication Conference and Exposition and the National Fiber Optic Engineers Conference.

[7]  Somayeh Sardashti,et al.  The gem5 simulator , 2011, CARN.

[8]  Marco Fiorentino,et al.  A ring-resonator-based silicon photonics transceiver with bias-based wavelength stabilization and adaptive-power-sensitivity receiver , 2013, 2013 IEEE International Solid-State Circuits Conference Digest of Technical Papers.

[9]  Yu Zhang,et al.  Firefly: illuminating future network-on-chip with nanophotonics , 2009, ISCA '09.

[10]  Ashok V. Krishnamoorthy,et al.  Silicon-photonic network architectures for scalable, power-efficient multi-chip systems , 2010, ISCA '10.

[11]  Kai Li,et al.  The PARSEC benchmark suite: Characterization and architectural implications , 2008, 2008 International Conference on Parallel Architectures and Compilation Techniques (PACT).

[12]  Jung Ho Ahn,et al.  Corona: System Implications of Emerging Nanophotonic Technology , 2008, 2008 International Symposium on Computer Architecture.

[13]  Avinash Karanth Kodi,et al.  Extending the Performance and Energy-Efficiency of Shared Memory Multicores with Nanophotonic Technology , 2014, IEEE Transactions on Parallel and Distributed Systems.

[14]  Yuanyuan Yang,et al.  Energy-aware routing in hybrid optical network-on-chip for future Multi-Processor System-on-Chip , 2010, 2010 ACM/IEEE Symposium on Architectures for Networking and Communications Systems (ANCS).

[15]  A. Van Laer,et al.  Full system simulation of optically interconnected chip multiprocessors using gem5 , 2013, 2013 Optical Fiber Communication Conference and Exposition and the National Fiber Optic Engineers Conference (OFC/NFOEC).

[16]  David A. Patterson,et al.  Computer Architecture, Fifth Edition: A Quantitative Approach , 2011 .

[17]  Ian O'Connor,et al.  Chameleon: Channel efficient Optical Network-on-Chip , 2014, 2014 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[18]  George Kurian,et al.  ATAC: A 1000-core cache-coherent processor with on-chip optical network , 2010, 2010 19th International Conference on Parallel Architectures and Compilation Techniques (PACT).

[19]  Timothy M. Jones,et al.  Coherence based message prediction for optically interconnected chip multiprocessors , 2015, 2015 Design, Automation & Test in Europe Conference & Exhibition (DATE).