Dual-Layer Locality-Aware Optical Interconnection Architecture for Latency-Critical Resource Disaggregation Environments

Significant research efforts, both industrial and academic, have been committed in the direction of Rack-scale computing through resource disaggregation, that aims to increase resource utilization at a reduced energy and cost envelope. However, the realization of resource disaggregation necessitates an underlying network infrastructure that can compete with a challenging set of requirements including low-latency performance and high-port count connectivity, as well as high data-rate operation. At the same time, it is crucial for the interconnection architecture to be able to accommodate efficient delivery of traffic with different locality characteristics. We propose a dual-layer locality-aware optical interconnection architecture for disaggregated Data Centers by combining the STREAMS silicon-based on-board communication paradigm with the disaggregation-oriented Hipoλaos high-port count switch. Simulation evaluation of a 256-node disaggregated system, comprising 32 optically-interconnected 8-socket boards, revealed up to 100% throughput and mean, p99 latencies not higher than 335 nsec and 610 nsec, respectively, when a 50:50 ratio between on- and off-board traffic is employed. Evaluation of the same layout with 75:25 on-/off-board traffic yields even lower mean and p99 latency at 210 ns and 553 ns, respectively.

[1]  Roger Dangel,et al.  Polymer Waveguides Enabling Scalable Low-Loss Adiabatic Optical Coupling for Silicon Photonics , 2018, IEEE Journal of Selected Topics in Quantum Electronics.

[2]  M. Moralis-Pegios,et al.  Sub-μs Latency High-Port Optical Packet Switch Fabrics for Disaggregated Computing: The Hipoλaos OPS Architecture , 2018, 2018 20th International Conference on Transparent Optical Networks (ICTON).

[3]  Roberto Proietti,et al.  Bit-parallel all-to-all and flexible AWGR-based optical interconnects , 2017, 2017 Optical Fiber Communications Conference and Exhibition (OFC).

[4]  Alex C. Snoeren,et al.  Inside the Social Network's (Datacenter) Network , 2015, Comput. Commun. Rev..

[5]  Xin Yin,et al.  52 km-Long Transmission Link Using a 50 Gb/s O-Band Silicon Microring Modulator Co-Packaged With a 1V-CMOS Driver , 2019, IEEE Photonics Journal.

[6]  Corey Gough,et al.  Energy Efficient Servers: Blueprints for Data Center Optimization , 2015 .

[7]  Randy H. Katz,et al.  Heterogeneity and dynamicity of clouds at scale: Google trace analysis , 2012, SoCC '12.

[8]  M. Moralis-Pegios,et al.  High-port and low-latency optical switches for disaggregated data centers: the Hipoλaos switch architecture , 2018, IEEE/OSA Journal of Optical Communications and Networking.

[9]  Nikos Pleros,et al.  WDM mid-board optics for chip-to-chip wavelength routing interconnects in the H2020 ICT-STREAMS , 2017, OPTO.

[10]  T. Aalto,et al.  Multicast-Enabling Optical Switch Design Employing Si Buffering and Routing Elements , 2018, IEEE Photonics Technology Letters.

[11]  N. Pleros,et al.  On-Chip SOI Delay Line Bank for Optical Buffers and Time Slot Interchangers , 2018, IEEE Photonics Technology Letters.

[12]  M. Moralis-Pegios,et al.  Multicasting in a High-Port Sub-μ sec Latency Hipo λ aos Optical Packet Switch , 2018 .

[13]  George T. Kanellos,et al.  Optics in Computing: From Photonic Network-on-Chip to Chip-to-Chip Interconnects and Disintegrated Architectures , 2019, Journal of Lightwave Technology.

[14]  Kostas Katrinis,et al.  dReDBox: Materializing a full-stack rack-scale system prototype of a next-generation disaggregated datacenter , 2018, 2018 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[15]  N. Pleros,et al.  40 Gb/s NRZ Wavelength Conversion Using a Differentially-Biased SOA-MZI: Theory and Experiment , 2011, Journal of Lightwave Technology.

[16]  Scott Shenker,et al.  Network Requirements for Resource Disaggregation , 2016, OSDI.

[17]  Nikos Pleros,et al.  A 1024-Port Optical Uni- and Multicast Packet Switch Fabric , 2019, Journal of Lightwave Technology.

[18]  Xin Yin,et al.  EOCB-Platform for Integrated Photonic Chips Direct-on-Board Assembly within Tb/s Applications , 2018, 2018 IEEE 68th Electronic Components and Technology Conference (ECTC).

[19]  Thomas Wicki,et al.  Bixby: The scalability and coherence directory ASIC in Oracle's highly scalable enterprise systems , 2013, IEEE Hot Chips Symposium.

[20]  Nikos Pleros,et al.  High-port low-latency optical switch architecture with optical feed-forward buffering for 256-node disaggregated data centers. , 2018, Optics express.

[21]  Georgios Zervas,et al.  Hardware programmable network function service chain on optical rack-scale data centers , 2017, 2017 Optical Fiber Communications Conference and Exhibition (OFC).

[22]  Emmanuel Varvarigos,et al.  Application-Oriented On-Board Optical Technologies for HPCs , 2017, Journal of Lightwave Technology.

[23]  Xin Yin,et al.  A 40 Gb/s Chip-to-Chip Interconnect for 8-Socket Direct Connectivity Using Integrated Photonics , 2018, IEEE Photonics Journal.

[24]  Franck Cappello,et al.  Characterizing Cloud Applications on a Google Data Center , 2013, 2013 42nd International Conference on Parallel Processing.

[25]  George Mourgias-Alexandris,et al.  Multicasting in a High-Port Sub- $\mu$ sec Latency Hipo $\lambda$ aos Optical Packet Switch , 2018 .

[26]  Albert G. Greenberg,et al.  The nature of data center traffic: measurements & analysis , 2009, IMC '09.

[27]  N. Jones How to stop data centres from gobbling up the world’s electricity , 2018, Nature.

[28]  F. Horst,et al.  Monolithically Integrated CMOS-Compatible III–V on Silicon Lasers , 2018, IEEE Journal of Selected Topics in Quantum Electronics.

[29]  Chin-Lin Chen,et al.  Arrayed‐Waveguide Gratings , 2005 .

[30]  Nikos Pleros,et al.  O-band Energy-efficient Broadcast-friendly Interconnection Scheme with SiPho Mach-Zehnder Modulator (MZM) & Arrayed Waveguide Grating Router (AWGR) , 2018, 2018 Optical Fiber Communications Conference and Exposition (OFC).

[31]  Christina Delimitrou,et al.  ECHO: Recreating network traffic maps for datacenters with tens of thousands of servers , 2012, 2012 IEEE International Symposium on Workload Characterization (IISWC).

[32]  Corey Gough,et al.  Energy Efficient Servers , 2015, Apress.