Feasibility tests of RoCE v2 for LHCb event building

This paper evaluates the utilization of Remote Direct Memory Access (RDMA) over Converged Ethernet (RoCE) for the Run 3 LHCb event building at CERN. The acquisition system of the detector will collect partial data from approximately 1000 separate detector streams. The total estimated throughput equals 32 Terabits per second. Full events will be assembled for subsequent processing and data selection in the filtering farm of the online trigger. High-throughput transmissions with up to 90% links utilization will be an essential feature of the system. The data exchange mechanism must support zero-copy transmissions. In this work, the RoCE high-throughput kernel bypass Ethernet protocol is benchmarked as a potential alternative to InfiniBand. A RoCE-based event building network is presented and two implementations are considered. The former variant combined shallow-buffered and deep-buffered switches with enabled flow control. In the latter setup, only deep-buffered devices are used, where operation relied on their memory throughput and capacity. Feasibility tests were conducted with selected Ethernet switches. Memory bandwidth utilization was investigated, in comparison with InfiniBand. Relevant utilization and interoperability issues of RoCE flow control are detailed with lessons learned along the road.

[1]  Sébastien Valat,et al.  An Evaluation of 100-Gb/s LAN Networks for the LHCb DAQ Upgrade , 2017, IEEE Transactions on Nuclear Science.

[2]  A. Piucci The LHCb Upgrade , 2017 .

[3]  Sébastien Valat,et al.  The LHCb DAQ Upgrade for LHC Run3 , 2019, IEEE Transactions on Nuclear Science.