Performance Evaluation of Cauchy Reed-Solomon Coding on Multicore Systems

We have evaluated the performance of Cauchy Reed-Solomon (CRS) encoding of data blocks with sizes 32 kB to 256 MB. The performance measurements are done on an Intel processor with 4 cores and integrated graphics support. We also used an AMD graphics card in our performance evaluations. Three versions of the CRS algorithm are developed: one sequential version and two OpenCL versions. The OpenCL versions have been targeted to the CPU, the integrated GPU and the AMD graphics card. The measurements show that the graphics card performs better than CPU for large buffers. However, the highest throughput is obtained for one of the CPU versions and moderate buffer sizes (around 1 MB).

[1]  Manish Vachharajani,et al.  GPU acceleration of numerical weather prediction , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[2]  Anthony Skjellum,et al.  Accelerating Reed-Solomon coding in RAID systems with GPUs , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[3]  S.A. Manavski,et al.  CUDA Compatible GPU as an Efficient Hardware Accelerator for AES Cryptography , 2007, 2007 IEEE International Conference on Signal Processing and Communications.

[4]  J. Xu OpenCL – The Open Standard for Parallel Programming of Heterogeneous Systems , 2009 .

[5]  Peter Sobe Parallel Reed/Solomon Coding on Multicore Processors , 2010, 2010 International Workshop on Storage Network Architecture and Parallel I/Os.

[6]  Marek Karpinski,et al.  An XOR-based erasure-resilient coding scheme , 1995 .

[7]  Lihao Xu,et al.  Optimizing Cauchy Reed-Solomon Codes for Fault-Tolerant Network Storage Applications , 2006, Fifth IEEE International Symposium on Network Computing and Applications (NCA'06).

[8]  Peter Sobe Parallel coding for storage systems — An OpenMP and OpenCL capable framework , 2012, ARCS 2012.

[9]  Osvaldo Gervasi,et al.  The AES Implantation Based on OpenCL for Multi/many Core Architecture , 2010, 2010 International Conference on Computational Science and Its Applications.

[10]  Luciana Porcher Nedel,et al.  GPU Accelerated Path-Planning for Multi-agents in Virtual Environments , 2009, 2009 VIII Brazilian Symposium on Games and Digital Entertainment.

[11]  Roman Wyrzykowski,et al.  Towards Efficient Execution of Erasure Codes on Multicore Architectures , 2010, PARA.

[12]  John Kubiatowicz,et al.  Erasure Coding Vs. Replication: A Quantitative Comparison , 2002, IPTPS.