Evaluating the Performance of Parallel Linear Algebra Libraries for Level-1 BLAS

We present a performance evaluation of the scalar-vector product (axpy) operation on four widespread linear algebra libraries. Benchmarks are performed for multi-cores and many-cores architectures and the results are compared.