GPU Acceleration of Dense Matrix and Block Operations for Lanczos Method for Systems over Large Prime Finite Field

GPU based acceleration of computations with dense matrices and blocks over large prime finite field are studied. Particular attention is paid to the following algorithms: multiplication of rectangular \(N \times K\) blocks with \(N \gg K;\) multiplication of \(N \times K\) blocks by square \(K \times K\) matrices; LU-decomposition of matrices.