Efficient decoding of polar codes with some 16$\times$16 kernels