crypto: arm64/chacha - use combined SIMD/ALU routine for more speed
authorArd Biesheuvel <ard.biesheuvel@linaro.org>
Tue, 4 Dec 2018 13:13:33 +0000 (14:13 +0100)
committerHerbert Xu <herbert@gondor.apana.org.au>
Thu, 13 Dec 2018 10:24:55 +0000 (18:24 +0800)
commit2fe55987b2624a86a5c709a8df65d4de2608dc07
treeaa1e6ba2142d05e74cdfabf2b581ca57a381b7f4
parentf2ca1cbd0fb584b5b5e0dbd9bda819f49cf9cdb6
crypto: arm64/chacha - use combined SIMD/ALU routine for more speed

To some degree, most known AArch64 micro-architectures appear to be
able to issue ALU instructions in parellel to SIMD instructions
without affecting the SIMD throughput. This means we can use the ALU
to process a fifth ChaCha block while the SIMD is processing four
blocks in parallel.

Signed-off-by: Ard Biesheuvel <ard.biesheuvel@linaro.org>
Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au>
arch/arm64/crypto/chacha-neon-core.S
arch/arm64/crypto/chacha-neon-glue.c