sha/asm/keccak1600-ppc64.pl: Load data in 8 byte chunks on little endian
authorAnton Blanchard <anton@linux.ibm.com>
Tue, 12 Mar 2019 05:03:56 +0000 (16:03 +1100)
committerTomas Mraz <tomas@openssl.org>
Thu, 11 Nov 2021 09:58:46 +0000 (10:58 +0100)
commit8f9842fd03945d9484dcc9e5fab37dce7caa5f50
tree6865548452212e47164516e4fd110ad46bda9384
parent90c311315c15a4fea895fd317d9c8fe801ba04a0
sha/asm/keccak1600-ppc64.pl: Load data in 8 byte chunks on little endian

We currently load data byte by byte in order to byteswap it on big
endian. On little endian we can just do 8 byte loads.

A SHAKE128 benchmark runs 10% faster on POWER9 with this patch applied.

Reviewed-by: Paul Dale <pauli@openssl.org>
Reviewed-by: Tomas Mraz <tomas@openssl.org>
(Merged from https://github.com/openssl/openssl/pull/8455)
crypto/sha/asm/keccak1600-ppc64.pl