chacha/asm/chacha-x86_64.pl: add AVX512 path optimized for shorter inputs.