Throw in AES CBC assembler, up to +40% on aes-128-cbc benchmark.