gcm128.c: tidy up, minor optimization, rearrange gcm128_context.