[Arm64, S390x] Optimize ChaCha20