Hello Michael,
On Mon, Jan 25, 2021 at 8:45 PM Michael Weiser michael.weiser@gmx.de wrote:
Attached are the current patches, the first being your original. What do you think?
I liked how the patch ended up so far, just give me one or two days to give the patch additional review before letting it up to Neils.
As said, I'm up for looking into endianness-specific versions of the macros again. But what was supposed to be the LE versions of PMUL and friends has now become the BE-native versions and we'd need to come up with variants of them that make the rev64s unneccessary. Any ideas?
Are you looking for removing rev64s on LE? If so, I don't think we can figure a variant that allows us continue working on an unsorted register value on LE as pmull requires the input to be sorted properly, that is transposed doublewords.
regards, Mamone