Originally posted by carewolf
View Post
Realistically, the A64FX is the only 512-bit implementation of which I'm aware. So, it's not currently very consequential to support no larger than 256-bit. Also, perhaps there's a point of diminishing returns that 256-bit already exceeds, though 512-bit is a typical cacheline size.
Comment