Originally posted by avis
View Post
Linus Torvalds Comes Out Against "Completely Broken" x86_64 Feature Levels
Collapse
X
-
-
-
Originally posted by blackshard View Post
CoC in this case does not apply at all. It's easy to see there is not a person that is behaving (the conduit) badly. It's just an opinion over a technical decision, that I share a lot too: it has no sense to introduce a segmentation at any level in the kernel to make things even more complex with other ifs or #ifdefs here and there.
Comment
-
-
- ...what did we learn today?
- Today we learned if you are obedient to your corporate masters and support their russo phobia or CIS phobia or palestinophobia or whatever, then it's technical debate and you don't have to be obedient to the COC.
- Great kids! And do we call this blatant hypocrisy or democracy and freedom of speech?
- Me, me teacher, I know me me ask me...
Comment
-
-
Originally posted by patrick1946 View PostActually v4 was useful until Intel could not get their little cores up to that level.
For instance, I point to ARM Neoverse V1 and V2. V1 implemented SVE @ 2x 256-bit. V2 implemented it @ 4x 128-bit. The benefit of implementing at 128-bit is better hardware utilization for all of that legacy NEON code. Yet, I'm sure they wouldn't have done it, if it compromised SVE performance/efficiency hardly at all.Last edited by coder; 05 December 2024, 02:38 PM.
Comment
-
-
Originally posted by patrick1946 View PostWhy should a new version a superset of the lower numbered? If you bring out a CPU which is not supporting that version you have to find an other. Mostly it will be older but it can be newer too. It is called deprecation.
I don't entirely reject the idea of ISA feature levels or having v5 be a superset of v3, while leaving v4 as an orphan, but it's just taking us down a road that could ultimately make ISA feature levels somewhat self-defeating.
Comment
-
-
Originally posted by Gonk View PostNo, any attempt to support AVX-512 was doomed before Alder Lake's e-cores came along.
I'd love it if Zen 6 turned out to be a solid green row.
BTW, how does Alder Lake support VP2INTERSECT but Sapphire Rapids doesn't? Not to mention Tiger Lake supporting it...
Comment
-
-
Originally posted by hardfalcon View PostEven the usefulness of v2 and v3 seems questionable. One example: Even though Skylake CPUs are supposed to support AVX and AVX2, since last summer, the Linux kernel blocks/disables AVX and AVX2 support for userland code on Skylake CPUs to mitigate the GDS/Downfall vulnerability: https://www.phoronix.com/review/downfall
Comment
-
-
Originally posted by coder View PostBTW, how does Alder Lake support VP2INTERSECT but Sapphire Rapids doesn't? Not to mention Tiger Lake supporting it...
More info here: http://www.numberworld.org/blogs/202...x512_teardown/- Intel added AVX512-VP2INTERSECT to Tiger Lake. But it was really slow. (microcoded ~25 cycles/46 uops)
- It was so slow that someone found a better way to implement its functionality without using the instruction itself.
- Intel deprecates the instruction and removes it from all processors after Tiger Lake. (ignoring the fact that early Alder Lake unofficially also had it)
- AMD adds it to Zen5.
But how good is AMD's implementation? 1 cycle throughput.
Comment
-
I think what people are missing is Linus' point that the cpu already has cpuid flags. And already regularly includes regular and fast code paths based on some instruction existing or not. So there's no reason to build a x86-64v4 kernel when the kernel can detect if AVX512 exists, test if it actually gets a speedup (sometimes it doesn't) and use it if it does. There's stuff for encryption, compression, stuff in the RAID modules. etc. that does exactly this (test several implementations at start and use whichever is fastest.)
Just to add, this whole thing with AVX512 is a bit gross IMHO. The ARM NEON (and equivalent instructions on 64-bit) you just specify the length. You can have some chip with like 128-bit vector stuff and it'll do 512-bit or even 2048-bit. It's an implementation detail, not 'we have wider vector units so here's the same instructions but twice as wide of vectors.'Last edited by hwertz; 05 December 2024, 03:04 PM.
Comment
-
Comment