Originally posted by coder
View Post
Announcement
Collapse
No announcement yet.
AMD Zen 4 AVX-512 Performance Analysis On The Ryzen 9 7950X
Collapse
X
-
-
Originally posted by bridgman View PostIt's not half-baked, it's half-sized and perfectly baked
In fairness we do also have multiple FP execution ports/pipes (6 vs 3 for Golden Cove) so there can still be a lot of work happening in parallel.
on other tasks like SGX intel failed to...
and the intel ARC GPUs in my point of view failed to.
really man intel would be doing better by just license the AMD version of AVX512 and just license RDNA3 design to ...
same for apple... the Apple M1/M2 SOCs could be much better in my point of view if apple just license RDNA3,...
i did read some benchmarks of the RDNA2 in Samsungs ARM SOCs vs Qualcomm old Adreno (old ATI gpu tech)
and the RDNA2 license pay for itself both have the same performance and the same power consumtion but the RDNA samsung chip has much less tranistors and much higher clock speed its 1400mhz on the Samsung SOC and only like 750mhz on the Qualcomm SOC... this means the RDNA2 license pay for itself by the tranistor count alone.
qualcomm could produce SOCs with the same performance with much less tranistors. also RDNA has more features like raytracing acceration hardware...
this means we have smart companies like samsung who just get a RDNA2 license and then we have stupid companies like intel who fail on their own design...
also if apple would license the RDNA3 design their linux support would instandly be much better because the opensource driver is done already.
Phantom circuit Sequence Reducer Dyslexia
Comment
-
Originally posted by Sin2x View Post
You've been a fan of an instruction set? What's wrong with you?
Obligatory Linuses quote: https://www.realworldtech.com/forum/...rpostid=193190
The PlayStation 3 emulator can make use of the extra-wide SIMD in AMD's new Zen 4 CPUs to see a significant speedup.
- Likes 2
Comment
-
Originally posted by ms178 View Post
Comment
-
Originally posted by Dukenukemx View PostLinus sees what benefits him and that's compiling kernel code.
- Likes 1
Comment
-
Originally posted by Dukenukemx View PostNot everything Linus Torvalds says is gospel. There's lots of real world use cases where AVX-512 has huge benefits. Emulators like RPCS3 and Yuzu both benefit greatly from the use of AVX-512. Linus sees what benefits him and that's compiling kernel code. He can't see the forest between the trees.
https://hothardware.com/news/rpcs3-d...s-with-avx-512
The PlayStation 3 emulator can make use of the extra-wide SIMD in AMD's new Zen 4 CPUs to see a significant speedup.
https://wccftech.com/amd-zen-4-avx-5...-xenia-vita3k/
Don't ever presume you could be smarter than Linus, you only make yourself look like a clown.
Comment
-
Originally posted by Sin2x View Postthe die area it uses could be put to better use for general computing. Which -- surprise -- Intel did by stripping this functionality from desktop processors and leaving it only on Xeons.
Not only that, but Intel actually did put AVX-512 into Alder Lake desktop/mobile chips, they just disabled it because the E-cores didn't have it and they didn't want to deal with the headaches of asymmetric instruction support. That means it's still using the extra die area.
Originally posted by Sin2x View PostDon't ever presume you could be smarter than Linus, you only make yourself look like a clown.
That said, I've long been critical of AVX-512, or at least the aspect of it which involves widening vectors to 512-bit. Other things, like predication and scatter/gather, are indeed nice and maybe not hugely expensive in die area.
I'm a little bit critical of scatter/gather, just because I think it lulls programmers into thinking they don't need to worry about data layout. However, even having the CPU fetch & interleave your data doesn't mean you don't have to worry about things like cache thrashing.
Comment
-
Originally posted by coder View PostThere's not a whole lot to go on, but these observations could be largely explained by Intel simply spending little/no time on frequency curve optimizations, when AVX-512 is enabled. Because, if enabling AVX-512 actually decreases average power consumption, then it must be getting clock-throttled more aggressively than the non- AVX-512 case.
Comment
Comment