Originally posted by AsuMagic
View Post
Announcement
Collapse
No announcement yet.
The Impact Of GCC Zen Compiler Tuning On AMD Ryzen Performance
Collapse
X
-
Last edited by zboson; 03 March 2017, 02:43 PM.
-
Originally posted by zboson View PostZen is also dual channel if I recall whereas Skylake (not sure which was first) is quad-channel. This means Zen is more effected by memory bandwidth. That's maybe the second most disappointing thing about Zen after sticking with AVX128. I'm still likely to build a Zen system. It will be the first desktop I have build in years.
As for the article... hoping GCC7 takes this further. Fairly disappointing. But hey, I'm even more disappointed in the platform itself with its extreme lack of PCIe and SATA connectivity.
- Likes 1
Comment
-
Originally posted by zboson View PostZen is also dual channel if I recall whereas Skylake (not sure which was first) is quad-channel. This means Zen is more affected by memory bandwidth. That's maybe the second most disappointing thing about Zen after sticking with AVX128. I'm still likely to build a Zen system. It will be the first desktop I have build in years.
Comment
-
Originally posted by Holograph View Post
Intel only has quad-channel in the socket 2011 stuff. They have never done quad-channel in a mainstream consumer product. And Skylake for 2011 isn't out.
As for the article... hoping GCC7 takes this further. Fairly disappointing. But hey, I'm even more disappointed in the platform itself with its extreme lack of PCIe and SATA connectivity.
https://arstechnica.com/gadgets/2017...t-can-compete/
"One particular Geekbench subtest showed a strong advantage in the other direction. Geekbench includes tests of the cryptographic instructions found in all mainstream processors these days. In a test of single-threaded performance, the Ryzen trounces the Broadwell-E, encrypting at 4.5GB/s compared to 2.7GB/s. Ryzen has two AES units that both reside within the floating point portion of the processor. Broadwell only has one, giving the AMD chip a big lead.
This situation is reversed when moving from one thread to 16: the Intel system can do 24.4GB/s while the AMD only does 10.2GB/s. This suggests that the test becomes bandwidth-limited with high thread counts, allowing the 6900K's quad-memory channels to give it a lead over the 1800X's dual channels. Even though the Ryzen has more computational resources to throw at this particular problem, that doesn't help when the processor sits twiddling its thumbs waiting for data."Last edited by zboson; 03 March 2017, 02:52 PM.
Comment
-
I'd be curious to know if overclocking the memory on Ryzen helps, but I know Michael isn't really into overclocking. And supposedly a lot of people are having issue trying to overclock memory with Ryzen so far, but some have pulled it off. I'm hoping more people will release data on this. Of course this question is worthless to anyone who is using Linux for productivity-related stuff.
Comment
-
Originally posted by Holograph View PostI'd be curious to know if overclocking the memory on Ryzen helps, but I know Michael isn't really into overclocking. And supposedly a lot of people are having issue trying to overclock memory with Ryzen so far, but some have pulled it off. I'm hoping more people will release data on this. Of course this question is worthless to anyone who is using Linux for productivity-related stuff.Michael Larabel
https://www.michaellarabel.com/
- Likes 3
Comment
-
Originally posted by zboson View Post
"One particular Geekbench subtest showed a strong advantage in the other direction. Geekbench includes tests of the cryptographic instructions found in all mainstream processors these days. In a test of single-threaded performance, the Ryzen trounces the Broadwell-E, encrypting at 4.5GB/s compared to 2.7GB/s. Ryzen has two AES units that both reside within the floating point portion of the processor. Broadwell only has one, giving the AMD chip a big lead.
This situation is reversed when moving from one thread to 16: the Intel system can do 24.4GB/s while the AMD only does 10.2GB/s. This suggests that the test becomes bandwidth-limited with high thread counts, allowing the 6900K's quad-memory channels to give it a lead over the 1800X's dual channels. Even though the Ryzen has more computational resources to throw at this particular problem, that doesn't help when the processor sits twiddling its thumbs waiting for data."
- Likes 1
Comment
-
According to an analysis of Ryzen cache by hardware.fr (which Chrome translates well): latency is very poor when a core from one of the two four-core-clusters accesses something in the L3 cache of the other.
Comment
Comment