Originally posted by Veto
View Post
Announcement
Collapse
No announcement yet.
AMD Ryzen 9 7900X Performance With ECC DDR5 Memory
Collapse
X
-
Last edited by piorunz; 05 October 2023, 07:16 PM.
- Likes 3
-
Originally posted by atmartens View PostThe data really show how pathetic it is that ECC isn't standard. The performance penalty is negligible!
For me this is worth it, but for extreme gamers maybe not because they can redownload their game and so on.
- Likes 1
Comment
-
In summary, I am surprised that there is a measurable difference from turning ECC on and off. Even 2-3% isn't what I expcted.
7% slower surprises me. It doesn't match my mental model of how ECC works.
Now I'm afraid that somebody does the same testing for registered vs. unbuffered RAM
- Likes 1
Comment
-
I've been running my consumer grade ASRock AM4 motherboard NFS server with ECC memory for over a year now and have yet to have yet to have any errors reported by edac-util and no reliability problems. I would not even consider running an NFS server without ECC memory. Most ASRock AM4 mother boards support ECC. A quick check of their AM5 motherboards shows that ECC support is 50/50 or less with few of the low end and even some of the high(est) end motherboards not supporting it.
- Likes 1
Comment
-
Originally posted by schmidtbag View PostThis reminds me of the days when someone was arguing with me that ECC was going to be standard on DDR5 even for desktops and laptops. Sure seems that didn't pan out.
Comment
-
Originally posted by Yalok View Post
According to DDR5 Wikipedia article, there is in fact some form of ECC by design on all DDR5 sticks, that was not present on standard DDR4.
I trust it to get DDR5 working well enough to be competitive with DDR4. We don't know how often the in-die ECC is correcting errors introduced by the chip and if a cosmic ray hits your memory in the neighborhood of the data your overclocked DDR5 DIMM is triggering repeat errors you've got a corruption.
- Likes 1
Comment
-
Originally posted by Yalok View Post
According to DDR5 Wikipedia article, there is in fact some form of ECC by design on all DDR5 sticks, that was not present on standard DDR4.
- Likes 3
Comment
-
Originally posted by piorunz View PostUnfortunately it's not that easy. Normal non-ECC sticks have much lower latency and are faster because of that. ECC sticks have very high CL and are slower. Just inserting ECC stick with ECC disabled cost you some performance.
ECC UDIMMs only differ from non-ECC UDIMMs in terms of the number of DRAM chips on them. The DRAM chips, themselves, are exactly the same as the type used on non-ECC UDIMMs.
So, why the higher latency spec? That's only because they used conservative timing and adhere strictly to JEDEC specifications. They also tend not to have heatspreaders on them, like you often to see on "gaming" DIMMs. So, I'm not sure how much the specs are down to reducing heat dissipation or power consumption.
When the CPU is running in ECC mode, the CPU's integrated memory controller performs the ECC checking & computation. That can cost you a couple extra nanoseconds, at most. I'd further speculate that perhaps enabling ECC might disable burst chop, as that's just about the only way I can make sense of the more extreme outliers, assuming those results are stable.Last edited by coder; 06 October 2023, 04:01 AM.
- Likes 5
Comment
-
Originally posted by Yalok View PostAccording to DDR5 Wikipedia article, there is in fact some form of ECC by design on all DDR5 sticks, that was not present on standard DDR4.
Furthermore, as others have noted, DDR5 doesn't report these on-die errors. So, your machine will be blissfully unaware of any error detections or corrections, which I think is a wasted opportunity. Imagine the kernel keeping a list of which pages have ECC errors and then simply removing from circulation any pages in which more than one error has been detected! I believe this is similar to what modern SSDs do to help compensate for NAND aging.
From what I've read, there are two reasons DDR5 has on-die ECC:- Relative to DDR4, it runs at lower voltage and increased the refresh interval. Both of these changes provide power-savings, but increase the likelihood of bit errors.
- Shrinking cell sizes (and die-stacking?) also increases the likelihood of bit errors.
So, to compensate for its higher intrinsic error rate, the DDR5 spec allows for on-die ECC. The expectation is that manufacturers will use just enough to achieve comparable external error rates as DDR4. So, don't expect a net benefit from DDR5's on-die ECC. It's there to provide adequate reliability for mass-market applications, not extra. That's why the DDR5 DIMM spec allows for end-to-end ECC!Last edited by coder; 06 October 2023, 03:59 AM.
- Likes 7
Comment
-
Originally posted by undersuit View PostWe don't know how often the in-die ECC is correcting errors introduced by the chip and if a cosmic ray hits your memory in the neighborhood of the data your overclocked DDR5 DIMM is triggering repeat errors you've got a corruption.
For a sysadmin, ECC errors are a sign that you should replace a DIMM. Without (visible) ECC, you won't know it has worn out until it's so bad the machine has become unstable or corrupted data -- and then, just maybe you happen to have the presence of mind to run memtest.
Long ago, I read that poor quality power supplies can also increase the frequency of memory errors, though I'm not sure if that's due more to noise or voltages being off.Last edited by coder; 06 October 2023, 03:56 AM.
- Likes 5
Comment
Comment