Originally posted by stormcrow
View Post
Announcement
Collapse
No announcement yet.
Intel's Latest CPU Microcode Update Isn't All That Scary
Collapse
X
-
Originally posted by stormcrow View PostSome cases suggest perhaps otherwise: https://travisdowns.github.io/blog/2...-zero-opt.html
It all depends on what you're doing.
No, I wouldn't expect that to move the needle on the geomean. Maybe a handful of benchmarks could show a measurable difference, perhaps in high-core-count scenarios where there's a lot of L3 cache pressure.
Comment
-
Originally posted by coder View PostWow, that's a little nuts. So, they basically just had a micro-optimization for filling memory with 0's that this disables, presumably because timing-based attacks can use that to determine (to a limited degree) the contents of what's being written.
No, I wouldn't expect that to move the needle on the geomean. Maybe a handful of benchmarks could show a measurable difference, perhaps in high-core-count scenarios where there's a lot of L3 cache pressure.
Comment
-
Originally posted by Jabberwocky View Post
Maybe using clang to compile chrome or the kernel could be a good test. You're right using the mean could be like boiling the frog.
More seriously, I doubt you'd want to compile Chromium as a benchmark, it seems to take hours even on a high end machine. I'm not sure what would be a good test in this case other than array calculations with many objects filled with zeros. But then you'd just condense it down to sparse operations on non-zero ops wouldn't you?Last edited by stormcrow; 23 June 2021, 11:05 AM. Reason: clarity on the objects, must have clear objects you know!
- Likes 1
Comment
-
Originally posted by stormcrow View PostI'm not sure what would be a good test in this case other than array calculations with many objects filled with zeros. But then you'd just condense it down to sparse operations on non-zero ops wouldn't you?
My first thought was of calloc(), so maybe some fairly light-weight text processing that uses it. It'd have to be multithreaded or otherwise scale up enough to really hit L3, though.
Comment
-
Originally posted by coder View PostSoftware rendering and imaging benchmarks, perhaps? Could involve lots of buffer-clears.
My first thought was of calloc(), so maybe some fairly light-weight text processing that uses it. It'd have to be multithreaded or otherwise scale up enough to really hit L3, though.
Comment
-
Originally posted by stormcrow View Post
Poor frog.
More seriously, I doubt you'd want to compile Chromium as a benchmark, it seems to take hours even on a high end machine.
IIRC the last time I compiled a browser or a kernel, excluding SBC-kernels, was before chrome/chromium existed. I've been neglecting the passion I had (even called chrome by mistake lol).
I'm not sure what would be a good test in this case other than array calculations with many objects filled with zeros. But then you'd just condense it down to sparse operations on non-zero ops wouldn't you?
I found the title very funny: "Intel's Latest CPU Microcode Update Isn't All That Scary". The lecturers I knew would have failed you if you said something like that. It's not because scary is not a technical word, but because basically impossible* to formulate that type of conclusion on a blackbox system.
* Breaking the x86 Instruction Set https://youtu.be/KrksBdWcZgQ (this was before spectre/meltdown) 🤣
The future generations of scientifically educated developers will probably laugh when they study this era of hardware and firmware. There's a lot of study material lying around: https://www.anandtech.com/show/15962...b-of-ip-leaked I wonder how many years or decades it will take before it becomes part of computer science. Hopefully part of it will make it into educational entities. It's important to learn from the mistakes of others.
- Likes 1
Comment
-
Comment