Originally posted by coder
View Post
I don't disagree with you re: Zen having a 4-way front end, but there are two of them per core, so from a total throughput perspective it's 8-way. From a single-thread perspective it's 4-way. I pointed that out. No need to rub it in my face, lol.
Originally posted by coder
View Post
Actually, the OOO mechanisms of modern CPUs (ROB, branch predictors, speculation, op schedulers) exist exactly for the purpose of extracting parallelism in order to construct waves of parallel operations. They're not VLIW from a user-facing ISA perspective, but they are from a microarch perspective. The waves of parallel micro-ops / fused micro-ops, are fed to the schedulers which form the basis of the VLIW instruction set driving the execution/memory units. Those units and their schedulers are the core, and everything that comes ahead of it are just getting its ducks in a row so it can actually crunch the numbers and push the effects out.
Now coming full circle... let's apply an empirical test to your assertions:
- x86 ISA is by design inferior and impairs core performance vs ARM ISA
- CISC cannot outperform RISC on code density
- Zen only uses a 4-way front end (granted)
Hmm... what's real-world testing on sites like this one say? Generally, x86 still holds the single-thread and multi-thread crowns. There are some exceptions. x86 doesn't completely pwn ARM.
I've followed a lot of your contributions here and I appreciate a lot of the information you've provided, even your occasional wry humour.
If the empirical evidence supported your assertions, by all means, I would go conduct research to educate myself around the superiority of this new information you're providing, which disagrees with the 40 years of research I've already done. However in this instance...
Comment