Announcement

**ssokolow** · 20 February 2024, 02:55 AM

Originally posted by duby229 View Post

Thats what PentiumD was. It was literally a copy and pasted core, so much so that they had to do cache coherency through L3 cache because it had no other means of interconnecting cores. Athlon64X2 had a full crossbar controller and they even modified MESI coherency protocol to MOESI so that cache coherency could be done over that crossbar, it didn't even need L3 cache

EDIT: https://en.wikipedia.org/wiki/MOESI_protocol

This additional state was invented specifically to take advantage of AMD's crossbar.

Fair. I should have been more clear that I was contrasting "copy-pasted cores" against the argument that cores are an artificial abstraction over what is really a single superscalar core.

**torsionbar28** · 20 February 2024, 10:39 AM

Originally posted by ddriver View Post

Thank intel and its fake marketing cores. But I'd say 2 types of fake cores is simply not enough. It has to be a real maze, a paradoxical issue for the OS to schedule work properly.

This is Intel's new strategy. Remember Itanium? Shift the burden to the compiler, who cares if said compiler is essentially impossible to write. Call the whole thing brilliant, and then blame the software when it doesn't work in the real world.

**WorBlux** · 27 February 2024, 06:29 PM

Originally posted by sophisticles View Post

drakonas777 and WorBlux

i take it you don't realize that what I suggested was an extension of what Skylake was already doing:

Intel's Skylake Processors Allegedly Rocking 'Inverse Hyper Threading' - VISC Like Architecture with Massive Single Threaded Performance

https://wccftech.com/intel-inverse-hyper-threading-skylake/

Intel's new Skylake processors have a major update when it comes to single threaded performance, dubbed 'inverse hyper threading'.

As I said, cores are an artificial construct, what is the difference between a single core CPU that has 2 ALUs and 2 FPUs and a dual core CPU that has 1 ALU and 1 FPU per core?

The correct answer is none.

In both cases the chip can handle 2 integer based threads and 2 floating point based threads.

If anything the single core chip will be faster because of the unified cache.

Understand?

No, VISC and Softcore IP never made it into the intel core architecture.

Also VISC was premised on the idea of the compiler finding and issuing explicit micro-threads based on data flow analysis, and being able to run some of them speculatively.

http://people.ece.cornell.edu/martinez/doc/ics12.pdf

Skylake was mostly just an iteration on Haswell, with bigger everything, and was about the point when there was enough decodes and u-Op dispatch to make hyper-threading mostly unoticable on control/branch/logic heavy code.

Skylake (client) - Microarchitectures - Intel - WikiChip

https://en.wikichip.org/wiki/intel/microarchitectures/skylake_%28client%29

As to a big hyperthread vs a smaller single threaded core... which is faster really depends on the workload. Unified cache could be great, or it could mean data is evicted to the L2 or L3 at the worst possible time, and smaller caches are faster. Also scaling isn't linear... a lot of stuff is on order n*log(n), or
n^2 complexity. ... hence the need for a 3-d architecture to even attempt a 40-60 issue superscaler design.

**WorBlux** · 27 February 2024, 06:29 PM

edit: duplicate post deleted

Announcement

Cleaning Up A Mess: Linux 6.9 Likely To Land Rework Of x86 CPU Topology Code

Comment

Comment

Comment

Comment