Announcement

Collapse
No announcement yet.

Cleaning Up A Mess: Linux 6.9 Likely To Land Rework Of x86 CPU Topology Code

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • #41
    Originally posted by duby229 View Post
    Thats what PentiumD was. It was literally a copy and pasted core, so much so that they had to do cache coherency through L3 cache because it had no other means of interconnecting cores. Athlon64X2 had a full crossbar controller and they even modified MESI coherency protocol to MOESI so that cache coherency could be done over that crossbar, it didn't even need L3 cache

    EDIT: https://en.wikipedia.org/wiki/MOESI_protocol

    This additional state was invented specifically to take advantage of AMD's crossbar.
    Fair. I should have been more clear that I was contrasting "copy-pasted cores" against the argument that cores are an artificial abstraction over what is really a single superscalar core.

    Comment


    • #42
      Originally posted by ddriver View Post
      Thank intel and its fake marketing cores. But I'd say 2 types of fake cores is simply not enough. It has to be a real maze, a paradoxical issue for the OS to schedule work properly.
      This is Intel's new strategy. Remember Itanium? Shift the burden to the compiler, who cares if said compiler is essentially impossible to write. Call the whole thing brilliant, and then blame the software when it doesn't work in the real world.

      Comment


      • #43
        Originally posted by sophisticles View Post
        drakonas777 and WorBlux

        i take it you don't realize that what I suggested was an extension of what Skylake was already doing:



        As I said, cores are an artificial construct, what is the difference between a single core CPU that has 2 ALUs and 2 FPUs and a dual core CPU that has 1 ALU and 1 FPU per core?

        The correct answer is none.

        In both cases the chip can handle 2 integer based threads and 2 floating point based threads.

        If anything the single core chip will be faster because of the unified cache.

        Understand?
        No, VISC and Softcore IP never made it into the intel core architecture.

        Also VISC was premised on the idea of the compiler finding and issuing explicit micro-threads based on data flow analysis, and being able to run some of them speculatively.


        Skylake was mostly just an iteration on Haswell, with bigger everything, and was about the point when there was enough decodes and u-Op dispatch to make hyper-threading mostly unoticable on control/branch/logic heavy code.



        As to a big hyperthread vs a smaller single threaded core... which is faster really depends on the workload. Unified cache could be great, or it could mean data is evicted to the L2 or L3 at the worst possible time, and smaller caches are faster. Also scaling isn't linear... a lot of stuff is on order n*log(n), or
        n^2 complexity. ... hence the need for a 3-d architecture to even attempt a 40-60 issue superscaler design.

        Comment


        • #44
          edit: duplicate post deleted

          Comment

          Working...
          X