Announcement

Collapse
No announcement yet.

Benchmarking The AMD EPYC Speed Boost Coming To Linux 5.18, Thanks To Scheduler/NUMA Improvement

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Benchmarking The AMD EPYC Speed Boost Coming To Linux 5.18, Thanks To Scheduler/NUMA Improvement

    Phoronix: Benchmarking The AMD EPYC Speed Boost Coming To Linux 5.18, Thanks To Scheduler/NUMA Improvement

    Earlier this month I noted a Linux scheduler change queued into sched/core ahead of the Linux 5.18 cycle that is expected to help AMD EPYC processors and other select Zen processors in various workloads. The change has been in the works for several months and is about adjusting the allowed NUMA imbalance when spanning multiple LLCs. I've now carried out some of my own benchmarks on EPYC hardware and indeed is further ratcheting up the Linux kernel performance.

    Phoronix, Linux Hardware Reviews, Linux hardware benchmarks, Linux server benchmarks, Linux benchmarking, Desktop Linux, Linux performance, Open Source graphics, Linux How To, Ubuntu benchmarks, Ubuntu hardware, Phoronix Test Suite

  • #2
    This looks great for people who let their EPYCs run in default configuration due to not reading AMD's optimization documentation

    Michael Can you test with NPS4 and L3 as NUMA enabled?

    Comment


    • #3
      Originally posted by numacross View Post
      This looks great for people who let their EPYCs run in default configuration due to not reading AMD's optimization documentation

      Michael Can you test with NPS4 and L3 as NUMA enabled?
      Then the question is: Why is AMD's default configuration so sub-optimal?

      Comment


      • #4
        Originally posted by chuckula View Post

        Then the question is: Why is AMD's default configuration so sub-optimal?
        Because people are scared of NUMA and rightfully so - for some workloads it is beneficial while for others it's detrimental.
        Intel has similar configuration modes available. 3rd generation Xeon Scalable can be split into 2 or 4 NUMA nodes if needed. There is also a special mode for L3, but I have no experience with it so it might not work the same as L3 as NUMA on AMD.
        Last edited by numacross; 28 February 2022, 10:47 AM.

        Comment


        • #5
          That's enough of a performance difference to be considered a next-gen CPU, or tapping into a new instruction set. Pretty impressive results for a scheduler. Makes sense though.

          Comment


          • #6
            Not bad for some minor tweaking of the NUMA code! In most tests it's a decent win so that's nice. Always goes to show there's more perf to be squeezed out of AMD CPUS with some thought and effort. I bet after baking for a while someone is going to find a further way to tweak the NUMA stuff after others have had some time to put it into production and beat on it for a while. Nice job with this.

            Comment


            • #7
              I wonder what the performance is with NUMA turned off.

              For some reason I cannot remember any longer, I turned NUMA off when I got a Ryzen setup.

              Comment


              • #8
                Originally posted by Beherit View Post
                I wonder what the performance is with NUMA turned off.

                For some reason I cannot remember any longer, I turned NUMA off when I got a Ryzen setup.
                Would be curious to see this too. Something like NUMA off vs NUMA vanilla vs new NUMA perf code. Big data benefits most from this though, no?

                Comment


                • #9
                  Could you please test a 2-die Ryzen 5000 as well please (5950 for example) ?

                  Comment


                  • #10
                    This shows just how foolish AMD's underinvestment in the kernel (and software generally) has been.

                    Generational performance improvement from one patch - that isn't even by an AMD employee - years after Rome launched. How many million man-hours did it take to get the 15-20% perf improvement in Milan, while low-hanging fruit like this was sitting around on the software side?

                    It's been improving for the last year or two, but still a handful of kernel and graphics engineers could do much more to make the product attractive than the diminishing-returns end of hardware updates.

                    Comment

                    Working...
                    X