Poking At A Big NUMA Benchmark Regression In Linux 5.18 Git
One of the systems I started running Linux 5.18 Git benchmarks during this second week of the merge window was on the AMD Ryzen 9 5950X.
Linux 5.18 on the Ryzen 9 5950X has been largely stable compared to recent kernel series...
While most benchmarks didn't show any measurable change from Linux 5.15 through 5.18 Git this week, Stress-NG was an outlier... And in particular, its NUMA benchmark.
The stress-ng load/stress program does a good job stressing Linux systems and its NUMA test case in particular regressed heavy with Linux 5.18. Other tested stress-ng stressors were unaffected. With Linux 5.18 Git, the NUMA bogo ops per second performance nosedived.
While a synthetic test case, I was able to reproduce this Stress-NG NUMA nosedive on an Intel Core i9 12900K desktop too where Linux 5.16/5.17 was steady but then Linux 5.18 fell sharply...
Ouch... Seems to be the first major regression seen so far in my early #Linux 5.18 Git testing.
— Phoronix (@phoronix) March 30, 2022
At least should be a quick, easy fun one to track down. pic.twitter.com/U5nsDeqvtM
So back on the Ryzen 9 5950X system I went through to bisect this regression affecting Linux 5.18 Git:
The regression traced back to the memory management changes merged last week for Linux 5.18... In particular, this commit was Git bisected to being the first bad commit where the stress-ng NUMA performance collapsed.
At least with the various real-world workloads I've benchmarked so far on the Ryzen 9 5950X, Core i9 12900K, and a few others I have yet to see any significant difference there with Linux 5.18. At least though stress-ng is a quick and easy to run test case. Though with perpetual time/resource limitations, that's where I ended this testing that anyone can now pick up from with the information available.
I'll be looking at other areas of Linux 5.18 performance and on more hardware as the merge window draws to a close. If you enjoy the Linux benchmarks and other work, consider showing your support via going Phoronix Premium or PayPal tip to allow for more time/resources for such investigative benchmarking.