Software Linux Reviews & Articles
There have been 921 Linux hardware reviews and benchmark articles on Phoronix for software. Separately, check out our news section for related product news.
There have been 921 Linux hardware reviews and benchmark articles on Phoronix for software. Separately, check out our news section for related product news.
With the in-development Linux 6.13 kernel one of the biggest features for those using new AMD EPYC 9005 "Turin" processors is using the AMD P-State driver by default for servers/motherboards with ACPI CPPC support enabled. But even for platforms without that where ACPI CPUFreq remains the default, the Linux 6.13 kernel is still showing some nice incremental uplift at large on these new AMD server processors. Here are some Linux 6.11 vs. 6.12 vs. 6.13 Git kernel benchmarks using an AMD EPYC 9575F 64-core server.
With the Linux 6.13 merge window having ended this past weekend, here's the Phoronix overview of all the interesting feature additions, new hardware support, and other kernel changes coming for Linux 6.13.
Following the release last month of the EPYC 9005 series processors, AMD published a BIOS and Workload Tuning Guide of straight-forward settings recommendations for those running new EPYC Turin servers to optimize the performance of different workloads like databases and Java to HPC and AI/ML software. Recently I started running some benchmarks to look at the impact of AMD's recommended BIOS tuning and beginning this comparison by looking at the performance (and power) impact across a range of AI / machine learning workloads on a 5th Gen AMD EPYC server.
At the beginning of November I wrote about AMD Linux engineers posting Linux patches enabling a new "ERAPS" feature for Zen 5. ERAPS wasn't talked about by AMD at the Zen 5 launches of the Ryzen 9000 / Ryzen AI 300 series or with the more recent EPYC 9005 "Turin" launch but when enabled, the Enhanced Return Address Prediction Security feature can help deliver some additional gains on new AMD Zen 5 systems by allowing some existing software security mitigations to be avoided. Here are some preliminary comparison benchmarks showing the benefit in affected workloads for using ERAPS on Linux with AMD Zen 5.
Last week when launching the AMD EPYC 9005 "Turin" processors, on the same day AOCC 5.0 was quietly released as the newest version of AMD's Zen-focused compiler derived from LLVM/Clang. With not only adding AMD Zen 5 "znver5" support but also additional vectorization improvements and other performance optimizations, I was eager to run some benchmarks of AOCC 5.0 against the open-source GCC and LLVM/Clang compilers. Here are those initial benchmarks using dual AMD EPYC 9755 128-core Zen 5 processors.
The Linux 6.12 merge window is wrapping up today with the release of Linux 6.12-rc1 in the coming hours. This is going to be a heck of an exciting kernel. There's real-time PREEMPT_RT finally in mainline, the much anticipated sched_ext code also was merged, QR codes for DRM panic messages, initial out-of-the-box support for Intel Xe2 graphics with Lunar Lake and Battlemage, initial Raspberry Pi 5 support, and a ton of other hardware support additions and new innovative kernel software features.
While having the Supermicro ARS-211M-NR R13SPD server in the lab for AmpereOne benchmarking with the flagship AmpereOne A192-32X processor, I took the opportunity to run some fresh GCC vs. LLVM Clang compiler performance benchmarks on AArch64. Here are those results for that healthy competition between these open-source C/C++ compilers on AmpereOne cores.
While Windows gamers seem mixed over the AMD Ryzen 9000 series processors, for creator, scientific / HPC, code development, and many other technical computing areas I remain very impressed by the Ryzen 9000 (Zen 5) series desktop processors more than one month into constant testing with these Granite Ridge chips. One of the areas I hadn't explored until now but made me curious given the mixed messaging around gaming was how well workstation graphics workloads were performing with the new processors. For this brief weekend article is a look at the workstation graphics performance between the Ryzen 9 9950X and former Ryzen 9 7950X/7950X3D processors.
One of the security changes with AMD Zen 5 processors that I haven't seen AMD publicly mention at least not prominently is that the new cores are not vulnerable to Speculative Return Stack Overflow (SRSO). Unlike Zen 4 and prior, under Linux I noticed that Zen 5 is no longer affected by the SRSO "INCEPTION" vulnerability. But of course there does remain other CPU security mitigations in place carried over from Zen 4. For those wondering about the mitigation costs or if it's worthwhile running Zen 5 with the "mitigations=off" insane mode, here are some benchmarks.
Continuing on with the AmpereOne performance benchmarking while having the AmpereOne A192-32X in the lab within a Supermicro ARS-211M-NR R13SPD server, the next set of benchmarks is looking at the performance when using the near-final Linux 6.11 kernel. Additionally, quantifying the performance impact of using the ARM64 64K page size kernel as an alternative to the default 4K page size.
Continuing on with the AMD Ryzen 9000 series Linux benchmarking, today's testing is looking at the performance and power impact of the AMD Ryzen 9 9950X when adjusting the CPU frequency scaling driver, governor, and Energy Performance Preference (EPP) tunable to help look at the performance and power efficiency characteristics of this current flagship Zen 5 desktop processor.
Linux 6.11 as often is the case each kernel cycle brings a lot of new improvements for recent/upcoming Intel and AMD platforms. There are big step forwards for AMD confidential computing, initial RDNA4 graphics support, new AMD P-State driver features, and much more on the AMD side. On the Intel side there is ongoing work around Xe2 graphics for Lunar Lake and Battlemage but not yet stable. Intel also has initial Panther Lake audio support, more NPU driver enhancements, and a range of other kernel additions to benefit their platforms.
Covered last week on Phoronix was a new patch from Intel that with tuning to the P-State CPU frequency scaling driver was showing big wins for Intel Core Ultra "Meteor Lake" performance and power efficiency. I was curious with the Intel claims posted for a couple benchmarks and thus over the weekend set out to run many Intel Meteor Lake benchmarks on this one-line kernel patch... The results are great for boosting the Linux performance of Intel Core ultra laptops with as much as 72% better performance.
Now that the Linux 6.10 merge window has wrapped up, here's a look at all of the exciting features/changes coming to this summer 2024 kernel. Linux 6.10 brings a lot as usual for the latest/upcoming Intel and AMD platforms, never-ending work on file-systems, a new memory sealing "mseal" system call, TPM bus encryption, and dozens of other exciting changes and new hardware support.
As noted at the end of March, System76's Pop!_OS Linux distribution has upgraded to the Linux 6.8 kernel as a stable release update for this Ubuntu-derived distribution. That Linux 6.6 to 6.8 leap on Pop!_OS yielded some nice kernel performance improvements for the AMD Ryzen Threadripper 7980X available with their newest System76 Thelio Major workstations. But tests I've carried out of the Pop!_OS upgrade on a years older System76 Thelio is showing off some nice advantages too.
As part of my Linux 6.9 benchmarking I've been trying out many hardware combinations and overall seeing nice performance out of this kernel that will debut as stable in the next 2~3 weeks. AMD EPYC 4th Gen performance is boosted, Intel Xeon Max sees some AI improvements, and as shown in some prior Intel Core Ultra performance benchmarks is enhanced as well. Here are some more benchmarks looking at the Intel Core Ultra 7 "Meteor Lake" performance on Linux 6.9 compared to the current Linux 6.8 stable kernel.
One of the leading-edge benefits of Fedora Linux is that it always ships with the most up-to-date open-source compiler toolchains at release. For their spring releases each year, it typically means shipping with a GCC compiler that isn't even officially released as stable yet. With this week's release of Fedora 40, it's shipping with GCC 14.0.1 as the development version that will culminate with the inaugural GCC 14 stable release in the coming weeks. Plus Fedora 40 has all of the other latest GNU toolchain components and then over on the LLVM side is with the current LLVM 18 stable series. For those curious how GCC 14 vs. LLVM Clang 18 performance is looking, here is a wide range of C/C++ benchmarks carried out on Fedora Workstation 40 using a System76 Thelio Major workstation powered by the Zen 4 AMD Ryzen Threadripper 7980X.
Last week being surprised to see a number of AMD EPYC performance gains with Linux 6.9 using that in-development kernel, I was curious about what other platforms may be benefiting from better performance on this kernel that will debut as stable in May. This week I turned to running some fresh benchmarks of Intel Xeon Max using the Supermicro Hyper SuperServer SYS-221H-TNR. More than 230 benchmarks were carried out of Linux 6.8 stable versus Linux 6.9-rc2 in looking for any performance differences.
Now that the Linux 6.9 merge window is past I've begun testing out this in-development kernel on more hardware platforms in the lab. While some performance boosts like Intel Core Ultra "Meteor Lake" running faster on Linux 6.9 was to be expected given EPP tuning in the new kernel specific to those SoCs, one of the unexpected delights has been seeing AMD 4th Gen EPYC performance with some nice performance gains over Linux 6.8 stable.
While Intel Core Ultra "Meteor Lake" has been working out well under Linux already -- especially with regards to the enticing integrated Arc Graphics -- with the in-development Linux 6.9 kernel it's looking even better for the CPU performance. Here are some initial benchmarks looking at the Intel Core Ultra 7 155H Meteor Lake performance with Linux 6.8 vs. 6.9 Git.
Some time ago I ran through a number of benchmarks of Google Cloud's C3D VMs powered by AMD EPYC Genoa processors. The AMD EPYC 9004 series showed terrific performance with strong generational improvements over the Intel Xeon Scalable processors. Following that a request came in to examine the PingCAP TiDB database performance given its growing popularity. In this article we'll review those benchmarks showing how GCE C3D delivers strong performance advantages for TiDB.
Now that Linux 6.9-rc1 was released on Sunday to mark the end of the merge window, here is a look at all of the new features that have been merged for the Linux 6.9 kernel cycle.
With my recent NVIDIA GH200 Grace CPU benchmarks carried out remotely via GPTshop.ai, besides looking at areas like the 64K kernel page size performance benefits I also ran some fresh benchmarks looking at the performance difference when the binaries were generated by LLVM Clang rather than the default GCC compiler on Ubuntu Linux. This article shows off the performance difference for the 72-core Neoverse-V2 server/HPC processor when leveraging LLVM Clang rather than the GNU Compiler Collection.
Earlier this week on Patch Tuesday was the disclosure by Intel of the Register File Data Sampling (RFDS) vulnerability and mitigation via updated CPU microcode and a kernel patch. RFDS is around malicious user-space software potentially being able to infer stale register values from kernel space. Register File Data Sampling affects recent Intel Atom / E-core bearing processors including the latest Raptor Lake Refresh processors. In this article are some initial benchmarks of the RFDS performance impact under Linux when using the Core i9 14900K processor.
Open-source developer Roman Gilg who is known for his work on KWinFT prior to its rebranding as Theseus' Ship has some more important news to share today by way of Phoronix. Here's his guest post announcing The Compositor Modules.
By default the AArch64 kernel on Ubuntu and other Linux distributions tend to default to a standard 4K page size but for newer AArch64 hardware especially in the server/HPC space, there can be great benefits to using a 64K page size. As it's been a while since I last ran any 64-bit ARM 4K vs. 64K kernel page size benchmarks, while having remote access to the NVIDIA GH200 I ran a fresh comparison for looking at the performance advantages to switching over to a 64K page size kernel. These new 64K kernel numbers are shown alongside the recent AMD EPYC and Intel Xeon CPU reference benchmark results for a look at how the 4K vs. 64K page size affects the overall computing landscape.
While the Linux 6.8 kernel merge window has been over for several weeks now, due to a busy February of new hardware releases and lots of Linux hardware reviews/benchmarking, I've been behind in writing up my Linux 6.8 feature recap. For those wanting a concise look at the many great changes coming with Linux 6.8 that will debut as stable in March, here's an overview of the interesting Linux 6.8 changes.
Last week I posted a number of fresh GCC vs. LLVM Clang compiler performance benchmarks using an AMD Ryzen Threadripper PRO 7995WX (Zen 4) system using the HP Z6 G5 A workstation running Ubuntu Linux. For those wondering about the performance of GCC vs. Clang generated binaries on something much more modest, here are some benchmarks when testing on a Meteor Lake laptop with the Core Ultra 7 155H.
As it's been a while since last delivering any competitive GCC versus LLVM Clang compiler competitive analysis and with the year quickly drawing a close, here's a fresh look at the GCC vs. Clang C/C++ compiler performance of various resulting application binaries tested on x86_64. GCC 13 vs. Clang 17 were tested as what's readily available on Ubuntu 23.10 Linux plus a look ahead in using the latest GCC 14 and LLVM Clang 18 development snapshots as of this week.
The AMD Ryzen Threadripper 7000 series offer great performance out-of-the-box for Linux desktop/workstation users as shown in my Ryzen Threadripper 7970X and 7980X benchmarks along with the Threadripper PRO 7995WX. While a more common tunable on the EPYC side, the Threadripper 7000 series can also benefit from Nodes Per Socket (NPS) / Sub-NUMA Clustering (SNC) tuning for enhancing the performance of some workloads. In this article is a look at dozens of benchmarks while looking at the performance impact of SNC2/SNC4 adjustments for the Zen 4 Threadripper.
921 software articles published on Phoronix.