OpenBLAS 0.3.27 Adds C-SKY Arch, Improved GEMM For AMD Zen & Sapphire Rapids Fixes

Written by Michael Larabel in Programming on 5 April 2024 at 10:17 AM EDT. 1 Comment
OpenBLAS 0.3.27 is out as the newest version of this prominent open-source Basic Linear Algebra Subprograms (BLAS) library.

OpenBLAS 0.3.27 brings initial support for the C-SKY architecture, caps the maximum number of threads for GEMM / GETRF / POTRF to avoid under-utilized/idle threads, better multi-threaded POTRF performance for all platforms, various other multi-threaded enhancements, faster OpenMP thread management, and a lot of other common enhancements to this great BLAS library.

Intel Xeon and AMD EPYC CPUs
OpenBLAS 0.3.27 has changes both for AMD and Intel this release.

OpenBLAS 0.3.27 also has a number of x86_64 fixes, including fixing LLVM compiler options for Intel Sapphire Rapids and improving fallbacks for Sapphire Rapids. On the AMD side, there is improved GEMM performance for AMD Zen targets.

Besides the x86_64 and C-SKY work, the new OpenBLAS release has further ARM tuning including initial support for the Cortex-A76 processor cores and Neoverse-V2 support within the DYNAMIC_ARCH builds. IBM POWER also has DGEMM and SGEMM performance optimizations, X280 CPU support in the RISC-V space, various LoongArch 64-bit optimizations, a few MIPS fixes, and more. This is a rather big release for the OpenBLAS library.

Downloads and the full list of OpenBLAS 0.3.27 changes via GitHub.
Related News
About The Author
Michael Larabel

Michael Larabel is the principal author of and founded the site in 2004 with a focus on enriching the Linux hardware experience. Michael has written more than 20,000 articles covering the state of Linux hardware support, Linux performance, graphics drivers, and other topics. Michael is also the lead developer of the Phoronix Test Suite, Phoromatic, and automated benchmarking software. He can be followed via Twitter, LinkedIn, or contacted via

Popular News This Week