CUDA 9.2 Released With GEMM Improvements
Written by Michael Larabel in NVIDIA on 17 May 2018 at 04:42 PM EDT. 2 Comments
We knew it was coming while today NVIDIA has rolled out the CUDA 9.2 stable release update.

The CUDA 9.2 release includes speed-ups for launching CUDA kernels as well as faster performance for GEMM computational performance for half-precision and small N matrices. CUDA 9.2 also fixes a number of issues, including incorrect results with some GEMM calls on V100 Tensor Core GPUs and other BLAS problems.

The CUDA 9.2 release does depend upon the latest NVIDIA 396 series driver. As of writing, NVIDIA has yet to put out a blog post outlining any other changes beyond what is found in the release notes.

The CUDA 9.2 tool-kit is available for download from Unfortunately this release doesn't yet support the new Ubuntu 18.04 LTS.
Related News
About The Author
Author picture

Michael Larabel is the principal author of and founded the site in 2004 with a focus on enriching the Linux hardware experience. Michael has written more than 20,000 articles covering the state of Linux hardware support, Linux performance, graphics drivers, and other topics. Michael is also the lead developer of the Phoronix Test Suite, Phoromatic, and automated benchmarking software. He can be followed via Twitter or contacted via

Popular News This Week