AMD Releases AOMP 16.0-2 For Its Radeon OpenMP Offloading Compiler
AOMP 16.0-2 is the newest snapshot of their compiler with Radeon / Instinct GPU offloading patches atop what is currently tracking the LLVM 16 upstream code-base.
AOMP 16.0-2 is re-based against an LLVM 16 Git state from mid-October, drops support for Ubuntu 18.04, brings various fixes, improves xteam reductions, adds support for the target team loops directive, forces synchronous execution of regions controlled via OMPX_FORCE_SYNC_REGIONS, and adds a new environment variable GPU_MAX_HW_QUEUES. The new GPU_MAX_HW_QUEUES environment variable can be used for controlling the number of HSA queues created while the default is four queues.
Source downloads as well as binaries for RHEL/CentOS, Ubuntu, and SUSE Linux Enterprise are available from GitHub. With time most of these relevant AMD Radeon/Instinct compiler patches should work their way into upstream LLVM. Unlike the AOCC compiler on the CPU side, at least AOMP has the patches publicly available for those interested in the changes over upstream LLVM.