1. Computers
  2. Display Drivers
  3. Graphics Cards
  4. Memory
  5. Motherboards
  6. Processors
  7. Software
  8. Storage
  9. Operating Systems


Facebook RSS Twitter Twitter Google Plus


Phoronix Test Suite

OpenBenchmarking.org

NVIDIA's CUDA/OpenCL PTX Back-End In LLVM 3.2

Compiler

Published on 16 December 2012 11:00 AM EST
Written by Michael Larabel in Compiler
3 Comments

In preparing for the imminent release of LLVM 3.2, another worthwhile feature to go over is the NVPTX back-end that's been merged for this forthcoming open-source compiler infrastructure release. The NVPTX LLVM back-end is what's used by NVIDIA's closed-source driver for its CUDA and OpenCL compiler.

NVIDIA's "NVPTX" Parallel Thread Execution back-end is replacing the earlier PTX back-end that was previously living within LLVM. NVPTX is what NVIDIA opened up out of their NVCC CUDA and OpenCL compiler, so it's rather high quality and in very good shape. NVPTX is compatible with PTX 3.1 and SM 3.5, supports NVVM intrinsics of the NVIDIA Compiler SDK, is fully compatible with the old PTX back-end, and has much greater coverage of going from LLVM IR to PTX code.

NVIDIA published this new PTX back-end back in April. Parallel Thread Execution is an Assembly-like language that NVIDIA's graphics driver then translates into binary code for the respective hardware. With this NVPTX back-end now being open-sourced and part of LLVM, new possibilities are opened up. Though this work won't directly benefit the open-source Nouveau graphics driver project since it doesn't deal with PTX and the current Nouveau implementation takes LLVM IR and converts it into Gallium3D TGSI for use by their existing compiler.

On the other side of the table, the Radeon R600 back-end was recently merged into LLVM but that won't be appearing in an official release until next year with LLVM 3.3.

With LLVM 3.2 there is also improved CPU support for everything from Apple's A6 SoC in the iPhone 5 to better handling AVX2 with Intel Haswell CPUs, introduces an automatic loop vectorizer, better Polly optimizations, and much more.

Look for the LLVM 3.2 release to happen any time now... The original release plan was to release LLVM 3.2 today (16 December), but there's been no word if it's still happening or has been pushed back by a few days.

About The Author
Michael Larabel is the principal author of Phoronix.com and founded the web-site in 2004 with a focus on enriching the Linux hardware experience and being the largest web-site devoted to Linux hardware reviews, particularly for products relevant to Linux gamers and enthusiasts but also commonly reviewing servers/workstations and embedded Linux devices. Michael has written more than 10,000 articles covering the state of Linux hardware support, Linux performance, graphics hardware drivers, and other topics. Michael is also the lead developer of the Phoronix Test Suite, Phoromatic, and OpenBenchmarking.org automated testing software. He can be followed via and or contacted via .
Latest Linux Hardware Reviews
  1. Intel Pentium G3258 On Linux
  2. SilverStone Precision PS10
  3. ASRock Z97 Extreme6
  4. Nouveau Re-Clocking Is Way Faster, Shows Much Progress For Open-Source NVIDIA
Latest Linux Articles
  1. KVM Benchmarks On Ubuntu 14.10
  2. X.Org Server 1.16 Officially Released With Terrific Features
  3. Ubuntu With Linux 3.16 Smashes OS X 10.9.4 On The MacBook Air
  4. Preview: Benchmarking CentOS 7.0 & Scientific Linux 7.0
Latest Linux News
  1. CPUFreq Ondemand Could Be Faster, Use Less Power With Linux 3.17
  2. Intel Adds BPTC Texture Compression To Their Mesa Driver
  3. The Linux Kernel Bang-Bang Thermal Governor Is Banging
  4. NVIDIA Releases K1-Powered Shield Tablet & Controller
  5. Xen Project Announces Mirage OS 2.0
  6. Canonical Community Team Changes Announced For Ubuntu
  7. Raspberry Pi B+ ARM Debian Benchmarks
  8. Mozilla Unleashes Firefox 31 Web Browser
  9. GCC 5.0 Is Expected Next Year
  10. PHP5's Successor Might Be PHP7
Latest Forum Discussions
  1. AMD "Hawaii" Open-Source GPU Acceleration Still Not Working Right
  2. Open-Source Radeon Performance Boosted By Linux 3.16
  3. In Road To Qt, Audacious Switches From GTK3 Back To GTK2
  4. Debian + Steam + r600
  5. Next-Gen OpenGL To Be Announced Next Month
  6. Ubuntu With Linux 3.16 Smashes OS X 10.9.4 On The MacBook Air
  7. Updated and Optimized Ubuntu Free Graphics Drivers
  8. AMD Publishes Open-Source Linux HSA Kernel Driver