1. Computers
  2. Display Drivers
  3. Graphics Cards
  4. Memory
  5. Motherboards
  6. Processors
  7. Software
  8. Storage
  9. Operating Systems


Facebook RSS Twitter Twitter Google Plus


Phoronix Test Suite

OpenBenchmarking.org

NVIDIA's CUDA/OpenCL PTX Back-End In LLVM 3.2

Compiler

Published on 16 December 2012 11:00 AM EST
Written by Michael Larabel in Compiler
3 Comments

In preparing for the imminent release of LLVM 3.2, another worthwhile feature to go over is the NVPTX back-end that's been merged for this forthcoming open-source compiler infrastructure release. The NVPTX LLVM back-end is what's used by NVIDIA's closed-source driver for its CUDA and OpenCL compiler.

NVIDIA's "NVPTX" Parallel Thread Execution back-end is replacing the earlier PTX back-end that was previously living within LLVM. NVPTX is what NVIDIA opened up out of their NVCC CUDA and OpenCL compiler, so it's rather high quality and in very good shape. NVPTX is compatible with PTX 3.1 and SM 3.5, supports NVVM intrinsics of the NVIDIA Compiler SDK, is fully compatible with the old PTX back-end, and has much greater coverage of going from LLVM IR to PTX code.

NVIDIA published this new PTX back-end back in April. Parallel Thread Execution is an Assembly-like language that NVIDIA's graphics driver then translates into binary code for the respective hardware. With this NVPTX back-end now being open-sourced and part of LLVM, new possibilities are opened up. Though this work won't directly benefit the open-source Nouveau graphics driver project since it doesn't deal with PTX and the current Nouveau implementation takes LLVM IR and converts it into Gallium3D TGSI for use by their existing compiler.

On the other side of the table, the Radeon R600 back-end was recently merged into LLVM but that won't be appearing in an official release until next year with LLVM 3.3.

With LLVM 3.2 there is also improved CPU support for everything from Apple's A6 SoC in the iPhone 5 to better handling AVX2 with Intel Haswell CPUs, introduces an automatic loop vectorizer, better Polly optimizations, and much more.

Look for the LLVM 3.2 release to happen any time now... The original release plan was to release LLVM 3.2 today (16 December), but there's been no word if it's still happening or has been pushed back by a few days.

About The Author
Michael Larabel is the principal author of Phoronix.com and founded the web-site in 2004 with a focus on enriching the Linux hardware experience and being the largest web-site devoted to Linux hardware reviews, particularly for products relevant to Linux gamers and enthusiasts but also commonly reviewing servers/workstations and embedded Linux devices. Michael has written more than 10,000 articles covering the state of Linux hardware support, Linux performance, graphics hardware drivers, and other topics. Michael is also the lead developer of the Phoronix Test Suite, Phoromatic, and OpenBenchmarking.org automated testing software. He can be followed via and or contacted via .
Latest Articles & Reviews
  1. Linux Compiler Benchmarks Of LLVM Clang 3.5 vs. LLVM Clang 3.6-rc1
  2. Intel Broadwell HD Graphics 5500: Windows 8.1 vs. Linux
  3. Linux Benchmarks Of NVIDIA's Early 2015 GeForce Line-Up
  4. NVIDIA GeForce GTX 960: A Great $200 GPU For Linux Gamers
  5. Disk Encryption Tests On Fedora 21
  6. Xonotic 0.8 Performance With The Open-Source AMD/NVIDIA Gallium3D Drivers
Latest Linux News
  1. Ubuntu's Mir Gains Server-Side Platform Probing
  2. Broadwell Linux Ultrabook Running MUCH Cooler Than Haswell
  3. LZHAM 1.0 Lossless Data Compression Codec Released
  4. LibreOffice 4.4 Is Coming Soon With New Features
  5. Linux Users Upset By Chromium's Busted HiDPI Support
  6. BPF Backend Merged Into LLVM To Make Use Of New Kernel Functionality
  7. Dying Light Is Headed To Linux, SteamOS
  8. Wayland 1.6.1 & Weston 1.6.1 Released
  9. Mesa 10.4.3 Brings A Bunch Of Fixes For The Direct3D "Nine" Support
  10. Intel Has A Few More Graphics Changes For The Linux 3.20 Kernel
Most Viewed News This Week
  1. Windows 10 To Be A Free Upgrade: What Linux Users Need To Know
  2. CoreOS Moves From Btrfs To EXT4 + OverlayFS
  3. Google Admin Encourages Trying Btrfs, Not ZFS On Linux
  4. TraceFS: The Newest Linux File-System
  5. My Initial Intel Broadwell Linux Experience With The ThinkPad X1 Carbon
  6. Mozilla's Servo Still On Track For 2015 Alpha Release
  7. Fedora 23 Likely To Pursue Wayland By Default
  8. Keith Packard Leaves Intel's Linux Graphics Work