Intel Makes Microsoft's C++ AMP Cross-Platform

Posted by Michael Larabel on November 16, 2012

Microsoft conceived C++ Accelerated Massive Parallelism (AMP) as a library atop DirectX 11 for offering data-parallelism directly in C++ that can make easy use of GPUs while having CPU fall-back support. With C++ AMP being similar to OpenCL, Intel engineers decided to implement the Microsoft specification within OpenCL and using LLVM/Clang so that it can be used cross-platform.

Microsoft considers C++ Accelerated Massive Parallelism to be one of their open specifications (it's under their "Community Promise" license), but with being implemented atop DirectX 11 and the compiler support being only built into Microsoft Visual Studio 2012, it isn't widely available outside of Microsoft's scope.

Engineers at Intel ended up developing "Shevlin Park", which is a prototype implementation of C++ AMP built using OpenCL with LLVM/Clang. The LLVM/Clang compiler stack was modified to handle C++ AMP programming constructions and the C++ AMP computations expressed within OpenCL compute kernels.

The C++ AMP run-time library was also implemented on an OpenCL run-time. Being implemented in this manner, C++ AMP can now be used within non-Microsoft/Windows environments. This Intel implementation with LLVM works on both the GPU and CPU.

The Shevlin Park project was talked about earlier this month at the LLVM Developers' Meeting in San Jose, California. The Intel slides covering Shevlin Park can be found here (PDF).

As far as why someone would want to try C++ AMP rather than just using OpenCL or other GPGPU models, Intel's Dillon Sharlet describes the Microsoft interface as an "elegant, minimal C++ extensions and template libraries for data parallel programming." C++ AMP has the host and device code in the same programming language while concealing any driver APIs. Meanwhile, the data programming parallel model is very close to that of OpenCL and is similar to that of the NVIDIA CUDA run-time API.

Benchmarks by Intel show that their Shevlin Park implementation of C++ AMP can actually outperform that of the C++ AMP support within Visual Studio 2012. In some cases, using raw OpenCL is faster than both Accelerated Massive Parallelism versions.

The Shevlin Park work is still considered experimental but the slides are definitely worth checking out for anyone interested in low-level code/compiler work.

Discuss this article in our forums, IRC channel, or email the author. You can also follow our content via RSS and on social networks like Facebook, Identi.ca, and Twitter (@Phoronix and @MichaelLarabel). Subscribe to Phoronix Premium to view our content without advertisements, view entire articles on a single page, and experience other benefits.
Latest Hardware Reviews
  1. Sumo Lounge Emperor
  2. Gallium3D Continues Improving OpenGL For Older Radeon GPUs
  3. 15-Way Open vs. Closed Source NVIDIA/AMD Linux GPU Comparison
  4. Nouveau vs. NVIDIA Linux Comparison Shows Shortcomings
Latest Software Articles
  1. The Cost Of Ubuntu Disk Encryption
  2. Btrfs vs. EXT4 vs. XFS vs. F2FS On Linux 3.10
  3. AMD Radeon R600 GPU LLVM 3.3 Back-End Testing
  4. F2FS File-System Shows Regressions On Linux 3.10
Latest Linux News
  1. Mageia 3 Released, Still Using Legacy GRUB
  2. NetBSD 6.1 Brings In More Features
  3. Using Six Monitors With AMD's Open-Source Linux Driver
  4. Benchmarking The Intel P-State, CPUfreq Changes
  5. FreeBSD Still Working On Next-Gen Package Manager
  6. DNF Still Advancing As Experimental Yum For Fedora
  7. Logitech Begins Supporting Linux Users
  8. Modern Intel Gallium3D Driver Still Being Toyed With
  9. Linux 3.10 Kernel Benchmarks On A Core i7 Laptop
  10. GCC 4.8.1 Compiler Due To Be Out Next Week
  11. Linux 3.10 Kernel Benchmarks For Intel Ivy Bridge
Latest Forum Talk
  1. The Cost Of Ubuntu Disk Encryption
  2. DRM Moves Ahead With HTML5 Specification
  3. Mageia 3 Released, Still Using Legacy GRUB
  4. Logitech Begins Supporting Linux Users
  5. Logitech supports linux!
  6. Btrfs vs. EXT4 vs. XFS vs. F2FS On Linux 3.10
  1. Computers
  2. Display Drivers
  3. Graphics Cards
  4. Motherboards
  5. Peripherals
  6. Processors
  7. Software
  8. Operating Systems
  9. All Articles
  1. Linux Benchmarking
  2. OpenBenchmarking.org
  3. Phoronix Test Suite