Intel Makes Microsoft's C++ AMP Cross-Platform

Published on 16 November 2012 06:29 AM EST
Written by Michael Larabel in Compiler

Microsoft conceived C++ Accelerated Massive Parallelism (AMP) as a library atop DirectX 11 that offers data parallelism directly in C++, making it easy to use GPUs while retaining CPU fall-back support. Because C++ AMP is similar to OpenCL, Intel engineers decided to implement the Microsoft specification on top of OpenCL using LLVM/Clang so that it can be used cross-platform.

Microsoft considers C++ Accelerated Massive Parallelism to be one of its open specifications (it's under the company's "Community Promise" license), but because it is implemented atop DirectX 11 and compiler support is built only into Microsoft Visual Studio 2012, it isn't widely available outside of Microsoft's platforms.

Engineers at Intel ended up developing "Shevlin Park", a prototype implementation of C++ AMP built using OpenCL with LLVM/Clang. The LLVM/Clang compiler stack was modified to handle C++ AMP programming constructs and to express C++ AMP computations as OpenCL compute kernels.
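For readers unfamiliar with OpenCL compute kernels, the sketch below shows what a small hand-written kernel looks like when held as a string in a C++ host program. It is illustrative only and is not output from Shevlin Park; the name `kVectorAddKernel` and the kernel name `vector_add` are assumptions made for this example.

```cpp
#include <string>

// Hand-written OpenCL C source for element-wise addition, kept as a C++
// string the way a host program would hand it to an OpenCL run-time.
// Illustrative only: this is NOT code generated by Shevlin Park.
const std::string kVectorAddKernel = R"CLC(
__kernel void vector_add(__global const float* a,
                         __global const float* b,
                         __global float* sum)
{
    size_t i = get_global_id(0);  // one work-item per element
    sum[i] = a[i] + b[i];
}
)CLC";
```

With raw OpenCL the kernel lives in a separate language and string, which is the split that a single-source model such as C++ AMP is meant to hide.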

The C++ AMP run-time library was also implemented on top of an OpenCL run-time. Implemented this way, C++ AMP can now be used in non-Microsoft/Windows environments. This Intel implementation with LLVM works on both the GPU and CPU.

The Shevlin Park project was presented earlier this month at the LLVM Developers' Meeting in San Jose, California. The Intel slides covering Shevlin Park are available as a PDF.

As for why someone would want to try C++ AMP rather than just using OpenCL or other GPGPU models, Intel's Dillon Sharlet describes the Microsoft interface as "elegant, minimal C++ extensions and template libraries for data parallel programming." C++ AMP keeps the host and device code in the same programming language while concealing any driver APIs. Meanwhile, its data-parallel programming model is very close to that of OpenCL and is similar to that of the NVIDIA CUDA run-time API.
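To make the single-source idea concrete, here is a minimal sketch in standard C++ that only mimics the shape of a data-parallel call. It does not use the real C++ AMP header or API; `toy::parallel_for_each`, `toy::index`, and `vector_add` are hypothetical stand-ins, and the "kernel" simply runs sequentially on the CPU.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

namespace toy {  // hypothetical stand-in, NOT the C++ AMP API

// One-dimensional index handed to the kernel for each element.
struct index {
    std::size_t value;
    std::size_t operator[](int) const { return value; }
};

// Invoke `kernel` once per element of the extent [0, n). A real
// implementation would dispatch these invocations to a GPU or CPU device;
// this sketch simply loops on the host.
template <typename Kernel>
void parallel_for_each(std::size_t n, Kernel kernel) {
    for (std::size_t i = 0; i < n; ++i) {
        kernel(index{i});
    }
}

}  // namespace toy

// Element-wise addition: the host-side setup and the per-element kernel
// (the lambda) are written in the same C++ source file.
std::vector<float> vector_add(const std::vector<float>& a,
                              const std::vector<float>& b) {
    assert(a.size() == b.size());
    std::vector<float> sum(a.size());
    toy::parallel_for_each(sum.size(), [&](toy::index idx) {
        sum[idx[0]] = a[idx[0]] + b[idx[0]];
    });
    return sum;
}
```

Calling `vector_add` with two equal-length vectors returns their element-wise sum. The point of the sketch is only that the per-element work is an ordinary lambda next to the host code, with no separate kernel language or explicit driver calls in view.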

Benchmarks by Intel show that their Shevlin Park implementation of C++ AMP can actually outperform the C++ AMP support within Visual Studio 2012. In some cases, raw OpenCL is faster than both C++ AMP implementations.

The Shevlin Park work is still considered experimental, but the slides are definitely worth checking out for anyone interested in low-level code/compiler work.

About The Author
Michael Larabel is the principal author of Phoronix.com and founded the web-site in 2004 with a focus on enriching the Linux hardware experience and being the largest web-site devoted to Linux hardware reviews, particularly for products relevant to Linux gamers and enthusiasts but also commonly reviewing servers/workstations and embedded Linux devices. Michael has written more than 10,000 articles covering the state of Linux hardware support, Linux performance, graphics hardware drivers, and other topics. Michael is also the lead developer of the Phoronix Test Suite, Phoromatic, and OpenBenchmarking.org automated testing software.