1. Computers
  2. Display Drivers
  3. Graphics Cards
  4. Memory
  5. Motherboards
  6. Processors
  7. Software
  8. Storage
  9. Operating Systems


Facebook RSS Twitter Twitter Google Plus


Phoronix Test Suite

OpenBenchmarking Benchmarking Platform
Phoromatic Test Orchestration

Intel Makes Microsoft's C++ AMP Cross-Platform

Compiler

Published on 16 November 2012 06:29 AM EST
Written by Michael Larabel in Compiler
10 Comments

Microsoft conceived C++ Accelerated Massive Parallelism (AMP) as a library atop DirectX 11 for offering data-parallelism directly in C++ that can make easy use of GPUs while having CPU fall-back support. With C++ AMP being similar to OpenCL, Intel engineers decided to implement the Microsoft specification within OpenCL and using LLVM/Clang so that it can be used cross-platform.

Microsoft considers C++ Accelerated Massive Parallelism to be one of their open specifications (it's under their "Community Promise" license), but with being implemented atop DirectX 11 and the compiler support being only built into Microsoft Visual Studio 2012, it isn't widely available outside of Microsoft's scope.

Engineers at Intel ended up developing "Shevlin Park", which is a prototype implementation of C++ AMP built using OpenCL with LLVM/Clang. The LLVM/Clang compiler stack was modified to handle C++ AMP programming constructions and the C++ AMP computations expressed within OpenCL compute kernels.

The C++ AMP run-time library was also implemented on an OpenCL run-time. Being implemented in this manner, C++ AMP can now be used within non-Microsoft/Windows environments. This Intel implementation with LLVM works on both the GPU and CPU.

The Shevlin Park project was talked about earlier this month at the LLVM Developers' Meeting in San Jose, California. The Intel slides covering Shevlin Park can be found here (PDF).

As far as why someone would want to try C++ AMP rather than just using OpenCL or other GPGPU models, Intel's Dillon Sharlet describes the Microsoft interface as an "elegant, minimal C++ extensions and template libraries for data parallel programming." C++ AMP has the host and device code in the same programming language while concealing any driver APIs. Meanwhile, the data programming parallel model is very close to that of OpenCL and is similar to that of the NVIDIA CUDA run-time API.

Benchmarks by Intel show that their Shevlin Park implementation of C++ AMP can actually outperform that of the C++ AMP support within Visual Studio 2012. In some cases, using raw OpenCL is faster than both Accelerated Massive Parallelism versions.

The Shevlin Park work is still considered experimental but the slides are definitely worth checking out for anyone interested in low-level code/compiler work.

About The Author
Michael Larabel is the principal author of Phoronix.com and founded the web-site in 2004 with a focus on enriching the Linux hardware experience and being the largest web-site devoted to Linux hardware reviews, particularly for products relevant to Linux gamers and enthusiasts but also commonly reviewing servers/workstations and embedded Linux devices. Michael has written more than 10,000 articles covering the state of Linux hardware support, Linux performance, graphics hardware drivers, and other topics. Michael is also the lead developer of the Phoronix Test Suite, Phoromatic, and OpenBenchmarking.org automated testing software. He can be followed via and or contacted via .
Latest Linux News
  1. Linux 4.1-rc5 Kernel Released
  2. Mesa 10.5.6 Brings Fixes All Over The Place
  3. NVIDIA's Proprietary Driver Is Moving Closer With Kernel Mode-Setting
  4. The Latest Linux Kernel Git Code Fixes The EXT4 RAID0 Corruption Problem
  5. Features Added To Mesa 10.6 For Open-Source GPU Drivers
  6. Ubuntu's LXD vs. KVM For The Linux Cloud
  7. Fedora Server 22 Benchmarks With XFS & The Linux 4.0 Kernel
  8. GCC 6 Gets Support For The IBM z13 Mainframe Server
  9. Fedora 22 Is Being Released Next Tuesday
  10. OpenWRT 15.05 Preparing Improved Security & Better Networking
Latest Articles & Reviews
  1. The Latest Features For Linux Performance Management + Benchmark Monitoring
  2. Noctua NH-U12DX i4 + NF-F12
  3. Btrfs RAID 0/1 Benchmarks On The Linux 4.1 Kernel
  4. The State Of Various Firefox Features
Most Viewed News This Week
  1. The Linux 4.0 Kernel Currently Has An EXT4 Corruption Issue
  2. The Linux 4.0 EXT4 RAID Corruption Bug Has Been Uncovered
  3. AMDGPU Open-Source Driver Code Continues Maturing
  4. Microsoft Open-Sources The Windows Communication Foundation
  5. Systemd 220 Has Finally Been Released
  6. Another HTTPS Vulnerability Rattles The Internet
  7. LibreOffice 5.0 Open-Source Office Suite Has Been Branched
  8. LibreOffice 5.0 Beta 1 Released