PathScale's ENZO Compiler To Speed Code On GPU

Posted by Michael Larabel on February 11, 2013

PathScale, the company that's focused on providing high-performance compiler solutions, is hoping to speed up traditional software packages by automatically leveraging the graphics processor when compiling software with the PathScale ENZO compiler.

After writing this weekend about the PathScale EKOPath 5.0 Beta, Christopher Bergström, the CTO of PathScale, commented in our forums about some of their current work.

First of all, one of the interesting features that Bergström notes about EKOPath 5.0 is that it's leveraging some of Clang. Clang is the C/C++ front-end compiler to LLVM and while EKOPath isn't based upon LLVM, they have adapted some of the BSD-licensed Clang code to their compiler. "EKOPath 5 is really a *BIG* difference behind the scenes. For example if you do pathcc -show hello.c # You'll notice that we're using a modified clang as part of the process. This is almost certainly what allowed us build those additional benchmarks. (To clarify a bit - we're not using the llvm backend or any llvm ir. In the past we were using a modified gnu cc1, but that and all other gnu code has been removed.)"

It was also revealed that the succeeding EKOPath compiler (likely EKOPath 5.5) will feature a new compiler back-end. "EKOPath 5.5 will have a new backend we've been working on, but pushing out both of those big changes at the same time just wasn't possible."

Bergström then commented on their ENZO compiler product that is a GPGPU and multi-core solution that supports HMPP, Fortran, and C/C++ languages. ENZO features true GPGPU network zero copy, the PSCNV open-source compute Tesla driver (their Nouveau driver fork), PathAS assembler support for GPGPU, is compatible with CUDA code, and they have their own PathScale C++ template/class libraries for GPGPU. This "sister compiler to EKOPath", he explains, can now accelerate more code on the GPU. "While it's not possible speed-up every code on the GPU - We have put a huge amount of work in the programming models available for ENZO and it's backend performance. Personally, I don't get as excited (or worried) about 5-10% CPU performance when we can offer 30% gains to 10x with the GPU. I can't make promises, but we may try to drop a few OpenACC pragma around those benchmarks and post numbers on a Tesla 2050."

He followed up to a Phoronix reader comment about the GPU usage with, "I'd certainly recommend you test EKOPath and Intel compilers if you don't have a GPU. If you can get access to a system with a GPU (Tesla 2050, 2070 or 2090) *and* you're willing to add some pragma or directives to your code ENZO may be interesting. (The performance gains can be well worth the effort) We're working on support for -autogpu which like autovectorization or other automatic optimizations requires zero code changes. This isn't ready for production and just "noteworthy" at this point. (Honestly, give us a couple more months)."

It looks like interesting times are ahead for compilers exploiting the massively parallel capabilities of graphics processors. On the open-source side there's been work for LLVM automatic GPGPU code generation although nothing too exciting for end-users at the moment. Unfortunately, OpenACC isn't too widely implemented by open-source compilers at the moment. OpenACC is an industry, open standard to simplify parallel programming on CPUs and GPUs.

Discuss this article in our forums, IRC channel, or email the author. You can also follow our content via RSS and on social networks like Facebook, Identi.ca, and Twitter (@Phoronix and @MichaelLarabel). Subscribe to Phoronix Premium to view our content without advertisements, view entire articles on a single page, and experience other benefits.
Latest Hardware Reviews
  1. Sumo Lounge Emperor
  2. Gallium3D Continues Improving OpenGL For Older Radeon GPUs
  3. 15-Way Open vs. Closed Source NVIDIA/AMD Linux GPU Comparison
  4. Nouveau vs. NVIDIA Linux Comparison Shows Shortcomings
Latest Software Articles
  1. GCC 4.8.0 vs. LLVM Clang 3.3 Compiler Performance
  2. Intel Linux OpenGL Driver Leading Over Apple OS X
  3. The Cost Of Ubuntu Disk Encryption
  4. Btrfs vs. EXT4 vs. XFS vs. F2FS On Linux 3.10
Latest Linux News
  1. A New X.Org-Free Wayland LiveCD Released
  2. Unity 8, Mir Made Progress This Week On Features
  3. LLVM Clang 3.3 RC2 Is Ready For Testing
  4. AMD RadeonSI Gallium3D Begins Simple CL Demos
  5. Intel Shows Off GNOME3-Based Tizen Shell
  6. Linux Desktop Security Could Be A Whole Lot Better
  7. KDE 4.11 Will Be The Last Major KDE4 Workspaces Feature Release
  8. New NVIDIA Linux Driver Supports The GeForce GTX 780
  9. Chrome 28 To Offer More Speed Improvements
  10. Digia Announces "Boot To Qt" Project
  11. X.Org Libraries Hit By Round Of Security Issues
Latest Forum Talk
  1. A New X.Org-Free Wayland LiveCD Released
  2. Intel Shows Off GNOME3-Based Tizen Shell
  3. Is there anyway to improve the performance of the...
  4. KDE 4.11 Will Be The Last Major KDE4 Workspaces...
  5. Steam: No used games...
  6. New Intel X.Org Driver Supports All Of Haswell
  1. Computers
  2. Display Drivers
  3. Graphics Cards
  4. Motherboards
  5. Peripherals
  6. Processors
  7. Software
  8. Operating Systems
  9. All Articles
  1. Linux Benchmarking
  2. OpenBenchmarking.org
  3. Phoronix Test Suite