1. Computers
  2. Display Drivers
  3. Graphics Cards
  4. Memory
  5. Motherboards
  6. Processors
  7. Software
  8. Storage
  9. Operating Systems


Facebook RSS Twitter Twitter Google Plus


Phoronix Test Suite

OpenBenchmarking.org

Benchmarking The SLP Vectorizer On LLVM Clang 3.4

Compiler

Published on 29 July 2013 03:24 PM EDT
Written by Michael Larabel in Compiler
Comment On This Article

Following word this weekend that Apple and Google engineers agree on SLP vectorization by default for the LLVM/Clang compiler, I carried out some fresh SLP Vectorizer benchmarks this weekend from the LLVM Clang 3.4 SVN development code.

The SLP Vectorizer can vectorize memory access, arithmetic operations, comparison operations, and other select operations. Back when it was ready in LLVM Clang 3.3 I did some early benchmarks and explained it in more detail. There's also the LLVM auto-vectorizer documentation.

With LLVM Clang 3.4 SVN it looks like the superword-level parallelism vectorizer will at least be enabled for the -O3 optimization level if not for other optimization levels too. With this upcoming change, from the LLVM/Clang Subversion code as of this weekend I ran some benchmarks when comparing the -fslp-vectorize compiler switch for a range of C/C++ benchmarks. The -O3 -march=native compiler switches were set the entire time.

These test results can be found on OpenBenchmarking.org in 1307291-SO-FSLPVECTO83.


For most of our real-world workload tests on Linux with LLVM/Clang 3.4 SVN, there was little change in performance out of the basic SLP Vectorizer. However, as the benchmarks showed this past weekend, for certain operations and micro-benchmarks there are worthwhile improvements to find with this straight-code vectorizer. There's at least no regressions even though this isn't quite as useful as the Loop Vectorizer.

About The Author
Michael Larabel is the principal author of Phoronix.com and founded the web-site in 2004 with a focus on enriching the Linux hardware experience and being the largest web-site devoted to Linux hardware reviews, particularly for products relevant to Linux gamers and enthusiasts but also commonly reviewing servers/workstations and embedded Linux devices. Michael has written more than 10,000 articles covering the state of Linux hardware support, Linux performance, graphics hardware drivers, and other topics. Michael is also the lead developer of the Phoronix Test Suite, Phoromatic, and OpenBenchmarking.org automated testing software. He can be followed via and or contacted via .
Latest Linux Hardware Reviews
  1. CompuLab Intense-PC2: An Excellent, Fanless, Mini PC Powered By Intel's i7 Haswell
  2. From The Atom 330 To Haswell ULT: Intel Linux Performance Benchmarks
  3. AMD Radeon R9 285 Tonga Performance On Linux
  4. Apotop Wi-Copy
Latest Linux Articles
  1. AMD Moves Forward With Unified Linux Driver Strategy, New Kernel Driver
  2. MSI: Update Your BIOS From The Linux Desktop
  3. NVIDIA vs. AMD 2D Linux Drivers: Catalyst Is Getting Quite Good At 2D
  4. 15-Way GPU Comparison With Mesa 10.3 + Linux 3.17
Latest Linux News
  1. Phoronix Test Suite 5.4 M3 Is Another Hearty Update
  2. GParted 0.20 Improves Btrfs Support
  3. EXT4 In Linux 3.18 Has Clean-ups, Bug Fixes
  4. Emacs 24.4 Has Built-In Web Browser, Improved Multi-Monitor Support
  5. NVIDIA's NVPTX Support For GCC Is Close To Being Merged
  6. KDE's KWin On Wayland Begins Using Libinput
  7. Khronos Releases OpenVX 1.0 Specification
  8. Linux Kernel Working Towards GNU11/C11 Compatibility
  9. Ubuntu 15.04 Is Codenamed After A Monkey: Vivid Vervet
  10. Following GCC, Clang Looks To Default To C11
Latest Forum Discussions
  1. HOPE: The Ease Of Python With The Speed Of C++
  2. Updated and Optimized Ubuntu Free Graphics Drivers
  3. Users/Developers Threatening Fork Of Debian GNU/Linux
  4. Bye bye BSD, Hello Linux: A Sys Admin's Story
  5. NVIDIA Presents Its Driver Plans To Support Mir/Wayland & KMS On Linux
  6. AMD Is Restructuring Again, Losing 7% Of Employees
  7. Open-Source AMD Fusion E-350 Support Takes A Dive
  8. Upgrade to Kaveri, very slow VDPAU performance