1. Computers
  2. Display Drivers
  3. Graphics Cards
  4. Memory
  5. Motherboards
  6. Processors
  7. Software
  8. Storage
  9. Operating Systems


Facebook RSS Twitter Twitter Google Plus


Phoronix Test Suite

OpenBenchmarking.org

LLVM May Expand Its Use Of The Loop Vectorizer

Compiler

Published on 06 June 2013 01:05 AM EDT
Written by Michael Larabel in Compiler
8 Comments

LLVM's Loop Vectorizer, which is able to automatically vectorize code loops for performance benefits in many scenarios, may find its use expanded for other optimization levels in future LLVM releases.

LLVM's automatic loop vectorizer was merged for the LLVM 3.2 release and benchmarking the loop vectorizer showed it to provide performance benefits for many scenarios. In the LLVM 3.2 release it wasn't enabled by default, but for LLVM 3.3 it's now enabled when using the -O3 optimization level.

Besides enabling it default for this greatest optimization level, LLVM 3.3 also provided improvements to the loop vectorizer. The LLVM loop vectorizer is now in good standing and so it might be enabled too by default for -O2 and -Os.

There's still some differing views on why the vectorizer should be turned on for -O2, which is the mid optimization level before -O3, but for at least -Os it should be turned on. The -Os level is when optimizing generated binaries for size. The loop vectorizer has the potential of increasing the binary size for some loops, but LLVM is able to weight that information and decide when to vectorize or not.

Apple LLVM developers have now been discussing on the LLVM mailing list about expanding the loop vectorizer's usage by default. The performance wins provided by this vectorizer seem to be worth it to many people even at the potential cost of a slightly longer compile time or the chance that the resulting binary size is slightly larger. This is a change that won't come for LLVM 3.3 but will be for LLVM 3.4 or later; we'll see what happens and post the decision on Phoronix.

LLVM 3.3 also has the interesting SLP vectorizer to optimize straight-line code, but this current discussion is only talking about the loop vectorizer.

About The Author
Michael Larabel is the principal author of Phoronix.com and founded the web-site in 2004 with a focus on enriching the Linux hardware experience and being the largest web-site devoted to Linux hardware reviews, particularly for products relevant to Linux gamers and enthusiasts but also commonly reviewing servers/workstations and embedded Linux devices. Michael has written more than 10,000 articles covering the state of Linux hardware support, Linux performance, graphics hardware drivers, and other topics. Michael is also the lead developer of the Phoronix Test Suite, Phoromatic, and OpenBenchmarking.org automated testing software. He can be followed via and or contacted via .
Latest Linux Hardware Reviews
  1. Btrfs On 4 x Intel SSDs In RAID 0/1/5/6/10
  2. AMD Radeon R9 290 On Ubuntu 14.10: RadeonSI Gallium3D vs. Catalyst
  3. MSI X99S SLI PLUS On Linux
  4. NVIDIA GeForce GTX 970 Offers Great Linux Performance
Latest Linux Articles
  1. NVIDIA's Linux Driver Can Deliver Better OpenGL Performance Than Windows 8.1
  2. Windows 8.1 vs. Ubuntu 14.10 With Intel HD Graphics
  3. 6-Way Ubuntu 14.10 Radeon Gallium3D vs. Catalyst Driver Comparison
  4. NVIDIA vs. Nouveau Drivers On Ubuntu 14.10
Latest Linux News
  1. Wine 1.7.30 Continues Work On DirectWrite & Offers Regedit Fixes
  2. Has The Sky Fallen? Qualcomm Contributes To Freedreno's DRM/KMS Driver
  3. Manjaro Works To Make Calamares A Distribution-Independent Installer
  4. DisplayLink USB 3.0 Support Sounds Like A Mess
  5. PulseAudio Gains A Native Bluetooth Headset Backend
  6. X.Org Foundation Decides On Its Women Outreach Project
  7. GTK+ 3.16's New GtkGLArea Widget Gets Improved
  8. X.Org Server 1.17 ABI Bumped
  9. Fedora 21 Beta To Be Released Next Week
  10. Go 1.4 Beta Release Brings Big Runtime Changes
Latest Forum Discussions
  1. What Would You Like To See Next?
  2. Closed source to opensource
  3. How to get rid of Linux
  4. Is foolish currently develop in machine code, hexadecimal and assembly?
  5. Reducing The CPU Usage In Mesa To Improve Performance
  6. Help diagnosing problems with a Readon HD 4670 on Mesa 10.3.2-1
  7. Advertisements On Phoronix
  8. nv and xorg.conf under Debian PPC