1. Computers
  2. Display Drivers
  3. Graphics Cards
  4. Memory
  5. Motherboards
  6. Processors
  7. Software
  8. Storage
  9. Operating Systems


Facebook RSS Twitter Twitter Google Plus


Phoronix Test Suite

OpenBenchmarking Benchmarking Platform
Phoromatic Test Orchestration

Link-Time Optimizations With GCC 4.8

Compiler

Published on 09 February 2013 08:25 PM EST
Written by Michael Larabel in Compiler
14 Comments

GCC 4.8 will feature a few improvements when it comes to LTO, a.k.a. Link-Time Optimization, but will this reflect in any greater performance for the resulting binaries?

Officially, the improvements to Link-Time Optimization in the forthcoming release of GCC 4.8.0 is rewriting the LTO partitioning for "better reliability and maintainability." The GCC change-log notes that several important bugs leading to link failures have been fixed. Being in a mood to do some new compiler benchmarks this weekend (and needing to test some new compiler-related features for the forthcoming release of Phoronix Test Suite 4.4-Forsand), I did a quick few on GCC 4.7 and 4.8 while testing LTO. My last article on the subject was the GCC 4.7 Link-Time Optimization Performance from August. If you're not familiar with LTO, read the aforelinked article for more details on what LTO means for GCC and how to exploit the potentially performance-enhancing feature.

Embedded in this posting are just a couple of the benchmark results where there was something to see while the system software/hardware details, system logs, and other test profile results can be found on OpenBenchmarking.org within 1302087-FO-GCC48LINK37. For GCC 4.8.0, the late January development snapshot was used.

BYTE's Dhrystone 2 test is one of the few cases where LTO has a marked performance difference. GCC 4.8 was slightly faster than GCC 4.7 on the Intel Core i7 "Ivy Bridge" system, but when enabling LTO it was of less advantage than under GCC 4.7.2.

Regardless of the impact on the performance of the resulting binary, the compilation time takes longer when enabling LTO due to the optimizations being done, well, at link-time.

LTO did cause the performance to regress for the widely-used Bullet Physics Engine.

In another result file (1302092-FO-GCC48LTO575) are a couple of more LTO benchmarks of the Core i7 system on the GCC 4.8.0 snapshot. With these extra benchmarks, the "-fwhole-program" compile-time switch was also tested beyond just "-flto" for enabling LTO. The "-fwhole-program" switch is to let the compiler "assume that the current compilation unit represents the whole program being compiled."

Find more compiler benchmarks at OpenBenchmarking.org.

About The Author
Michael Larabel is the principal author of Phoronix.com and founded the web-site in 2004 with a focus on enriching the Linux hardware experience and being the largest web-site devoted to Linux hardware reviews, particularly for products relevant to Linux gamers and enthusiasts but also commonly reviewing servers/workstations and embedded Linux devices. Michael has written more than 10,000 articles covering the state of Linux hardware support, Linux performance, graphics hardware drivers, and other topics. Michael is also the lead developer of the Phoronix Test Suite, Phoromatic, and OpenBenchmarking.org automated testing software. He can be followed via and or contacted via .
Latest Linux News
  1. Intel Has More Graphics Code For Testing, Plans For Linux 4.3
  2. GTK+ File Chooser Receiving Many Improvements
  3. Mesa 10.5.9 Is The Last Of The Series
  4. Trying To Run The Intel Core i7 5775C On Linux
  5. VirtualBox 5.0 RC3 Brings VMM Fixes, Takes Care Of Some KDE DnD Problems
  6. Ubuntu Is Finally Fixing Its Annoying GRUB Setting
  7. Firefox 39.0 Brings New Features, HTML5 Changes
  8. OPNsense 15.7 Released As Fork Of Pfsense
  9. The Less-Powerful Intel Compute Stick With Ubuntu Will Soon Ship
  10. Kodi 15.0 Release Candidate 1 Arrives
Latest Articles & Reviews
  1. 6-Way File-System Comparison On The Linux 4.1 Kernel
  2. How KDE VDG Is Trying To Make Open-Source Software Beautiful
  3. Attempting To Try Out BCache On The Linux 4.1 Kernel
  4. CompuLab's Fitlet Is A Very Tiny, Fanless, Linux PC With AMD A10 Micro
Most Viewed News This Week
  1. Pinos Is For Linux Video What PulseAudio Is For Audio
  2. The State & Complications Of Porting The Unity Editor To Linux
  3. The Staging Pull For Linux 4.2: "Big, Really Big"
  4. Latest Rumor Pegs Microsoft Wanting To Buy AMD
  5. "PulseVideo" Coming To Complement PulseAudio?
  6. Exciting Features Merged So Far For The Linux 4.2 Kernel
  7. RadeonSI Gallium3D Gets New OpenGL 4 Bits
  8. Linux 4.2 Advertises GFS2 Performance Improvements