AutoFDO Optimizations For The GCC Compiler

Posted by Michael Larabel on September 30, 2012

A patch has emerged that provides "AutoFDO" support for the GCC compiler for automated feedback-directed optimizations.

Dehao Chen of Google published an AutoFDO implementation to the GCC mailing list on Friday.
This patch implements the fine-graind AutoFDO optimizations for GCC. It uses linux perf to collect sample profiles, and uses debug info to represent the profile. In GCC, it uses the profile to annotate CFG to drive FDO. This can bring 50% to 110% of the speedup derived by traditional instrumentation based FDO. (Average is between 70% to 80% for many CPU intensive applications). Comparing with traditional FDO, AutoFDO does not require instrumentation. It just need to have an optimized binary with debug info to collect the profile.

This patch has passed bootstrap and gcc regression tests as well as tested with crosstool. Okay for google branches?

If people in up-stream find this feature interesting, I'll spend some time to port this to trunk and try to opensource the tool to generate profile data file.
So this implementation is also faster than the traditional Feedback Directed Optimization (FDO) support currently found in GCC.

Up until now traditional FDO has required running the specially-created binary manually to collect sample/training data that can then be passed back to the GNU Compiler Collection for producing a more-optimized binary on an application-specific case. Among the optimizations being dealt with here are instruction scheduling, basic block re-ordering, function splitting, and register allocation.

Let's hope it makes it into GCC trunk! Until then, the patch is on gcc-patches.

Discuss this article in our forums, IRC channel, or email the author. You can also follow our content via RSS and on social networks like Facebook, Identi.ca, and Twitter (@Phoronix and @MichaelLarabel). Subscribe to Phoronix Premium to view our content without advertisements, view entire articles on a single page, and experience other benefits.
Latest Hardware Reviews
  1. Sumo Lounge Emperor
  2. Gallium3D Continues Improving OpenGL For Older Radeon GPUs
  3. 15-Way Open vs. Closed Source NVIDIA/AMD Linux GPU Comparison
  4. Nouveau vs. NVIDIA Linux Comparison Shows Shortcomings
Latest Software Articles
  1. Intel Linux OpenGL Driver Leading Over Apple OS X
  2. The Cost Of Ubuntu Disk Encryption
  3. Btrfs vs. EXT4 vs. XFS vs. F2FS On Linux 3.10
  4. AMD Radeon R600 GPU LLVM 3.3 Back-End Testing
Latest Linux News
  1. Linux Desktop Security Could Be A Whole Lot Better
  2. KDE 4.11 Will Be The Last Major KDE4 Workspaces Feature Release
  3. New NVIDIA Linux Driver Supports The GeForce GTX 780
  4. Chrome 28 To Offer More Speed Improvements
  5. Digia Announces "Boot To Qt" Project
  6. X.Org Libraries Hit By Round Of Security Issues
  7. Wayland's Weston Gets Output Scaling Support
  8. Raspberry Pi Gets New Wayland Weston Renderer
  9. Debian GNU/Hurd 2013 Release Brings New Packages
  10. Intel Ultrabook Performance Is Faster With Mesa 9.2
  11. Hot Relocation HDD To SSD Support For Btrfs
Latest Forum Talk
  1. Chrome 28 To Offer More Speed Improvements
  2. Debian GNU/Hurd 2013 Release Brings New Packages
  3. KDE 4.11 Will Be The Last Major KDE4 Workspaces...
  4. Linux Desktop Security Could Be A Whole Lot Better
  5. how to use old laptops with via gfx
  6. Raspberry Pi Gets New Wayland Weston Renderer
  1. Computers
  2. Display Drivers
  3. Graphics Cards
  4. Motherboards
  5. Peripherals
  6. Processors
  7. Software
  8. Operating Systems
  9. All Articles
  1. Linux Benchmarking
  2. OpenBenchmarking.org
  3. Phoronix Test Suite