FX-8350 Piledriver Tuning On AMD's Open64 Compiler

Published on October 24, 2012
Written by Michael Larabel
Page 1 of 4
Discuss This Article

With this week's unveiling of the FX-8350 eight-core processor being based on AMD's new Piledriver architecture, in this article are benchmarks when testing out the Piledriver "bdver2" optimizations within AMD's own Open64 compiler.

Back in August was when AMD released the AMD Open64 4.5.2 compiler, which introduced support for Family 15h Piledriver cores. With the Piledriver support came work within AMD's Open64 compiler fork for handling AVX, XOP, FMA3, FMA4, BMI, TBM, and F16C instruction sets.

From the features list for the Open64 4.5.2 release, "You can enable this new instructions in Piledriver core using the -march=bdver2. Alternately you can pick and choose the ISA to be enabled using -mfma (for FMA3), -mfma4, -mbmi, -mtbm flags." The "bdver2" target is the same as is used by GCC and LLVM/Clang for supporting the second-generation Bulldozer -- a.k.a. Piledriver -- AMD processors.

Earlier this month I did do some GCC compiler tuning tests for the Piledriver-based AMD A10-5800K Trinity APU. With the GCC testing of comparing the bdver2 micro-architecture support against bdver1 and other earlier AMD CPU targets, there wasn't much improvement out of using "-march=bdver2" when compiling the test application / benchmark binaries.

In the GCC Piledriver tuning tests from last week with GCC 4.7.2 I went over what the bdver2 target adds: BMI, TBM, F16C, and FMA3. FMA3 is a three operand variant (that's being pushed by Intel with Haswell) of Fused Multiply-Add rather than the four operand version, F16C allows for converting and storing 32-bit floating point values using 16-bits, TBM is Trailing Bit Manipulation, and BMI is Bit Manipulation Instructions.

From the AMD FX-8350 Eight-Core "Vishera" setup running Ubuntu 12.10 with the Linux 3.5 kernel, AMD's official 64-bit Open64 4.5.2 compiler was tested. Each time this compiler built the Phoronix Test Suite collection of tests while passing each time a different "-march=" value of k8, barcelona, bdver1, and then bdver2.

<< Previous Page
1
Latest Hardware Reviews
  1. Sumo Lounge Emperor
  2. Gallium3D Continues Improving OpenGL For Older Radeon GPUs
  3. 15-Way Open vs. Closed Source NVIDIA/AMD Linux GPU Comparison
  4. Nouveau vs. NVIDIA Linux Comparison Shows Shortcomings
Latest Software Articles
  1. GCC 4.8.0 vs. LLVM Clang 3.3 Compiler Performance
  2. Intel Linux OpenGL Driver Leading Over Apple OS X
  3. The Cost Of Ubuntu Disk Encryption
  4. Btrfs vs. EXT4 vs. XFS vs. F2FS On Linux 3.10
Latest Linux News
  1. A New X.Org-Free Wayland LiveCD Released
  2. Unity 8, Mir Made Progress This Week On Features
  3. LLVM Clang 3.3 RC2 Is Ready For Testing
  4. AMD RadeonSI Gallium3D Begins Simple CL Demos
  5. Intel Shows Off GNOME3-Based Tizen Shell
  6. Linux Desktop Security Could Be A Whole Lot Better
  7. KDE 4.11 Will Be The Last Major KDE4 Workspaces Feature Release
  8. New NVIDIA Linux Driver Supports The GeForce GTX 780
  9. Chrome 28 To Offer More Speed Improvements
  10. Digia Announces "Boot To Qt" Project
  11. X.Org Libraries Hit By Round Of Security Issues
Latest Forum Talk
  1. GCC 4.8.0 vs. LLVM Clang 3.3 Compiler Performance
  2. A New X.Org-Free Wayland LiveCD Released
  3. Linux's "Ondemand" Governor Is No...
  4. Microsoft Releases Skype For Linux 4.2, Has...
  5. Unity 8, Mir Made Progress This Week On Features
  6. Linux Desktop Security Could Be A Whole Lot Better
  1. Computers
  2. Display Drivers
  3. Graphics Cards
  4. Motherboards
  5. Peripherals
  6. Processors
  7. Software
  8. Operating Systems
  9. All Articles
  1. Linux Benchmarking
  2. OpenBenchmarking.org
  3. Phoronix Test Suite