1. Computers
  2. Display Drivers
  3. Graphics Cards
  4. Memory
  5. Motherboards
  6. Processors
  7. Software
  8. Storage
  9. Operating Systems


Facebook RSS Twitter Twitter Google Plus


Phoronix Test Suite

OpenBenchmarking.org

Multi-Core Scaling Performance Of AMD's Bulldozer

Michael Larabel

Published on 26 October 2011
Written by Michael Larabel
Page 3 of 7 - 20 Comments

Starting off is the C-Ray ray-tracing benchmark, which is a Phoronix favorite as it always manages to do an amazing job at stressing all available CPU cores and is compatible with numerous platforms. The C-Ray results show the FX-8150 Bulldozer not scaling quite as well as the Intel Core i7 990X, Intel Core i5 2500K, and dual AMD Opteron 2384 systems.

When moving from one core to two cores enabled, the C-Ray result on the i7 990X and Opteron 2384s were exactly twice as fast, the Core i5 2500K was 1.94x, and the FX-8150 was only 1.63x faster. However, the Bulldozer was at least ahead of the mobile Core i7 2630QM Sandy Bridge (a 2GHz quad-core with Hyper Threading), which was scaling very poorly. When at four cores, the FX-8150 was 3.26x faster than the single-core configuration, while the Core i5 2500K was hitting 3.68x and the Intel Core i7 990X Gulftown hit 4.01x.

When running at eight threads, the FX-8150 was 5.98x faster than the single-thread result. The Core i7 990X meanwhile was at just 4.38x, the Core i7 2630QM at 4.21x, and the dual AMD Opteron 2384 quad-core configuration was at 7.96x. The Bulldozer micro-architecture strategy is not as effective as having eight Shanghai cores, but it at least did better than the Intel Sandy Bridge / Gulftown CPUs that were partially utilizing Hyper Threading. Hyper Threading does very poorly with C-Ray and other select workloads, as is shown by the awkward performance of Gulftown when hitting eight and twelve threads.

With Smallpt, a light-weight and furiously fast multi-threaded path tracing test, the AMD FX-8150 multi-core scaling is right in line with the Sandy Bridge, Gulftown, and Shanghai processors. The only processor struggling to compete was the mobile Sandy Bridge (Core i7 2630QM). Hyper Threading tends to tarnish the 8/12 thread results for the Core i7 990X Extreme, but aside from that the Core i5 2500K, dual Opteron 2384, Core i7 990X, and FX-8150 results all scaled approximately the same. When testing four threads, the FX-8150 was 4.05x faster while the Core i5 2500K was at just 3.63x faster. With eight threads (fully utilizing the FX-8150), the improvement was 6.05x over the single-core result while the Opteron 2394 was at 8.02x and the Core i7 990X at 6.11x.

Latest Linux Hardware Reviews
  1. NVIDIA GeForce GTX 970 Offers Great Linux Performance
  2. CompuLab Intense-PC2: An Excellent, Fanless, Mini PC Powered By Intel's i7 Haswell
  3. From The Atom 330 To Haswell ULT: Intel Linux Performance Benchmarks
  4. AMD Radeon R9 285 Tonga Performance On Linux
Latest Linux Articles
  1. 6-Way Ubuntu 14.10 Linux Desktop Benchmarks
  2. Ubuntu 14.10 XMir System Compositor Benchmarks
  3. Btrfs RAID HDD Testing On Ubuntu Linux 14.10
  4. Ubuntu 14.10 Linux 32-bit vs. 64-bit Performance
Latest Linux News
  1. Mono Brings C# To The Unreal Engine 4
  2. Coreboot Now Has Support For Intel Broadwell Hardware
  3. Enlightenment's EFL 1.12 Alpha Has Evas GL-DRM Engine, OpenGL ES 1.1 Support
  4. GTK+ Lands Experimental Backend For Mir Display Server
  5. Ubuntu 14.10 Officially Released
  6. Mesa 10.4 Might Re-Enable HyperZ For R600g/RadeonSI
  7. Intel GVT-g GPU Virtualization Moves Closer
  8. GTK+ 3.16 To Bring Several New Features
  9. Debian 8.0 Jessie Has Many Multimedia Improvements
  10. What Linux Benchmarks Would You Like To See Next?
Latest Forum Discussions
  1. Linux hacker compares Solaris kernel code:
  2. Advertisements On Phoronix
  3. HOPE: The Ease Of Python With The Speed Of C++
  4. Updated and Optimized Ubuntu Free Graphics Drivers
  5. Users/Developers Threatening Fork Of Debian GNU/Linux
  6. Ubuntu 16.04 Might Be The Distribution's Last 32-Bit Release
  7. AMD Releases UVD Video Decode Support For R600 GPUs
  8. Proof that strlcpy is un-needed