R500 Texture Semaphores Merged To Master

Posted by Michael Larabel on October 15, 2011

The R500 texture semaphores work, the feature I wrote about and tested earlier this month, has been merged to master. This feature in the R300 Gallium3D open-source driver can provide some impressive performance improvements.

Tom Stellard, the former Google Summer of Code student who has since been hired by AMD, pushed several of his "r300/compiler" patches to mainline Mesa last night. The big one is implementing texture semaphore for Radeon X1000 (R500) hardware.
r300/compiler: Implement the texture semaphore
The texture semaphore allows for prefetching of texture data. On my RV515, this increases the FPS of Lightsmark by 33% (This is with the reg_rename pass enabled, which is enabled in the next commit). There is a new env variable now called RADEON_TEX_GROUP, which allows you to specify the maximum number of texture lookups to do at once. The default is 8, but different values could produce better results for various application / card combinations.

The benchmarks I did earlier this month confirmed these double-digit gains for certain texture-using OpenGL workloads like Lightsmark. Additionally, the RADEON_TEX_GROUP default of eight seemed to be an ideal number for delivering maximum performance.

Tom's other commits last night were allowing for merged instructions to be scheduled on demand, preventing the register allocator from creating non-native swizzles, not pairing output writes with GPR writes in the scheduler, and enabling register rename passing on R500 and using it before running through the optimization process.

The R300 compiler work on mainline Mesa can be viewed from CGit. This performance work will be part of Mesa 7.12/8.0. Read the earlier article for more information.

Discuss this article in our forums, IRC channel, or email the author. You can also follow our content via RSS and on social networks like Facebook, Identi.ca, and Twitter (@Phoronix and @MichaelLarabel). Subscribe to Phoronix Premium to view our content without advertisements, view entire articles on a single page, and experience other benefits.
Latest Hardware Reviews
  1. Intel Haswell HD Graphics 4600 vs. AMD Radeon Graphics On Linux
  2. Intel Haswell HD Graphics 4600 Performance On Ubuntu Linux
  3. Intel Core i7 4770K "Haswell" Benchmarks On Ubuntu Linux
  4. The First Experience Of Intel Haswell On Linux
Latest Software Articles
  1. Optimized Binaries Provide Great Benefits For Intel Haswell
  2. 11-Way Linux, BSD Platform Comparison
  3. SNA Acceleration Works Great For Intel Core i7 Haswell
  4. The Linux Evolution For Intel Haswell's Performance
Latest Linux News
  1. KDE's KWin Made Lots Of Progress In 4.11
  2. Ubuntu Announces Carrier Advisory Group
  3. Qt 5.1 Release Candidate 1 Has Arrived
  4. In-Fighting Continues Over Mir On Non-Unity Ubuntu
  5. Subversion 1.8 Presents New Features
  6. LLVM 3.3 Officially Released
  7. LLVM/Clang Now Uses Loop Vectorizer At New Levels
  8. Intel GPU Driver Tries To Rip Out FBDEV Support
  9. Coreboot Doing AMD USB 3.0, Q35 QEMU Emulation
  10. VP9 Codec Now Enabled By Default In Chrome
  11. openSUSE 13.1 M2 Plays On PulseAudio 4.0
Latest Forum Talk
  1. Ubuntu Announces Carrier Advisory Group
  2. Intel GPU Driver Tries To Rip Out FBDEV Support
  3. In-Fighting Continues Over Mir On Non-Unity Ubuntu
  4. Vote for GOG to add Linux versions of games they...
  5. Intel Haswell HD Graphics 4600 vs. AMD Radeon...
  6. Mir Still Causing Concerns By Ubuntu Derivatives
  1. Computers
  2. Display Drivers
  3. Graphics Cards
  4. Motherboards
  5. Peripherals
  6. Processors
  7. Software
  8. Operating Systems
  9. All Articles
  1. Linux Benchmarking
  2. OpenBenchmarking.org
  3. Phoronix Test Suite