1. Computers
  2. Display Drivers
  3. Graphics Cards
  4. Memory
  5. Motherboards
  6. Processors
  7. Software
  8. Storage
  9. Operating Systems


Facebook RSS Twitter Twitter Google Plus


Phoronix Test Suite

OpenBenchmarking.org

Intel's Mesa Driver Gets New Performance Patch

Intel

Published on 19 January 2013 04:34 PM EST
Written by Michael Larabel in Intel
1 Comment

Intel's Mesa DRI driver received a new patch this weekend that's capable of enhancing the open-source graphics driver's performance for some OpenGL games.

The new performance-enhancing Mesa patch is by Kenneth Graunke and entitled i965: Implement CopyTexSubImage2D via BLORP (and use it by default).

CopyTexSubImage2D is now implemented via the BLORP rather than the BLT engine to work around limitations of the latter not being able to blit X-tiled buffers and between buffers of different tiling modes.

In terms of what this patch means for end-users of Intel hardware on Linux, the PlaneShift MMORPG game is much faster. Up to now the game has just run at around one frame per second on Intel hardware while chewing up nearly the entire CPU due to its use of Y-tiled depth buffers with CopyTexSubImage2D. This is a common issue to massively multi-player games.

Additionally, the Xonotic first person shooter with 4x MSAA anti-aliasing is now measured to be about 6.35% faster as a result of this single Mesa patch.

The patch, which adds just over 100 lines of new code to the Mesa i965 DRI driver, can be found currently on the mailing list until it reaches mainline Mesa hopefully in time for next month's Mesa 9.1 release.
The BLT engine has many limitations. Currently, it can only blit X-tiled buffers (since we don't have a kernel API to whack the BLT tiling mode register), which means all depth/stencil operations get punted to meta code, which can be very CPU-intensive.

Even if we used the BLT engine, it can't blit between buffers with different tiling modes, such as an X-tiled non-MSAA ARGB8888 texture and a Y-tiled CMS ARGB8888 renderbuffer. This is a fundamental limitation, and the only way around that is to use BLORP.

Previously, BLORP only handled BlitFramebuffer. This patch adds an additional frontend for doing CopyTexSubImage. It also makes it the default. This is partly to increase testing and avoid hiding bugs, and partly because the BLORP path can already handle more cases. With trivial extensions, it should be able to handle everything the BLT can.

This helps PlaneShift massively, which tries to CopyTexSubImage2D between depth buffers whenever a player casts a spell. Since these are Y-tiled, we hit meta and software ReadPixels paths, eating 99% CPU while delivering ~1 FPS. This is particularly bad in an MMO setting because people cast spells all the time.

It also helps Xonotic in 4X MSAA mode. At default power management settings, I measured a 6.35138% +/- 0.672548% performance boost (n=5).

No Piglit regressions on Ivybridge. I have not tested Sandybridge.

About The Author
Michael Larabel is the principal author of Phoronix.com and founded the web-site in 2004 with a focus on enriching the Linux hardware experience and being the largest web-site devoted to Linux hardware reviews, particularly for products relevant to Linux gamers and enthusiasts but also commonly reviewing servers/workstations and embedded Linux devices. Michael has written more than 10,000 articles covering the state of Linux hardware support, Linux performance, graphics hardware drivers, and other topics. Michael is also the lead developer of the Phoronix Test Suite, Phoromatic, and OpenBenchmarking.org automated testing software. He can be followed via and or contacted via .
Latest Linux Hardware Reviews
  1. MSI X99S SLI PLUS On Linux
  2. NVIDIA GeForce GTX 970 Offers Great Linux Performance
  3. CompuLab Intense-PC2: An Excellent, Fanless, Mini PC Powered By Intel's i7 Haswell
  4. From The Atom 330 To Haswell ULT: Intel Linux Performance Benchmarks
Latest Linux Articles
  1. RunAbove: A POWER8 Compute Cloud With Offerings Up To 176 Threads
  2. 6-Way Ubuntu 14.10 Linux Desktop Benchmarks
  3. Ubuntu 14.10 XMir System Compositor Benchmarks
  4. Btrfs RAID HDD Testing On Ubuntu Linux 14.10
Latest Linux News
  1. RadeonSI/R600g HyperZ Support Gets Turned Back On
  2. openSUSE Factory & Tumbleweed Are Merging
  3. More Fedora Delays: Fedora 21 Beta Slips
  4. Mono Brings C# To The Unreal Engine 4
  5. Coreboot Now Has Support For Intel Broadwell Hardware
  6. Enlightenment's EFL 1.12 Alpha Has Evas GL-DRM Engine, OpenGL ES 1.1 Support
  7. GTK+ Lands Experimental Backend For Mir Display Server
  8. Ubuntu 14.10 Officially Released
  9. Mesa 10.4 Might Re-Enable HyperZ For R600g/RadeonSI
  10. Intel GVT-g GPU Virtualization Moves Closer
Latest Forum Discussions
  1. HOPE: The Ease Of Python With The Speed Of C++
  2. Updated and Optimized Ubuntu Free Graphics Drivers
  3. Ubuntu 16.04 Might Be The Distribution's Last 32-Bit Release
  4. Linux hacker compares Solaris kernel code:
  5. Advertisements On Phoronix
  6. Users/Developers Threatening Fork Of Debian GNU/Linux
  7. AMD Releases UVD Video Decode Support For R600 GPUs
  8. Proof that strlcpy is un-needed