**Obscene_CNN** · 06 May 2010, 12:40 PM

I might add that this patch should work on all chips that use GLSL (GL Shader Language) in Mesa, so Intel and NVIDIA might get a boost too.

I have tested Torcs, Nexuiz, Foobillard, and Celestia with no ill effects that I can see. I did have one machine lock up with Stormbaan Coureur, but I have had lockups with no patches applied as well.

**marek** · 06 May 2010, 01:33 PM

Originally posted by Obscene_CNN

It bumped my Nexuiz scores on demo1 from 5, 8, and 12 to 5, 9, and 13.

Such a small increase can be considered a statistical error. The real bottleneck is elsewhere. You could easily achieve a 2x speedup if you concentrated on the real problems. If you think your patch is useful, take it to the ML.

**Obscene_CNN** · 06 May 2010, 01:52 PM

marek,

It's not a statistical error, and it is on the dri-devel mailing list (I am awaiting my confirmation email for the Mesa3d-dev mailing list). It's about a 4% improvement. Modifications I have done that gave a 5 to 10% increase in performance in Torcs have failed to change my Nexuiz performance by 1 second, or the FPS at all.

I didn't write the function; Brian Paul, who started Mesa, did. Apparently he thought it was worthwhile to write it. I just found a bug in it and worked around it to make it usable.

**evil_core** · 06 May 2010, 01:56 PM

Originally posted by marek

Such a small increase can be considered a statistical error. The real bottleneck is elsewhere. You could easily achieve a 2x speedup if you concentrated on the real problems. If you think your patch is useful, take it to the ML.

Test first! Complain later.

This patch gave way better performance in q3a on my r500 (FireGL V5200) in UXGA mode (T60p). FPS was not the only problem (it is 25% lower in KMS compared to UMS); the frame flow was also laggy (not constant, frames accumulated and thrown out at once). Try a high resolution like me and see the difference, it's real. Same in Warsow and UT2K4, but tc-elite still lags at this res :/

**Hans** · 06 May 2010, 02:32 PM

Obscene_CNN, I would love to try your patch, but I am using fglrx right now (due to work). But I am sure many appreciate your effort in optimizing the drivers.

BTW, is there any way of profiling the drivers?

**Zhick** · 06 May 2010, 02:55 PM

So I just gave it a quick test with Nexuiz/PTS and r300/r300g (my card is an X1900XT, so r500).
r300g (HDR on): Without your patch 30.31, with it 30.03
r300 (HDR off): Without your patch 40.88, with it 40.79
So I can't see any speedup; if anything it's slightly slower, but that might very well be random deviation.
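
For scale, the relative change in those numbers can be worked out with a short Python sketch. The helper name `percent_change` is made up for illustration; the scores are the ones quoted above, and nothing here says whether the difference is statistically meaningful, since that would need repeated runs.

```python
def percent_change(before, after):
    """Relative change from `before` to `after`, in percent."""
    return (after - before) / before * 100.0

# (without patch, with patch), as reported in the post above
results = {
    "r300g (HDR on)": (30.31, 30.03),
    "r300 (HDR off)": (40.88, 40.79),
}

for name, (before, after) in results.items():
    print(f"{name}: {percent_change(before, after):+.2f}%")
# r300g (HDR on): -0.92%
# r300 (HDR off): -0.22%
```

Both differences are under 1%, which is small enough that run-to-run variation is a plausible explanation.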

**Obscene_CNN** · 06 May 2010, 03:14 PM

Hans,

You can profile the drivers. I haven't yet. It's a little difficult, as you have to profile the kernel DRI driver, libdrm, Mesa, and the application.

**Obscene_CNN** · 06 May 2010, 03:23 PM

Zhick,

Thanks for your testing. I'm trying to figure out how it ended up slower for you.

One contributing factor is that there is one more optimization pass over the GPU instructions, which incurs more CPU overhead. However, it should not be that much.

**Obscene_CNN** · 06 May 2010, 05:53 PM

Zhick,

The best I can figure is that your performance is not GPU limited; it is CPU or DMA limited. I will try to come up with an additional patch for you to test, to verify this.

Do you use an x86-64 distribution by chance?

R300,R400,R500,R600,R700 and more performance patch
