The v2 patch now has these options, and will almost certainly get approved.
Originally Posted by nej_simon
-march=pentium4 -mtune=core2 -mfpmath=sse
Actually that looks like a typo - the patch comments talk about sse2, but the patch itself just enables sse.
Last edited by smitty3268; 01-28-2013 at 11:09 PM.
Function inlining varies a lot between software. In some cases, it gives huge speedups. Other times, it just results in slower performance and greater memory use. It can vary depending on how large your CPU cache is as well.
Originally Posted by mark_
You can even manually set the depth the compiler will inline down to - something Firefox does for example, because the default -O3 inlining was too much, but by limiting the inlining amount they could still turn on -O3 and get better results than plain old -O2.
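For anyone who wants to try this, here is a rough sketch of what capping the inliner could look like with GCC. It assumes GCC's -finline-limit=n option. The value 50 is only a placeholder to show the shape of the flags, not a number taken from the Firefox build or from the patch, so benchmark before settling on anything.

```shell
# Sketch only: turn on -O3 but cap how aggressively GCC inlines.
# -finline-limit=n is a GCC option; the value 50 is an illustrative
# placeholder, not a recommended or project-specific setting.
CFLAGS="-O3 -finline-limit=50"
CXXFLAGS="$CFLAGS"
export CFLAGS CXXFLAGS
```

With those exported, a normal configure and make run should pick them up. Comparing against a plain -O2 build is the only way to know whether the cap helps on your CPU.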
It's much more likely to be with faster GPUs and lower resolutions. Michael testing an IGP at 1080p probably isn't going to show a lot.
Originally Posted by Adarion
Not in my experience. I've run a lot of benchmarks and games, and 'sysprof' often shows that _mesa_* calls (which are the actual implementation of the gl* calls) are a very noticeable percentage.
Originally Posted by Lockal
I've always* built Mesa with -O3 and not once had an issue that was because of that.
* not built git in the last 3-4 months since it requires newer autofoo and I'm too lazy.