Announcement

**Spittie** · 12 April 2014, 09:45 AM

Well, seems that it got a nice boost compared to yesterday: http://openbenchmarking.org/result/1...=GCC+4.8+Tuned

**mmstick** · 12 April 2014, 11:31 AM

Originally posted by Spittie View Post

Well, seems that it got a nice boost compared to yesterday: http://openbenchmarking.org/result/1...=GCC+4.8+Tuned

Certainly, why are these scores 300-700% faster than previous benchmarks?

**Spittie** · 12 April 2014, 11:54 AM

Originally posted by mmstick View Post

Certainly, why are these scores 300-700% faster than previous benchmarks?

I'd love to ear an answer from Michael, anyway the only difference I can see is the kernel version (3.13 vs 3.14). There are also some different gcc flags, but I doubt those would give such a boost.

**Luke_Wolf** · 12 April 2014, 02:15 PM

Originally posted by Spittie View Post

I'd love to ear an answer from Michael, anyway the only difference I can see is the kernel version (3.13 vs 3.14). There are also some different gcc flags, but I doubt those would give such a boost.

Actually yes, yes the involved GCC flags would, why? well first off because march=native means that the binary itself is tuned towards the architecture it's running on, allowing things like SIMD extensions, whereas the previous tests were tune=generic which means that the binary isn't built for any of the quirks of the architecture and only has up to SSE2 available, but has the advantage of being a portable binary. Futher these binaries are built with Optimization level 3 whereas the previous article's binaries were built without any optimization level at all.

So yeah, those flags do make a big difference

**mmstick** · 12 April 2014, 02:26 PM

Originally posted by Luke_Wolf View Post

Actually yes, yes the involved GCC flags would, why? well first off because march=native means that the binary itself is tuned towards the architecture it's running on, allowing things like SIMD extensions, whereas the previous tests were tune=generic which means that the binary isn't built for any of the quirks of the architecture and only has up to SSE2 available, but has the advantage of being a portable binary. Futher these binaries are built with Optimization level 3 whereas the previous article's binaries were built without any optimization level at all.

So yeah, those flags do make a big difference

Nope, it shouldn't make the difference that big. 15%, maybe, but this is 200-700% faster in every benchmark.

**Luke_Wolf** · 12 April 2014, 02:54 PM

Originally posted by mmstick View Post

Nope, it shouldn't make the difference that big. 15%, maybe, but this is 200-700% faster in every benchmark.

Quite possible

The Performance Between GCC Optimization Levels - Phoronix

http://www.phoronix.com/scan.php?page=article&item=gcc_47_optimizations&num=1

Phoronix, Linux Hardware Reviews, Linux hardware benchmarks, Linux server benchmarks, Linux benchmarking, Desktop Linux, Linux performance, Open Source graphics, Linux How To, Ubuntu benchmarks, Ubuntu hardware, Phoronix Test Suite

on GCC 4.7 O0 vs O3 doubles the performance, march=native will be more dependent upon the software involved but will give it's own boost. Note that most production software is built with O2, but that O0 is the default optimization level.

**carewolf** · 13 April 2014, 06:35 AM

Originally posted by mmstick View Post

Nope, it shouldn't make the difference that big. 15%, maybe, but this is 200-700% faster in every benchmark.

-O3 enables auto vectorization. Normally that isn't a big deal because real programs have custom code for inner loops like that, but in a benchmark where everything is written in generic C or C++, well the compiler gets to do vectorization that can give good 4-8x speed-up (though unless it is really clean examples you tend to only get 2-3x speed up with automatic which is why real application write important vectorizable loops in assembler or instrinsics).

**Ansla** · 14 April 2014, 08:48 AM

The change in CFLAGS would explain app benchmarks getting faster, but not also the compile time benchmark getting faster, it should have had the opposite effect.

Announcement

Running GCC 4.9 On AMD's AM1 Kabini With Jaguar Cores

Running GCC 4.9 On AMD's AM1 Kabini With Jaguar Cores

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment