Originally posted by CochainComplex
View Post
Announcement
Collapse
No announcement yet.
"CC_OPTIMIZE_FOR_PERFORMANCE_O3" Performance Tunable Dropped In Linux 6.0
Collapse
X
-
Last edited by dekernel; 11 August 2022, 08:35 AM.
- Likes 1
-
Originally posted by coder View PostRegarding your quoted results: is the first set reporting the time to complete a fixed workload and the second set reporting the speed achieved? In other words, "lower is better", in the first test, but "higher is better" in the second?
BTW, I'm currently still running the Ofast kernel I tested last because I was too lazy to recompile and reboot again. So far no issues. Not that I would recommend it or anything, but no nasal demons have come out of my nose yet.Last edited by binarybanana; 11 August 2022, 07:01 AM.
- Likes 3
Leave a comment:
-
Originally posted by birdie View PostI see there's a cult of -O3 here: you, guys, are welcome to use Clear Linux, Gentoo, FreeBSD or create an Ubuntu SUPER DUPER FAST SPIN compiled with -O3 -march=zen3 -flto -pgo since I presume most people here are rabid AMD fans.
Oh and make sure you've not missed other experimental GCC flags. Every optimization matters, why stop at -O3?have a look at the Clear Linux repo - Ofast is the next "big" thing
e.g. your beloved Wayland
https://github.com/clearlinux-pkgs/w...n/wayland.spec
p.s.:...nitpicking but PGO does not work by just adding the "-pgo"-FlagLast edited by CochainComplex; 11 August 2022, 07:01 AM.
- Likes 2
Leave a comment:
-
Originally posted by DanglingPointer View PostThe -O2 religion vs the -O3 cult! Or is it vice versa?
Disclaimer: I have only ever used KCFLAGS=" ...-O3"
Code:$ time KCFLAGS="-march=native -msse2avx -pipe -O3" KCPPFLAGS="-march=native -msse2avx -pipe -O3" make -j$(( $(nproc) + 2 )) deb-pkg LOCALVERSION=-danglingpointer-zen3-optimised
- Likes 1
Leave a comment:
-
I see there's a cult of -O3 here: you, guys, are welcome to use Clear Linux, Gentoo, FreeBSD or create an Ubuntu SUPER DUPER FAST SPIN compiled with -O3 -march=zen3 -flto -pgo since I presume most people here are rabid AMD fans.
Oh and make sure you've not missed other experimental GCC flags. Every optimization matters, why stop at -O3?
Leave a comment:
-
Originally posted by CochainComplex View PostHaswell was introduced roughly 10 years ago and besides it resembles the roughly the compilerflag x86_v2.
BTW, the whole tangent about Clear Linux is a waste of time. Its only relevance to the discussion is in showing that -O3 works and doesn't compromise stability.Last edited by coder; 11 August 2022, 06:11 AM.
- Likes 2
Leave a comment:
-
Originally posted by birdie View PostCochainComplex
-O3 creates such bloated code it may thrash low-end CPUs with small L1/L2 caches and result in a much lower performance than -O2.
- Likes 4
Leave a comment:
-
Originally posted by birdie View Post1. There's no need to add this ugly fucking picture.
Originally posted by birdie View Post2. You're free to compile the kernel with -O999 if you want.
3. You're free to create your own distro where you compile everything with -O999.
Originally posted by birdie View PostGCC developers themselves have said on multiple occasions that -O3 enables experimental optimization options which may or may not improve performance but surely will add bloat.
Originally posted by birdie View PostThis topic is not worth the electrons wasted on it.
Originally posted by birdie View Post/Thread.
- Likes 4
Leave a comment:
-
Originally posted by birdie View Post
I see a lot of speculation and zero test results. Sorry, I'm a simple person and that doesn't work with me
- Likes 6
Leave a comment:
-
Originally posted by PerformanceExpert View Post
This is not true at all - even -O2 enables a lot of optimizations, including vectorization. -O3 is more aggressive of course, but the difference between -O2 and -O3 is far smaller today than it was 5-10 years ago. There is no doubt that -O2/-O3 generate larger code than -Os, but the difference is not that large.
In general the additional performance from optimization more than makes up for any increased I-cache misses. And it's not like -O3 is a recent invention, it existed decades ago when caches were absolutely tiny. So any claims about old CPUs with small caches not being able to run -O3 code well is plain wrong.
Leave a comment:
Leave a comment: