GCC 15 Lands New Optimization For AMD Zen 4 & Zen 5 CPUs
-
Originally posted by erniv2
Another myth. It's exactly the other way around: -mtune optimizes for a CPU without breaking older architectures (i.e., without making systems unbootable), while -march builds code for the specified architecture, emitting instructions for that exact CPU.
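If anyone wants to check this on their own box, here is a rough sketch in C. It relies on the predefined macros GCC sets when an instruction-set extension is enabled (`__AVX512F__`, `__AVX2__`, `__AVX__`, `__SSE2__`). The function name `widest_vector_isa` is made up for this post. Building the same file once with only -mtune=znver4 and once with -march=znver4 should report different results, since only -march changes which instructions the compiler is allowed to emit.

```c
#include <stddef.h>

/* Hypothetical helper: reports the widest x86 vector extension that the
   compiler was allowed to use when this file was built. The answer is fixed
   at compile time by flags such as -march; it says nothing about the CPU
   the binary later runs on. */
const char *widest_vector_isa(void)
{
#if defined(__AVX512F__)
    return "AVX-512F";
#elif defined(__AVX2__)
    return "AVX2";
#elif defined(__AVX__)
    return "AVX";
#elif defined(__SSE2__)
    return "SSE2";
#else
    return "none detected";
#endif
}
```

Print the returned string from a small main() and compare the two builds. This only shows what the compiler was permitted to generate, not what tuning decisions it made.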
-
Originally posted by EphemeralEft
You might also need -mtune=native
-march is the higher-level compiler option; it sets -mtune as well, so -mtune is implied.
-
Originally posted by Kjell
Does GCC automatically enable this flag with -march=native when the build host uses Zen 4?
-
Does GCC automatically enable this flag with -march=native when the build host uses Zen 4?
-
I'm not sure why they'd need either AVX2 or SSE loop epilogues, TBH. AVX-512 can use 256-bit and 128-bit operands. So, if you merely wanted to avoid an AVX-512 vectorized loop having an unvectorized epilogue, maybe a better idea would be to teach the compiler how to use smaller operands with it?
Inside the CPU, it's using the same pipelines for all families of vector instructions. Some CPUs might have increased throughput of narrower operands, but you shouldn't have to switch instruction families to get that benefit.
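To make the epilogue idea concrete, here is a small sketch in C of how a trip count could be divided between a 512-bit main loop and narrower cleanup stages. This is only an illustration and not GCC's actual code generation: it assumes float elements (16 per 512-bit vector), one pass each at 8 and 4 elements, and a scalar tail. The names `loop_split` and `split_trip_count` are invented for the example. Whether the 8- and 4-element stages are encoded as AVX2/SSE or as AVX-512 instructions with 256-bit and 128-bit operands does not change this arithmetic.

```c
#include <stddef.h>

/* Hypothetical illustration: element counts handled by each stage of a
   vectorized loop with two narrower epilogues and a scalar tail. */
struct loop_split {
    size_t main_iters;   /* passes of the 16-float (512-bit) main loop   */
    size_t epi8_iters;   /* 0 or 1 pass of an 8-float (256-bit) epilogue */
    size_t epi4_iters;   /* 0 or 1 pass of a 4-float (128-bit) epilogue  */
    size_t scalar_iters; /* 0..3 leftover elements done one at a time    */
};

struct loop_split split_trip_count(size_t n)
{
    struct loop_split s;

    s.main_iters = n / 16;  /* full-width passes */
    n %= 16;                /* at most 15 elements remain */

    s.epi8_iters = n / 8;   /* first epilogue, half width */
    n %= 8;

    s.epi4_iters = n / 4;   /* second epilogue, quarter width */
    n %= 4;

    s.scalar_iters = n;     /* whatever is left */
    return s;
}
```

With 31 elements, for instance, this gives one main pass, one 8-wide pass, one 4-wide pass, and 3 scalar iterations, versus 15 scalar iterations if the loop had no vector epilogue at all.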
-
Phoronix: GCC 15 Lands New Optimization For AMD Zen 4 & Zen 5 CPUs
Merged today for the upcoming GCC 15 stable release is a new "X86_TUNE_AVX512_TWO_EPILOGUES" tuning optimization that is enabled by default for AMD Zen 4 and Zen 5 processors...