Originally posted by staalmannen
View Post
ICC version 12 (parallel studio xe 2011)
most agresive flags: -fast -no-prec-div -fp-model fast
more agresive flags: -O3 -xHOST -ip -no-prec-div -fp-model fast
agresive flags: -O3 -xHOST -ip
normal O3: -O3
normal O2: -O2
normal Os: -Os
open64 4.2.3
most agresive flags: -ofast
more agresive flags: -O3 -ffast-math -lno -apo -mso -march=wolfdale
agresive flags: -O3 -lno -apo -mso -march=wolfdale
normal O3: -O3
normal O2: -O2
normal Os: -Os
GCC 4.5.2
most agresive flags: -O3 -ffast-math -march=native -mtune=native -fexcess-precision=fast -ftree-parallelize-loops=5 -floop-parallelize-all -floop-interchange -floop-strip-mine -floop-block -flto
more agresive flags: -O3 -ffast-math -march=native -mtune=native -fexcess-precision=fast -ftree-parallelize-loops=5 -floop-parallelize-all -floop-interchange -floop-strip-mine -floop-block -flto
agresive flags: -O3 -march=native -mtune=native -ftree-parallelize-loops=5 -floop-interchange -floop-strip-mine -floop-block -flto
normal O3: -O3
normal O2: -O2
normal Os: -Os
If u are not in the mood to do alll this , I would suggest to try more agresive flags or agresive flags profile.
note1: Most agresive flags and more agresive flags perform not acurate IEEE maths operations, but this should be fine for most programs.
note2: This is my personal choice, I only toyed a little with GCC vs ICC in the past, so those flags could be wrong or not optimal.
Comment