Announcement

**Mario Junior** · 16 March 2020, 06:11 AM

Originally posted by tiennou View Post

A look at the commit (https://git.kernel.org/pub/scm/linux...94d765c8eecbe1) points in that direction.

This BTFO all people says: "but muhhh, the compiler knows how to optimize assembly code better than you do it manually."

**ssokolow** · 16 March 2020, 07:11 AM

Originally posted by Mario Junior View Post

This BTFO all people says: "but muhhh, the compiler knows how to optimize assembly code better than you do it manually."

To be fair, the best solution is sort of a middle-ground: compiler intrinsics.

The "either use the fast path or fail to compile" of explicitly calling SIMD instructions directly, but without the "compiler has no idea what optimizations are safe to perform on the data flow surrounding this instruction" of assembly.

That's why Microsoft decided to omit inline assembly support from Visual C++ for 64-bit targets.

**newwen** · 16 March 2020, 09:45 AM

Doesn't executing AVX2 instructions consume a lot of power? If the processing time decreases, it's logical that the total consumed energy for the process to decrease, at the expense of higher peak/intantaneous power consumption, but has anyone tested it?

**sophisticles** · 16 March 2020, 10:21 AM

Originally posted by Raka555 View Post

I assume it is hand optimized assembler ?

All SIMD programming, is done using either assembler or compiler intrinsics, which are basically C style instructions that convert 1-1 to assembler.

By definition SIMD is hand optimized, unless of course you rely on compiler optimizations.,

**computerquip** · 16 March 2020, 11:04 PM

Originally posted by Mario Junior View Post

This BTFO all people says: "but muhhh, the compiler knows how to optimize assembly code better than you do it manually."

There's obviously going to be cases where hand-optimized assembler is going to be faster. The problem is that it's less readable, harder to understand, much more verbose, and far less portable than C in general. And it *does*, in the vast majority of situations, generate better assembly than you could write by hand.

**ZeDestructor** · 30 March 2020, 11:32 PM

Originally posted by newwen View Post

Doesn't executing AVX2 instructions consume a lot of power? If the processing time decreases, it's logical that the total consumed energy for the process to decrease, at the expense of higher peak/intantaneous power consumption, but has anyone tested it?

It does, but when you're looking at 0-30% more power for 30-420% better throughput, it's still more efficient overall.

Announcement

Linux 5.7 Netfilter To See AVX2 Optimizations For Big Performance Boost - Can Be Up To ~420%

Comment

Comment

Comment

Comment

Comment

Comment