Announcement

**schmidtbag** · 02 September 2022, 09:04 AM

I agree it's a little odd that Zen 4 became the motivator, but I guess the thinking was that AVX 512 is now common enough to warrant the effort. Back when it was only available to a single vendor and effectively only server CPUs, it didn't make sense. But with 2 vendors and desktop CPU support, I can see how it becomes a little more worthwhile.

It's also not clear how much faster AVX-512 will be for LLVMpipe on AMD Zen 4 processors. For Zen 4, AVX-512 is implemented still using a 256-bit data path rather than 512-bit, but we'll see how well their AVX-512 implementation plays out soon enough.

I wouldn't be surprised if AMD's approach performs worse but still yields better performance than not having it at all. Since AVX-512 isn't [yet] commonly used, I think it was a good idea for AMD to minimize how much die space was dedicated to a feature that most people otherwise might not care about. Since this allows the chips to be produced cheaper, anyone who cares about AVX-512 can just compensate for the lower performance by having more cores. I figure a 64 core Epyc would yield similar performance to a 56 core Xeon in AVX-512 workloads.

**klokik** · 02 September 2022, 10:51 AM

AFAIR, Intel cores were significantly dropping frequency even below the base frequency, with considerable recovery latency, if AVX-512 instructions were used. Thus if Zen 4 can maintain a somewhat decent clock rate, it has a chance to surpass Intel in Instructions Per Second terms.

**Dukenukemx** · 02 September 2022, 12:35 PM

Yuzu and RPCS3 developers have expressed positivity when using AVX-512 to boost their emulators performance, so there's certainly something good to look forward to there.

**Developer12** · 02 September 2022, 12:46 PM

The article is right to point out that this makes no sense. These AMD chips already all have full GPUs integrated on the I/O die. This LLVMpipe implementation will *never* be useful on those chips.

That leaves only intel, who are in fact reducing support for AVX512. There's even less justification for AVX512 support than there was before.

**brucethemoose** · 02 September 2022, 02:22 PM

Originally posted by Developer12 View Post

The article is right to point out that this makes no sense. These AMD chips already all have full GPUs integrated on the I/O die. This LLVMpipe implementation will *never* be useful on those chips.

That leaves only intel, who are in fact reducing support for AVX512. There's even less justification for AVX512 support than there was before.

Better performance is always good, but writing AVX512 intrinsics does seem like a lot of effort for a fallback gpu driver.

**andrei_me** · 02 September 2022, 04:42 PM

Not sure if LLVMpipe already contains optimization for newer ARM instructions, so it can run even smoother in Asahi for example

**brucethemoose** · 02 September 2022, 05:12 PM

Originally posted by andrei_me View Post

Not sure if LLVMpipe already contains optimization for newer ARM instructions, so it can run even smoother in Asahi for example

ARMv9 seems like a particularly sensible target. There are going to be lots of ARM machine with broken drivers in the near future.

Maybe they don't have any available devs who do ARM assembly.

**arQon** · 03 September 2022, 04:39 AM

Originally posted by Developer12 View Post

The article is right to point out that this makes no sense.

Yeah, I had to read it again to make sure I hadn't missed something.

"Since these CPUs *all* ship with an iGPU, add a bunch of redundant codepaths to the software renderer". wibble?

Next up: writing fast 16.16 fixed-point math for armv6 via SVE instructions... :P

**Raka555** · 03 September 2022, 02:34 PM

Phoronix will probably have an article about the "code rot" in LLVMpipe in a couple of years from now ...

Announcement

Mesa's LLVMpipe Driver Begins Experimenting With AVX-512 Optimizations Ahead Of Zen 4

Mesa's LLVMpipe Driver Begins Experimenting With AVX-512 Optimizations Ahead Of Zen 4

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment