Originally posted by Kano
Originally posted by leeenux
So OK, let's take real-world app code that almost everyone uses in some way today, whether they know it or not,
such as the video streaming sites Veetle, Livestream, Ustream, Justin.tv, and many others.
That app code being the multi-threaded x264, you know, that one single app that is intensely integer SIMD (SSE*, AVX) optimised for x86 and some ARM NEON CPUs/SoCs today.
How come, even after Loren Merritt, Dark Shikari and others got remote access, they only managed an average 2% to 10% improvement in parts? The BD x2 integer SIMD should have walked it, but didn't.
Of course, you might make the valid claim that Michael can't be bothered to find the time to git pull a current x264, with lots of AVX and other long-standing speed improvements, and integrate it into his Phoronix Test Suite, so that old 2010 version makes AMD look even worse, and you may have a point. So get Michael to do something about it, but that still doesn't make AMD CPUs faster...
Only better optimised internal AMD HW microcode etc. for x264 and other common integer SIMD apps will make a real difference, but that work hasn't happened to date.
And notice how I didn't once refer to using GPU compute, or the ATI gfx SAD instructions etc., to make x264 faster. If they were any good or easy to use for this everyday FOSS application code base that is used everywhere, then someone would have already submitted a patch to the x264 devs' IRC channel.
The x264 devs have tried to get some gfx devs to actually contribute, but to date not one single gfx or OpenCL coder has stepped up or completed even a single refactor of any single routine and submitted a patch to the x264 devs.