Originally posted by bridgman
That makes the OpenCL section 50% flawed and 50% not too relevant (MD5 results), right?
@bridgman: unrelated, but I was wondering: which OpenCL compiler does the new AMDGPU-PRO stack use, and how much of the codebase (the compute-related parts in particular) is shared with the ROC stack? And what's the state of OpenCL support in the ROC stack? It's not entirely clear what the different components are or what they do.
We've suffered from serious driver overhead as well as crazy compiler behavior with our code, so I'm looking forward to seeing the ROC stack in action, both for a more lightweight, compute-tuned driver stack and for a better compiler. We know our kernels can match the NVIDIA CUDA kernels, which have been getting tweaks and serious tuning for years (comparing Fiji against Maxwell), so it's pretty frustrating to be held back by the software stack.