AMD Nearing Full OpenCL 2.0 Support With ROCm 2.0 Compute Stack
-
Originally posted by busukxuan View Post
Sorry if this is a stupid question, but does "SVM" refer to support vector machines? It doesn't sound likely to me that OpenCL would have such a high-level primitive.
Comment
-
Originally posted by phoronix View Post
it looks like AMD could be on a strong footing for GPU compute in 2019
I'm not cheerleading for Nvidia, but let's not kid ourselves. AMD seems to have walked away from HPC and hasn't been competitive in deep learning since Nvidia's GP100 launched in early 2016. That said, there will certainly be some applications where AMD can be competitive, but Nvidia's new RT cores just edged out another one.
Comment
-
@coder: Nvidia has split its HPC and gaming lineups; starting with Pascal, they have been designing separate top chips for graphics and for compute. AMD seems to be doing the same now as they prepare an HPC chip. It has new DL instructions, so maybe even dedicated hardware units similar to tensor cores. It has a 1/2 FP64 rate, so it should reach Volta if they can get past 1.8 GHz within a reasonable power budget, which I think is possible since they are using TSMC's 7 nm node and Vega10 already reaches 1.7 GHz.
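That back-of-the-envelope claim is easy to check: peak FLOPS is shaders × 2 (one FMA counts as two ops) × clock, halved for a 1/2-rate FP64 design. A quick sketch, assuming Vega20 keeps Vega10's 4096 shaders (an assumption, it was unconfirmed at the time) against the Tesla V100's published 5120 shaders at roughly 1530 MHz boost:

```python
# Peak FLOPS = shaders * 2 ops/cycle (FMA) * clock; FP64 at 1/2 rate.
def peak_fp64_tflops(shaders, clock_ghz, fp64_ratio=0.5):
    return shaders * 2 * clock_ghz * fp64_ratio / 1000.0

vega20 = peak_fp64_tflops(4096, 1.8)   # hypothetical 7nm Vega at 1.8 GHz
v100   = peak_fp64_tflops(5120, 1.53)  # Tesla V100 at ~1530 MHz boost

print(f"Vega20 @ 1.8 GHz: {vega20:.2f} TFLOPS FP64")   # ~7.37
print(f"V100 @ 1.53 GHz:  {v100:.2f} TFLOPS FP64")     # ~7.83
```

With those numbers a 1.8 GHz Vega20 lands at about 7.4 TFLOPS FP64, just under V100's roughly 7.8, which is the "reach Volta" argument in a nutshell.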
I also think that RT has been a topic in the industry for a long time. Even Imagination Technologies managed to merge RT cores into their traditional GPU design. I think it might even be possible for AMD to get it into Navi. At least, I hope so, because I agree that dedicated hardware for RT is great and think it's the future of real-time graphics as well. It also helps a great deal in professional workloads.
I think their GPU team is still competitive. They just can't afford to release as many GPUs as Nvidia. As a reminder, in the past few years Nvidia has released GM200, GM204, GM206, GM107, GM108, GP100, GP102, GP104, GP106, GP107, GP108, GV100, and soon TU102, TU104, TU106. In the same timeframe, AMD has only released Tonga, Fiji, Polaris10, 11, 12, Vega10, and soon Vega20. And that's on top of their CPUs, APUs, and semi-custom SoCs. I think it's remarkable that they have kept up to this point without falling much further behind.
Comment
-
My layman's impression is that the big money in machine learning hardware will initially be in large-scale installations.
I am skeptical that CUDA is the barrier to market entry it is credited with being for AMD.
Writing some code doesn't bother those guys much if there is better perf/$ to be had from AMD.
The AMD value proposition works quite well at scale. They don't mind much if it takes 3 AMD GPUs to do the work of 2 high-priced Nvidias, so long as it's cheaper.
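The 3-for-2 argument above is just a total-cost inequality. A toy sketch with entirely made-up prices and power figures (nothing here reflects real products):

```python
# Buying N cheaper GPUs that match M pricier GPUs is rational when the
# total cost (hardware + electricity over the service life) is lower.
def cluster_cost(num_gpus, unit_price, watts, years, usd_per_kwh=0.10):
    energy = num_gpus * watts / 1000 * 24 * 365 * years * usd_per_kwh
    return num_gpus * unit_price + energy

# Hypothetical numbers: 3 cheaper cards vs 2 pricier ones, equal throughput.
option_a = cluster_cost(3, 600, 300, 3)    # 3 x $600 cards, 300 W each
option_b = cluster_cost(2, 1200, 300, 3)   # 2 x $1200 cards, 300 W each
print(f"option A: ${option_a:.0f}, option B: ${option_b:.0f}")
```

Note how, even with these invented numbers, a few years of running costs rival the hardware price, so at scale the perf/$ comparison has to include power draw, not just the sticker price.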
A recent game changer for AMD platform resources is the explosion in core/thread counts. That a relatively humble Threadripper can simply double its cores on the same platform looks like a trick (give each CCX a slave CCX on a lightly modified die) that can be repeated on AM4 and X399/Epyc. Maybe AI will become a little less GPU-centric now?
AMD's crushing dominance in most metrics must make choosing a sibling AMD GPU very compelling for Epyc adopters.
Comment
-
Originally posted by coder View Post
... let's not kid ourselves. AMD seems to have walked away from HPC, and hasn't been competitive in deep learning since Nvidia's GP100 launched, in early 2016. That said, there will certainly be some applications where AMD can be competitive, but Nvidia's new RT cores just edged out another one.
A champion team beats a team of champions.
Would you say the same for multiple 7 nm Vegas on a 64-core/128-thread Epyc in a tightly integrated ecosystem?
Which, btw, has 128 PCIe lanes for scads of all-important NVMe.
IMO, using NAND cleverly as pseudo-RAM will be the key to the insatiable memory demands of the vast data sets of big-end-of-town AI.
Coders just use the resources available. If resource options change and improve, who knows what approach will yield the best AI perf/$? It's still a young field.
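On the NAND-as-pseudo-RAM point: memory-mapped files are the existing mechanism closest to this idea. The OS faults pages in from storage on demand, so a data set larger than RAM can be addressed as if it were memory. A minimal sketch using Python's standard library (file name and size are arbitrary):

```python
import mmap
import os
import tempfile

# Create a small file standing in for a large on-disk data set.
path = os.path.join(tempfile.mkdtemp(), "dataset.bin")
with open(path, "wb") as f:
    f.write(b"\x00" * 4096)       # pretend this is gigabytes of data

# Map the file into the address space; pages are loaded from storage
# only when touched, so RAM holds just the working set.
with open(path, "r+b") as f:
    mm = mmap.mmap(f.fileno(), 0)
    mm[0:5] = b"hello"            # write through the mapping
    print(mm[0:5])                # reads back b'hello'
    mm.close()
```

The same principle, pushed down to NVMe speeds and large capacities, is what the "NAND as pseudo-RAM" idea amounts to.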
Last edited by msroadkill612; 11 September 2018, 06:45 AM.
Comment
-
Originally posted by msroadkill612 View Post
There is never just one way to do things with code.
Originally posted by msroadkill612 View Post
A champion team beats a team of champions.
Originally posted by msroadkill612 View Post
Would you say the same for multiple 7 nm Vegas on a 64-core/128-thread Epyc in a tightly integrated ecosystem?
Comment
-
Originally posted by msroadkill612 View Post
I am skeptical that CUDA is the barrier to market entry it is credited with being for AMD.
The other thing is to make sure those frameworks & surrounding toolchains are competitive with TensorRT, which is to deep learning what CUDA is to parallel programming APIs.
Originally posted by msroadkill612 View Post
Writing some code doesn't bother those guys much if there is better perf/$ to be had from AMD.
The real problem is the numbers. AMD just has no answer for Nvidia's Tensor cores.
Originally posted by msroadkill612 View Post
The AMD value proposition works quite well at scale. They don't mind much if it takes 3 AMD GPUs to do the work of 2 high-priced Nvidias, so long as it's cheaper.
Originally posted by msroadkill612 View Post
A recent game changer for AMD platform resources is the explosion in core/thread counts. That a relatively humble Threadripper can simply double its cores on the same platform looks like a trick (give each CCX a slave CCX on a lightly modified die) that can be repeated on AM4 and X399/Epyc. Maybe AI will become a little less GPU-centric now?
Comment
-
Originally posted by juno View Post
Nvidia has split its HPC and gaming lineups; starting with Pascal, they have been designing separate top chips for graphics and for compute.
Originally posted by juno View Post
AMD seems to be doing the same now as they prepare an HPC chip. It has new DL instructions, so maybe even dedicated hardware units similar to tensor cores.
Originally posted by juno View Post
It has a 1/2 FP64 rate,
Originally posted by juno View Post
I think it might even be possible for AMD to get it into Navi. At least, I hope so, because I agree that dedicated hardware for RT is great and think it's the future of real-time graphics as well. It also helps a great deal in professional workloads.
I'm still left wondering where, exactly, they would put the RT cores. Their GPUs get less graphics performance per mm² than Nvidia's. Unless that situation changes, I don't see how they could justify burning extra silicon on RT hardware. They can't afford to release something that can't be cost-competitive with Nvidia, and I think that's already happened with Vega: its price hasn't dropped to match current GeForce pricing. [checks current prices] ...although it might be getting close.
Originally posted by juno View Post
They just can't afford to release as many GPUs as Nvidia. As a reminder, in the past few years Nvidia has released GM200, GM204, GM206, GM107, GM108, GP100, GP102, GP104, GP106, GP107, GP108, GV100, and soon TU102, TU104, TU106. In the same timeframe, AMD has only released Tonga, Fiji, Polaris10, 11, 12, Vega10, and soon Vega20.
Also, you forgot Polaris20.
I think it will be painful for AMD not to release a new GPU in 2018. The crypto boom probably saved their bacon.
Comment