Originally posted by gsedej
View Post
Announcement
Collapse
No announcement yet.
The State Of ROCm For HPC In Early 2021 With CUDA Porting Via HIP, Rewriting With OpenMP
Collapse
X
-
Originally posted by vegabook View Post
I'm sorry. Your company has been worth > 20 billion dollars on the stock market for more than 2 years now, and 50bn since the end of 2019. Stock market valuation's entire raison-d'etre is to measure your capital raising capacity. If your management decided not to take advantage of this trove of gold by, say, issuing 2% of shares into that valuation to fund software, then that's a strategic mistake for Radeon Technologies group or whatever it's now called, which seems to have been completely forgotten with all the Ryzen excitement. Fact is, money has been there for the taking thanks to your enormously successful stock but management didn't use that to invest where it needed to. This insistence on funding out of cashflow is very difficult to understand when your competitor is racing ahead with ever more speed, and Intel is about to get in too.Last edited by bridgman; 23 February 2021, 02:42 PM.Test signature
- Likes 1
Comment
-
Originally posted by bridgman View Post...
Most of our major customers build from source, by the way.
...
I think nothing of building the kernel for custom cases. I thought nothing of build X from source back in the day. I've been working with open source since 1993: I LIKE having the source code.
But my God, ROCm. It's a stack with many parts - a couple dozen? I can't even find a description of the dependency tree of the packages in bloody ROCm. I've tried to build ROCm from source, and I know how to figure dependencies out, but ROCm defeated me.
In case you think you hear some anger here, well, guy, you are right. AMD touts ROCm stack but it cannot be arsed to write up a doc that describes how to build the stack; AMD cannot be arsed to support RDNA/RDNA2 except with OpenCL (like, you know, OpenCL is the future, right?); AMD cannot even be arsed to write installation instructions that do not leave people with the impression they should install rocm-dkms; AMD puts bloody GCN GPUs inside CPUs and calls them APUs but AMD cannot be arsed to support it's compute stack there either.
How the hell does AMD ever expect to compete with nVidia in GPGPU outside national supercomputer walled gardens when the people who might become a real ecosystem for ROCm can't even use it? Radeon VII is gone, and you can't buy an mi60, you can buy an mi50 if you agree to felate a corp sales rep for a while, and the mi100 requires relations with HPE/Cray, a fate worse than corp sales; AMD should stop talking about ROCm in public, because the only way the public can actually use it now is to somehow find an RX580.
Comment
-
-
Originally posted by hoohoo View PostJesus, that is pretty Goddamned rich, Bridgman.
I think nothing of building the kernel for custom cases. I thought nothing of build X from source back in the day. I've been working with open source since 1993: I LIKE having the source code.
But my God, ROCm. It's a stack with many parts - a couple dozen? I can't even find a description of the dependency tree of the packages in bloody ROCm. I've tried to build ROCm from source, and I know how to figure dependencies out, but ROCm defeated me.
In case you think you hear some anger here, well, guy, you are right. AMD touts ROCm stack but it cannot be arsed to write up a doc that describes how to build the stack; AMD cannot be arsed to support RDNA/RDNA2 except with OpenCL (like, you know, OpenCL is the future, right?); AMD cannot even be arsed to write installation instructions that do not leave people with the impression they should install rocm-dkms; AMD puts bloody GCN GPUs inside CPUs and calls them APUs but AMD cannot be arsed to support it's compute stack there either.
I have raised the issue about it being almost impossible to find a new product with ROCm support multiple times, and it does seem to have gotten attention. We also have new management in most of the areas where you expressed concerns and I think that will help as well. It still requires a bunch of additional people to do the work and that will take some time to build up, but it looks promising.
Originally posted by hoohoo View PostHow the hell does AMD ever expect to compete with nVidia in GPGPU outside national supercomputer walled gardens when the people who might become a real ecosystem for ROCm can't even use it? Radeon VII is gone, and you can't buy an mi60, you can buy an mi50 if you agree to felate a corp sales rep for a while, and the mi100 requires relations with HPE/Cray, a fate worse than corp sales.
Originally posted by hoohoo View PostAMD should stop talking about ROCm in public, because the only way the public can actually use it now is to somehow find an RX580.Last edited by bridgman; 23 February 2021, 02:36 PM.Test signature
- Likes 1
Comment
-
Originally posted by bridgman View Post
With respect, we made zero deal about Radeon VII's FP64 capability other than eventually adding an FP64 FLOPs number to the product page alongside FP16 and FP32.
The press made a big deal about it which is fine, and which eventually resulted in us launching the PRO version of the card (with even faster FP64) but from our perspective Radeon VII was a gaming card.
It doesn't stand the smell test. If AMD meant the Radeon VII to be a gaming card then why did AMD cap the FP64 at 3.5-odd TF and not gimp it to near zero like the other gaming cards?
Comment
-
Originally posted by bridgman View Post
Again, my comment was specifically in response to a poster's comments about Google. It was not meant to be dismissive in any way about the challenges new users face.
I have raised the issue about it being almost impossible to find a new product with ROCm support multiple times, and it does seem to have gotten attention. We also have new management in most of the areas where you expressed concerns and I think that will help as well. It still requires a bunch of additional people to do the work and that will take some time to build up, but it looks promising.
Comment
-
Originally posted by vegabook View PostSecond, this going on about "limited resources". AMD is rich now. It's been rich for 2.5 years at least. This excuse is no longer valid at all even if you allow for decent lead time to hire and train people.
and right now they try to fix it with today's money. what does not bring you back the time from 4 years ago.Phantom circuit Sequence Reducer Dyslexia
Comment
-
Originally posted by bridgman View Post
It uses an NVidia/Mellanox API (which is what the NICs support) that requires application memory pages to be pinned in place from userspace. I think the recent upstreaming of GPU P2P dmabuf may allow us to re-implement in an upstreamable form, at least that was one of the goals for the work IIRC.Phantom circuit Sequence Reducer Dyslexia
Comment
Comment