Originally posted by scottishduck
View Post
Announcement
Collapse
No announcement yet.
AMD XDNA Linux Driver Updated As It Nears The Upstream Kernel
Collapse
X
-
-
Originally posted by cgmb View PostI'm not sure what you're referring to. The drivers for AMD's enterprise GPUs are open source and available as both an out-of-tree kernel module and upstreamed into linux itself.
Also MxGPU is a sore point. You can download GIM drivers which support up to the FirePro S7150, but anything newer like the Radeon V340 or V620 is unsupported by them and only by non-public drivers. Questions about that on AMD community forums go unanswered.
Originally posted by cgmb View PostMy understanding is that the place to report kernel bugs is https://gitlab.freedesktop.org/drm/amd/-/issues. Although, it is true that it may still be difficult to get attention on your issue.
Leave a comment:
-
Originally posted by Developer12 View PostIt's a total gamble if any given AMD card will run ROCm. So far AMD seem to be intentionally ambiguous to hide the fact that the actual, guaranteed-to-work pool of cards is vanishingly small. (while a slightly larger number of cards are unmaintained and "might" work or might crash or might start crashing with the next update) The AMD driver support is so unmaintained that GCC compiler devs are dropping card support because the drivers are too broken to test if the compilers work.
Originally posted by chithanh View PostYou can buy used NVidia enterprise hardware, download drivers from NVidia, and run it fine in your lab. Driver support goes back many generations.
You cannot do the same with used AMD enterprise hardware and expect it to work, as their drivers for enterprise hardware are often proprietary and locked behind service contracts. Those parts which are supported by ROCm, be happy if you can use it for a year before AMD drops support.
Originally posted by chithanh View Post3. It is not possible for the developer community to reach anyone inside AMD who has the ability to reproduce and fix issues in the drivers. You can complain on the AMD community forums (a horribly slow and bloated piece of web service btw.) all day long, but the only thing that the moderators can do there is placate users. Nobody there from AMD with any technical capacity (nor the hardware) to help users isolate issues and raise them internally at AMD.
- Likes 1
Leave a comment:
-
Originally posted by Adarion View PostSo many things could benefit from it, some LibO stuff, darktable, blender, all the real compute stuff that was mostly written with GPUs as processors in mind, all that so called A"I" stuff these days. But it needs compute support and not just in 4 to 5 totally unaffordable CDNA models and 3 ultra-high-end ones of the normal "gaming" series.
The main problem that AMD is behind NVidia on Blender etc. support is because the developer community is not dogfooding on AMD hardware. This is due to multiple reasons:
1. There is practically no reason to choose AMD hardware over NVidia (even ignoring the driver situation) since AMD started to emulate NVidia in fusing off enterprise features in their consumer hardware. Level1Techs and others have been campaigning for years to get at least some reduced form of SR-IOV (MxGPU) on consumer hardware, but to no avail. SR-IOV was even removed from professional/workstation products and nowadays is only in datacenter.
2. AMD has absolute disdain and least regard for home labs, presumably because their corporate bean counters do not see any revenue from those. They totally ignore that mindshare among the developer community depends on home labs to a large extent.
You can buy used NVidia enterprise hardware, download drivers from NVidia, and run it fine in your lab. Driver support goes back many generations.
You cannot do the same with used AMD enterprise hardware and expect it to work, as their drivers for enterprise hardware are often proprietary and locked behind service contracts. Those parts which are supported by ROCm, be happy if you can use it for a year before AMD drops support.
And then you see on the enterprise CPU side that AMD introduces PSB mobo vendor locks , with no possibility to instead reset the crypto keys or any other recourse, severely impacting the market for second-hand EPYC CPUs. Again, no consideration at all given to home labs.
3. It is not possible for the developer community to reach anyone inside AMD who has the ability to reproduce and fix issues in the drivers. You can complain on the AMD community forums (a horribly slow and bloated piece of web service btw.) all day long, but the only thing that the moderators can do there is placate users. Nobody there from AMD with any technical capacity (nor the hardware) to help users isolate issues and raise them internally at AMD.
Until these things change, AMD will always remain an also-ran in software support.
Leave a comment:
-
But nvidia behing ahead should still not influence your driver quality and HW support. Let nv be whereever they want, AMD is to focus on their own HW and make sure this just works. There are various regressions in amdgpu that I noticed recently and compute support is a sheer pain. They COULD have been so much better. It reminds me of VIA even, fancy ideas, interesting concepts, but failing utterly to deliver due to bad driver support.
For the record: No, I am not saying AMD's driver support is bad in itself, it has come a long way and we do have a free as in freedom driver stack, which is great and 2 bazillion percent more than VIA ever had. I followed it from the earliest days (radeonHD, when libv was still involved) and it has grown and matured. However, it starts to show regressions and compute is a painful field. And here it would be so good, the potential of great acceleration for some tasks, has been there since the E-350. So many things could benefit from it, some LibO stuff, darktable, blender, all the real compute stuff that was mostly written with GPUs as processors in mind, all that so called A"I" stuff these days. But it needs compute support and not just in 4 to 5 totally unaffordable CDNA models and 3 ultra-high-end ones of the normal "gaming" series. Nah, this has to be with every APU, dGPU and whatnot. (Yeah, one won't reach the performance of a CDNA dedicated compute card, but this is for everyday people with everyday tasks, as mentioned above; people who do dedicated compute on large scale will then likely also afford the big monsters.)
Leave a comment:
-
Originally posted by Developer12 View PostAMD *could* have good AI products. They routinely have decent hardware at a decent price, but their software and driver stack is absolute crap.
It's a total gamble if any given AMD card will run ROCm. So far AMD seem to be intentionally ambiguous to hide the fact that the actual, guaranteed-to-work pool of cards is vanishingly small. (while a slightly larger number of cards are unmaintained and "might" work or might crash or might start crashing with the next update) The AMD driver support is so unmaintained that GCC compiler devs are dropping card support because the drivers are too broken to test if the compilers work.
When nvidia CUDA will absolutely, trust-your-life-with-it work on nearly every card they make (consumer AND enterprise) back to a well-defined starting point, it's a no brainer why nobody bothers with AMD.
- Likes 2
Leave a comment:
-
AMD *could* have good AI products. They routinely have decent hardware at a decent price, but their software and driver stack is absolute crap.
It's a total gamble if any given AMD card will run ROCm. So far AMD seem to be intentionally ambiguous to hide the fact that the actual, guaranteed-to-work pool of cards is vanishingly small. (while a slightly larger number of cards are unmaintained and "might" work or might crash or might start crashing with the next update) The AMD driver support is so unmaintained that GCC compiler devs are dropping card support because the drivers are too broken to test if the compilers work.
When nvidia CUDA will absolutely, trust-your-life-with-it work on nearly every card they make (consumer AND enterprise) back to a well-defined starting point, it's a no brainer why nobody bothers with AMD.
- Likes 4
Leave a comment:
-
Originally posted by scottishduck View PostIt’s almost like the Ryzen AI thing was pure marketing buzz and no one actually goes to AMD for AI.
- Likes 3
Leave a comment:
-
I'll be one of the dissenters on AI hardware. While I think it's overblown and a bit early to celebrate, I'm excited for a future where the AI acceleration can take over and stop using my CPU and GPU.
If you download GPT4ALL (it has an official Flathub for those interested) you'll quickly be able to see what it can do with CPUs, and Vulkan, if you set up the latter. No expertise or finicking. Pure GUI, click around and go. Models I've seen vary from hundreds of megabytes to tens of gigabytes.
I've had even the smaller, lighter models write me start-up scripts for my laptop to save on power and battery. It wasn't always perfect, but it's incredible for something that is totally offline, private, and requires no Internet. If I could offload it to an AI accelerator, it would not only use less power, but I'd be able to process far more intelligent and advanced models in less time.Last edited by Mitch; 13 October 2024, 10:29 AM.
- Likes 7
Leave a comment:
Leave a comment: