We are aiming to have a developer preview of the modified OpenCL stack (still closed source runtime but using a lot more open source code) around mid-December.
Announcement
Collapse
No announcement yet.
OpenCL Image Support For Gallium3D's Clover
Collapse
X
-
Sort-of... the preview will be running over the ROCM stack which includes modified kernel drivers (newer-than-upstream KFD code plus corresponding changes in amdgpu, ttm etc...). The ROCM kernel has not gone through much graphics testing other than developers booting to desktop so we aren't saying at this point you will have a do-everything system, but next step will be getting the newest amdkfd code upstream-ready so we can include it in all-open and -pro trees.
HW support in the initial preview will probably be limited to one or two high-end dGPUs.Test signature
- Likes 1
Comment
-
Originally posted by bridgman View PostSort-of... the preview will be running over the ROCM stack which includes modified kernel drivers (newer-than-upstream KFD code plus corresponding changes in amdgpu, ttm etc...). The ROCM kernel has not gone through much graphics testing other than developers booting to desktop so we aren't saying at this point you will have a do-everything system, but next step will be getting the newest amdkfd code upstream-ready so we can include it in all-open and -pro trees.
HW support in the initial preview will probably be limited to one or two high-end dGPUs.
Comment
-
Yeah, it's tough getting the HSA/ROC stack working on SI since it doesn't have the MEC block (so no AQL support and limited user queue support, also 2 compute queues on ME instead of 32+ HW scheduling on MEC).
We did hack up a partial implementation using Tahiti (79xx, which had ATC and could generate ATS/PRI requests) and Trinity (which had IOMMUv2 but not hooked up to the internal GPU) during the early days of HSA work until Kaveri silicon became available, but it required non-standard API calls because the MEC HW functionality wasn't there.
I think hacking the runtime to use kernel calls + native compiler on SI would be easier than trying to implement HSA/ROC-ish user queues.Last edited by bridgman; 24 November 2016, 01:26 AM.Test signature
- Likes 1
Comment
Comment