Originally posted by coder
View Post
"just installing separate RDNA and CDNA cards in your machine"
yes exactly this is a very good idea so you can have a 6900XT for graphics with MESA/ACO and you can buy a MI100 CDNA card and you put it in as a second card for compute with ROCm.
but just lets calculate the costs of this: 1500€ for the 6900XT and 12500€ for the mi100 CDNA card. means you end up at 14000€
lets say you put this on the same card but everthing else is the same you maybe save 100€ for the electronic circuit board
not bad i say. now you think 16GB vram for the 6900XT and 16GB vram for a hypothetical CDNA card sounds boring because you can only use 16gb... now you use xGMI and glue these both together now you can use up to 32GB vram.
to get 32GB vram with xGMi sounds good to me.
but lets face it you or AMD can do it smarter: lets say you put on a IO chip like in the 3950X cpu and the IO chip does the HBM3 or ddr6x means you put 32gb vram on the card and both chips RDNA and CDNA use this memory IO chip interface.
now you put in 2 chips of infinity cache one for the RDNA chip and one for the CDNA chip.
see now you have a very smart solution it save transostors because the second chip is only CDNA and not full GCN/RDNA you also save memory because to get 32gb VRAM for each card to use at max you do not need to put in 64gb vram like in a simple dual gpu design... you can do even smarter stuff now you can select what code you run on RDNA and what code run on CDNA if a code (32bit floating point)runs better on RDNA you put it there if a code runs better on CDNA (64bit floating point) than you put it into the CDNA chip. and because of the infinity cache per chip you can run both graphic tasks and compute cards at the same time with near same speed.
"Would it be a single device with a mix of execution resources? We don't even have APIs for that (e.g. describing a mix of resources and specifying which workloads should run where, etc.)!"
yes in the end you can even do this. like we now have CPU and GPU and APU you could make an API who automatically use the best hardware for the task. if a game use AI NPC it can run on the CDNA compute if the game does graphics it runs on the RDNA part.
Comment