Firefox went to updating the screen with less than 1 fps (disabling GPU acceleration helps though) and the rest of the desktop also feels sluggish.
The memory allocator also has trouble allocating larger chunks of VRAM when there still should be enough memory left
Your experience is really interesting, but I don't believe that AMD has better memory allocator than Nvidia.
I think that you just used cudnn_benchmark=True which drastically increases memory usage in order to find the best algorithm, try to disable it and more likelly you can use batch size of 8 as on your AMD.
As a result, there are some networks that I can only train with a batch size of 3 when I trained them with a batch size of 8 on my previous Polaris GPU (both cards have 8 GB VRAM).
Leave a comment: