Lczero Neural Network Chess Benchmarks With OpenCL Radeon vs. NVIDIA


  • Lczero Neural Network Chess Benchmarks With OpenCL Radeon vs. NVIDIA

    Phoronix: Lczero Neural Network Chess Benchmarks With OpenCL Radeon vs. NVIDIA

    Yesterday I posted a number of Lczero chess engine benchmarks on NVIDIA GPUs using its OpenCL back-end as well as its CUDA+cuDNN back-end, which offered massive performance gains over OpenCL across the many NVIDIA GPUs tested. With the CUDA+cuDNN code performing so much better than OpenCL, some wondered whether NVIDIA was intentionally gimping its OpenCL performance. Well, here are results side-by-side now with Radeon GPUs on OpenCL...

  • #2
    The RX Vega 64 is slower than the RX 590.
    This code must be a masterpiece ^^


    • #3
      And ROCm 2 is slow too; for comparison, this is with a Vega 56 and the amdgpu-pro OpenCL backend:
      https://openbenchmarking.org/result/...SK-VEGA5685245
      Last edited by ObiWan; 15 January 2019, 08:15 AM.

      • #4
        That's a factor of two. ROCm OpenCL really needs some tuning.

        • #5
          For the CPU backend, I think you can link against Intel's mkl-dnn library (instead of OpenBLAS) to get much better performance; search for mkl-dnn on GitHub.
          Also, I assume you are already running it with enough threads (e.g. --threads=32); a quick way to check thread scaling is sketched below.
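          Purely as a sketch (it assumes an lc0 binary on your PATH built with a BLAS backend, and that the benchmark mode and the --threads/--backend flag names match your lc0 version), something like this would let you compare thread counts:

          # Sketch: time lc0's built-in benchmark at different thread counts to see
          # whether the CPU (BLAS) backend scales with cores.
          # Assumptions: an "lc0" binary is on PATH, it was built with a BLAS backend
          # (OpenBLAS or mkl-dnn), and the "benchmark" / "--threads" / "--backend=blas"
          # names are valid for that build; they may differ between versions.
          import subprocess
          import time

          for threads in (1, 8, 16, 32):
              start = time.time()
              subprocess.run(
                  ["lc0", "benchmark", f"--threads={threads}", "--backend=blas"],
                  check=True,
              )
              print(f"--threads={threads}: {time.time() - start:.1f}s")

          If throughput stops improving well before 32 threads, the bottleneck is probably in the BLAS library rather than in the engine itself.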

          • #6
            Whoever wrote their OpenCL stack should be shot. It reminds me of how piss-poor Blender's monolithic stack was before AMD rewrote it into a split stack and threaded it properly.

            • #7
              Originally posted by ObiWan View Post
              And ROCm 2 is slow too; for comparison, this is with a Vega 56 and the amdgpu-pro OpenCL backend:
              https://openbenchmarking.org/result/...SK-VEGA5685245
              It isn't just ROCm. Much of this slowness also comes from the client's poor coding and limited knowledge of OpenCL.

              • #8
                Originally posted by Marc Driftmeyer View Post

                It isn't just ROCm. Much of this slowness also comes from the client's poor coding and limited knowledge of OpenCL.
                Additionally, I don't even think it is a good idea to use OpenCL directly rather than building the AI engine on top of PyTorch/TensorFlow.
                Last edited by zxy_thf; 15 January 2019, 08:49 PM.

                • #9
                  Originally posted by zxy_thf View Post
                  Additionally, I don't even think it is a good idea to use OpenCL directly rather than building the AI engine on top of PyTorch/TensorFlow.
                  I thought neither TensorFlow nor PyTorch worked with OpenCL?

                  • #10
                    Thanks, very interesting test!
                    So getting an RTX 2060 is now probably the way to go for a really strong chess engine. Lczero just took second place at the TCEC computer chess tournament, behind Stockfish.
                    But against a 2060 you would need quite expensive hardware to make Stockfish competitive. I would need to find some numbers for Stockfish's speed as a function of core count, but at the current ratio of roughly 1:1000 in nodes per second, Lczero is most likely stronger than Stockfish. That being said, Stockfish would still be very useful for deep tactical analysis problems.
