Originally posted by AmericanLocomotive
View Post
Announcement
Collapse
No announcement yet.
Intel Linux Optimizations Help AMD EPYC "Genoa" Improve Scaling To 384 Threads
Collapse
X
-
Seems like some applications still have an upper limit to how many threads they can utilize; perhaps 128, when you consider the major dropoff at 384. I suspect that despite what the application sees, the scheduler knows there are more available threads. This causes the threads to jump between core clusters or sockets, which dramatically hurts performance, apparently enough where 48 threads is often faster or close to it. For workloads that use all threads, you wouldn't get this problem because there's no point in swapping threads around. I'm sure if some of these benches prevented their threads from hopping around, the graph would mostly plateau at 192 and 384 threads.
Comment
-
Originally posted by arcivanov View PostReally curious at what's happening at the magic number of "192". Seems to be some monstrous bottleneck somewhere.
Hyperthreading has limited throughput benefit but you still suffer from lock contention, so performance might drop overall.
- Likes 3
Comment
Comment