Announcement

Collapse
No announcement yet.

RDMA Changes For Linux 5.12 Add DMA-BUF Support For Peer-To-Peer Transfers With GPUs

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • mppix
    replied
    Originally posted by coder View Post
    The CPU is still what boots, runs the OS & drivers, manages the memory space, manages security, and basically coordinates things at a high-level. It's still the boss, even if it's not an intermediary of every individual transaction. These devices don't know about UIDs, perms, or userspace processes, but the CPU is what's responsible for extending those constructs so that device-to-device transactions remain private and secure.

    If you want to view the inside of a computer more as a network, where there are multiple equal peers, then I think blade servers are probably more along those lines. I believe at least some of them use Ethernet as a backplane.
    Yes, of course. The CPU remains the main hardware that runs the system. However, they become more like a referee from a computation perspective and don't risk to be the bottleneck.

    Blade servers and supercomputers can use both Ethernet or more commonly infiniband with direct memory access. In most cases, you still have a central access/management node(s). It certainly also a distributed computation environment at a higher level that would seem complementary.

    Leave a comment:


  • coder
    replied
    Originally posted by mppix View Post
    Sounds exciting. This, pci4/5 (..6) smart access memory, oneAPI and the like may really start a new compute model where the CPU is only one of many compute nodes in a system.
    The CPU is still what boots, runs the OS & drivers, manages the memory space, manages security, and basically coordinates things at a high-level. It's still the boss, even if it's not an intermediary of every individual transaction. These devices don't know about UIDs, perms, or userspace processes, but the CPU is what's responsible for extending those constructs so that device-to-device transactions remain private and secure.

    If you want to view the inside of a computer more as a network, where there are multiple equal peers, then I think blade servers are probably more along those lines. I believe at least some of them use Ethernet as a backplane.

    Leave a comment:


  • You-
    replied
    Originally posted by agd5f View Post

    FWIW, AMD did the GPU side of this (i.e., added peer to peer support to dma-buf in general).
    Cool, with cooperation there is less chance that one party wants to take its toys home (or rather, not bring them out to play).

    Thanks for correcting me on this.

    Leave a comment:


  • agd5f
    replied
    Also, HMM doesn't currently handle Peer to Peer access.

    Leave a comment:


  • mppix
    replied
    Originally posted by You- View Post
    This is interesting and just goes to show how much of a behemoth intel is in relation to opensource.
    I would suggest this is also useful to AMD and something they could have worked on, but by having a smaller team, a lot of this type of work is done by intel.
    Early AMD exascale supercomuter reports suggests that they are doing something like infinity fabric over pcie interfaces. This may come down to a very similar concept..

    Leave a comment:


  • agd5f
    replied
    Originally posted by You- View Post
    This is interesting and just goes to show how much of a behemoth intel is in relation to opensource.

    I would suggest this is also useful to AMD and something they could have worked on, but by having a smaller team, a lot of this type of work is done by intel.
    FWIW, AMD did the GPU side of this (i.e., added peer to peer support to dma-buf in general).
    Last edited by agd5f; 22 February 2021, 06:08 PM.

    Leave a comment:


  • You-
    replied
    This is interesting and just goes to show how much of a behemoth intel is in relation to opensource.

    I would suggest this is also useful to AMD and something they could have worked on, but by having a smaller team, a lot of this type of work is done by intel.

    The sales and marketing side of intel may sometimes be evil, but its OSS Side is amazing.

    Leave a comment:


  • tildearrow
    replied
    RDMA just makes me think of RDNA...

    (even though I know they are separate things)

    Leave a comment:


  • mppix
    replied
    Sounds exciting. This, pci4/5 (..6) smart access memory, oneAPI and the like may really start a new compute model where the CPU is only one of many compute nodes in a system.

    Leave a comment:


  • agd5f
    replied
    Originally posted by pegasus View Post
    So this is now the third implementation solving the same problem? As far as I know Mellanox was the first with GPUdirect. Seems like a good scenario for Linus to share some sharp wisdom with the world
    GPUDirect/PeerDirect is not upstream.

    Leave a comment:

Working...
X