Swapping TCP/IP for RDMA in GVirtuS didnโt just improve speedโit transformed GPGPU remoting. With Mellanox Infiniband, we slashed SAXPY execution time by 82% and Matrix Multiplication by 55%. The secret? Eliminating system calls and leveraging pre-registered memory.
๐ Follow us on LinkedIn! https://www.linkedin.com/company/clever-project/?viewAsMember=true
๐ Check the updates from the website: www.cleverproject.eu
๐ Full paper in: https://zenodo.org/records/14717622