Copying a tensor to shared memory from the GPU

If I’m bringing a tensor back from the GPU to the CPU and need to use it in multiple processes, can I move it directly into shared memory and avoid the copy with .share_memory_()?
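
To make the question concrete, here is a minimal sketch of one approach that might avoid the intermediate copy: allocate the CPU destination in shared memory first, then copy the GPU tensor straight into it with copy_(). The helper name to_shared_cpu and the example tensor are my own placeholders, and I haven't confirmed that this is the recommended way to do it. The sketch falls back to the CPU when no GPU is available so that it still runs.

```python
import torch


def to_shared_cpu(src: torch.Tensor) -> torch.Tensor:
    """Copy `src` into a CPU tensor whose storage lives in shared memory.

    Hypothetical helper for illustration, not a PyTorch API.
    """
    # Allocate an uninitialized CPU tensor and move its storage to shared
    # memory before any real data has been written to it.
    dst = torch.empty(src.shape, dtype=src.dtype, device="cpu").share_memory_()
    # Copy the source values (device-to-host when `src` is on the GPU)
    # into the shared-memory tensor.
    dst.copy_(src)
    return dst


# Assumption: use the GPU when present, otherwise stay on the CPU.
device = "cuda" if torch.cuda.is_available() else "cpu"
gpu_tensor = torch.arange(12, dtype=torch.float32, device=device).reshape(3, 4)

shared = to_shared_cpu(gpu_tensor)
print(shared.device, shared.is_shared())
```

The returned tensor could then be passed to worker processes through torch.multiprocessing (for example via a Queue), which hands over a handle to the shared storage rather than pickling the data.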
