If I’m bringing a tensor back from the GPU to the CPU and need to use it in multiple procs, can I move it directly in to shared memory and avoid the copy with .share_memory_()
?
1 Like
If I’m bringing a tensor back from the GPU to the CPU and need to use it in multiple procs, can I move it directly in to shared memory and avoid the copy with .share_memory_()
?