I have a very large net. I put one part of the net on cuda 0 and another part on cuda 1. But there is no copy_() operation for Variables, so I cannot directly propagate the gradient from one GPU device to another. Will a future release support this feature? Thanks.

No. If you want to copy a CUDA variable from one device to another, do:

var2 = var1.cuda(new_device)
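
The cross-device copy is itself part of the autograd graph, so gradients should flow back through it to the first device. Here is a minimal sketch of a net split in two, written with the newer tensor API where `.to(device)` plays the same role as `.cuda(new_device)`. The two `nn.Linear` layers, their sizes, and the CPU fallback are assumptions made for illustration, not something from the question.

```python
import torch
import torch.nn as nn

# Assumption: fall back to CPU when fewer than two GPUs are visible,
# so the sketch still runs on any machine.
if torch.cuda.device_count() >= 2:
    dev0, dev1 = torch.device("cuda:0"), torch.device("cuda:1")
else:
    dev0 = dev1 = torch.device("cpu")

# Hypothetical two-part network: one half per device.
part1 = nn.Linear(4, 8).to(dev0)
part2 = nn.Linear(8, 1).to(dev1)

x = torch.randn(2, 4, device=dev0)
h = part1(x)            # activation lives on dev0
h = h.to(dev1)          # differentiable copy to the second device
loss = part2(h).sum()   # loss lives on dev1
loss.backward()         # gradients flow back across the device boundary

# part1 sits before the copy, so a gradient here means backward crossed it.
grad_reached_part1 = part1.weight.grad is not None
```

If `grad_reached_part1` is true, the backward pass crossed the copy and reached the first half of the net, so no separate copy_() step is needed for the gradients.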