Is there any difference between transferring tensors to the GPU in __getitem__ versus in the training loop?

Which practice is considered good?

  1. Calling .cuda() on the features and labels inside __getitem__(self, idx) before returning them, or

  2. In the training loop:
    features, labels = batch
    features, labels = features.cuda(), labels.cuda()

I would recommend the second approach, as it ensures the complete batch is first created via multiprocessing and then transferred to the device.
The first approach might work, but you could easily run into multiprocessing errors if each worker tries to create a new CUDA context.
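The recommended pattern can be sketched as follows (a minimal example with made-up data; `ToyDataset` and its sizes are assumptions for illustration, and the device falls back to CPU when CUDA is unavailable):

```python
import torch
from torch.utils.data import Dataset, DataLoader

class ToyDataset(Dataset):
    def __init__(self, n=8):
        # Dummy data for illustration only.
        self.features = torch.randn(n, 4)
        self.labels = torch.randint(0, 2, (n,))

    def __len__(self):
        return len(self.features)

    def __getitem__(self, idx):
        # Return plain CPU tensors; DataLoader workers can then
        # assemble batches in parallel without touching CUDA.
        return self.features[idx], self.labels[idx]

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
loader = DataLoader(ToyDataset(), batch_size=4, num_workers=0)

for features, labels in loader:
    # The device transfer happens here, in the main process.
    features = features.to(device, non_blocking=True)
    labels = labels.to(device, non_blocking=True)
    # ... forward pass, loss, backward, optimizer step ...
```

Using `.to(device)` instead of `.cuda()` also keeps the code runnable on CPU-only machines.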


I’m getting an error that the object doesn’t have a .cuda() method when using the second approach. From __getitem__ I’m returning a tuple of (image, label). I have successfully applied transformations and changed the dtype to float32, but it errors out saying .cuda() cannot be used on that object.

But it works totally fine with the first method.

The tuple class is a plain Python class and thus doesn’t have the cuda() method.
You would have to unwrap the tensors and call cuda() on each of them.