Low performance of transferring tensor to CUDA

ptrblck · January 14, 2022, 6:21pm

The to() operations as well as e.g.copy_ accept the non_blocking argument and an example was posted in your other question.