Small detail: You should not do param.data.cuda() but param.detach().cuda(). As .data should not be used anymore.
1 Like
Small detail: You should not do param.data.cuda() but param.detach().cuda(). As .data should not be used anymore.