The model won't train on GPU

Hi, I'm new to PyTorch and I'm trying to run my model on the GPU. The Jupyter notebook successfully detects my GPU:

        if torch.cuda.is_available():
            self.device = torch.device("cuda:0")  # you can continue going on here, like cuda:1 cuda:2....etc. 
            print("Running on the GPU")
            torch.cuda.set_device(0)
        else:
            self.device = torch.device("cpu")
            print("Running on the CPU")

Running on the GPU

    print(torch.cuda.current_device())
    print(torch.cuda.get_device_name(torch.cuda.current_device()))

0
GeForce RTX 2060

and I used .to(device) for the model and its inputs

    self.dqn = Network(states_dim, action_dim).to(self.device)
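and the inputs are moved the same way each step, roughly like this (a simplified sketch rather than my exact code, the variable names are just placeholders):

    # simplified sketch: move the batch to the same device before the forward pass
    states = states.to(self.device)
    q_values = self.dqn(states)  # this forward pass should now run on the GPU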

But it still seems to be running on the CPU.

Thank you for your help.

The Windows Task Manager often doesn't report the GPU utilization properly, so you could either select the "Compute" view in its drop-down menu (I'm no Windows expert, as you might notice :wink: ) or use nvidia-smi.
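If you want to double-check from inside the script as well, you could print where the parameters live and how much CUDA memory is currently allocated (a quick sketch, reusing the self.dqn attribute from your snippet):

    import torch

    # the parameters' device shows where the model actually lives
    print(next(self.dqn.parameters()).device)  # expected: cuda:0

    # non-zero once tensors or parameters have been allocated on the GPU
    print(torch.cuda.memory_allocated() / 1024**2, "MiB allocated")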


Yes, it looks like the Task Manager wasn't showing the real state of the GPU usage. Thank you!
The python.exe process is running on the GPU, but it seems the GPU isn't working at full capacity. Is there something wrong, or does it depend on my model…?

It depends on the model and the overall training script. E.g., your data loading might create bottlenecks, or the model workload might be too small to properly saturate the GPU.
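As a generic example (a sketch, not tailored to your script; `dataset` and `device` are placeholders for your own setup), using multiple DataLoader workers and pinned memory often helps keep the GPU busy:

    from torch.utils.data import DataLoader

    # sketch only: `dataset` and `device` stand in for your own objects.
    # Workers prepare batches in the background; pinned memory enables
    # asynchronous host-to-device copies via non_blocking=True.
    loader = DataLoader(dataset, batch_size=256, shuffle=True,
                        num_workers=4, pin_memory=True)

    for states, targets in loader:
        states = states.to(device, non_blocking=True)
        targets = targets.to(device, non_blocking=True)
        # ... forward/backward pass here ...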
