wolterlw
(Volodymyr)
February 13, 2020, 1:14am
Check out these threads:
Since your dataset is tiny, I don’t think that multiple workers will help you much.
It seems you are currently just slicing the tensors without any transformations.
You could try to load all the data, push it to the GPU beforehand, and slice the batches manually in your training loop. Maybe this will speed up your training a bit.
However, your data and model might be just too small to get a high GPU utilization.
As a small side note: you shouldn’t call the forward method of your model, but the mod…
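A minimal sketch of that "push everything to the GPU and slice manually" idea, assuming the whole dataset fits in GPU memory (the toy data and model below are placeholders, not from the thread):

```python
import torch
import torch.nn as nn

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Push the whole (tiny) dataset to the GPU once, up front.
X = torch.randn(1000, 20, device=device)
y = torch.randint(0, 2, (1000,), device=device)

model = nn.Linear(20, 2).to(device)
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

batch_size, n = 64, X.size(0)
for epoch in range(5):
    perm = torch.randperm(n, device=device)   # shuffle indices on-device
    for i in range(0, n, batch_size):
        idx = perm[i:i + batch_size]
        xb, yb = X[idx], y[idx]               # slice the batch manually
        optimizer.zero_grad()
        loss = criterion(model(xb), yb)       # call model(xb), not model.forward(xb)
        loss.backward()
        optimizer.step()
```

This skips the DataLoader entirely, so there is no per-batch host-to-device copy and no worker overhead at all.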
The DataLoader creates a new PROCESS for every worker, and the dataset has to be "copied" to every worker. Depending on whether you're on Windows or Linux (spawning a process in Windows is much more expensive than forking one in Linux), and on how the dataset stores its data (Tensors don't seem to get copied from what I've tested, but Python structures do), you might have a very high overhead for creating the processes.
Unless you’re working with some supercomputer, I believe 8 workers is more than…
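If you want to see that overhead concretely, a rough (hypothetical) timing harness is to iterate the loader once for each `num_workers` setting and compare:

```python
import time
import torch
from torch.utils.data import DataLoader, TensorDataset

def time_one_pass(num_workers: int) -> float:
    # Toy in-memory dataset as a placeholder for your own.
    dataset = TensorDataset(torch.randn(1000, 20), torch.randint(0, 2, (1000,)))
    loader = DataLoader(dataset, batch_size=64, num_workers=num_workers)
    start = time.perf_counter()
    for _ in loader:          # just iterate; we only care about loading time
        pass
    return time.perf_counter() - start

if __name__ == "__main__":    # guard required on Windows, where workers are spawned
    for n in (0, 2, 4, 8):
        print(f"num_workers={n}: {time_one_pass(n):.3f}s")
```

On a small in-memory dataset like this, `num_workers=0` often wins outright, since the worker startup cost can't be amortized.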
93% is excellent utilization, and I believe lower GPU utilization during validation is expected: since you don't compute gradients or make parameter updates there, the process is a lot more data-intensive.
As a first step, I'd suggest profiling your DataLoader against the training step to figure out where the bottleneck actually is.
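A minimal sketch of that kind of split timing, again with toy data and a toy model as placeholders for your own:

```python
import time
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

if __name__ == "__main__":  # guard needed if workers are spawned (Windows)
    device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
    loader = DataLoader(
        TensorDataset(torch.randn(1000, 20), torch.randint(0, 2, (1000,))),
        batch_size=64,
        num_workers=2,
    )
    model = nn.Linear(20, 2).to(device)
    criterion = nn.CrossEntropyLoss()
    optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

    data_time = compute_time = 0.0
    t0 = time.perf_counter()
    for xb, yb in loader:
        t1 = time.perf_counter()
        data_time += t1 - t0                  # time spent waiting on the loader
        xb, yb = xb.to(device), yb.to(device)
        optimizer.zero_grad()
        loss = criterion(model(xb), yb)
        loss.backward()
        optimizer.step()
        if device.type == "cuda":
            torch.cuda.synchronize()          # wait for the GPU so timing is honest
        t0 = time.perf_counter()
        compute_time += t0 - t1               # time spent in the training step
    print(f"data: {data_time:.3f}s  compute: {compute_time:.3f}s")
```

If `data` dominates, the input pipeline is the bottleneck; if `compute` dominates, the DataLoader is keeping up and there's little to gain from tuning it.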