I’m training a model on a small dataset (139 images, 14 MB total, stored on an HDD) for an object detection project. I have a GTX 1650 with 4 GB of memory in my laptop, and I’m using YOLOv8 pre-trained models. I start the training with yolo detect train [arguments] using these arguments:
data=config.yaml
model=yolov8m.pt
epochs=3
imgsz=640
batch=4 (can’t go higher than 4 without running out of GPU memory)
workers=8
device=0 (makes no difference whether it’s set or omitted)
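For reference, an Ultralytics detection config.yaml typically looks like the sketch below. The paths and class names here are placeholders of my own, not taken from the question:

```yaml
# Hypothetical dataset layout; adjust paths and names to your project.
path: datasets/my_dataset   # dataset root directory
train: images/train         # train images, relative to path
val: images/val             # val images, relative to path
names:
  0: widget                 # class id -> class name
```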
I’ve checked whether PyTorch recognizes my GPU using the code below, and it does:
import torch
torch.cuda.is_available()  # True
But during training, GPU utilization sits at 0–2%, GPU memory at 3/4 GB, CPU usage at 10–16%, and RAM at 14/16 GB. Why is GPU utilization so low? Am I doing something wrong?
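Low GPU utilization combined with near-full RAM usually means the GPU is starved by the CPU-side input pipeline (disk reads, decoding, augmentation) rather than unused. One quick, dependency-free check is to time raw disk reads of the image folder; if this alone takes longer than a training step, the HDD is the bottleneck. This is just a sketch — the folder path and `*.jpg` pattern are placeholders:

```python
import time
from pathlib import Path

def read_throughput(folder, pattern="*.jpg"):
    """Time sequential raw reads of every file matching `pattern` in `folder`.

    A rough bottleneck check: returns (total_bytes, seconds). If the HDD is
    slow here, the GPU will idle waiting for batches during training.
    """
    start = time.perf_counter()
    total = 0
    for path in Path(folder).glob(pattern):
        total += len(path.read_bytes())
    return total, time.perf_counter() - start
```

If reads are slow, lowering workers (8 may oversubscribe a laptop CPU) or caching the 14 MB dataset in RAM can help.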
As you can see in the posted picture, only 3D, Copy, and Video Encode/Decode are shown. I’m not using Windows myself, but I know that Task Manager’s default view does not show CUDA compute kernels.
Based on this post, it seems a CUDA view may also be available, but I’m not sure.
I would recommend sticking to nvidia-smi, as it will report GPU utilization properly.
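If you want to poll utilization during training rather than eyeball Task Manager, nvidia-smi has a scriptable query mode. A minimal sketch, assuming nvidia-smi is on PATH — the helper names are my own:

```python
import subprocess

def parse_smi(csv_text):
    """Parse nvidia-smi CSV output; each line looks like '37, 2048'.

    Returns a list of (utilization_percent, memory_used_mib) tuples,
    one per GPU.
    """
    rows = []
    for line in csv_text.strip().splitlines():
        util, mem = (field.strip() for field in line.split(","))
        rows.append((int(util), int(mem)))
    return rows

def gpu_utilization():
    """Query per-GPU utilization and memory use via nvidia-smi."""
    out = subprocess.check_output(
        ["nvidia-smi",
         "--query-gpu=utilization.gpu,memory.used",
         "--format=csv,noheader,nounits"],
        text=True,
    )
    return parse_smi(out)
```

Running gpu_utilization() in a loop (or `nvidia-smi -l 1` directly in a terminal) while training shows whether the GPU is actually doing work between data-loading stalls.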
It seems that a few months after my post, a new feature called "Hardware-Accelerated GPU Scheduling" was added to Windows 10 (and 11).
When this feature is on, CUDA does not appear as an option in the dropdown menu on the GPU tab of Task Manager (source). Turning the feature off with the following steps (in Windows 11) should bring CUDA back to Task Manager:
Open Windows Settings > System > Display > Graphics
Select Change default graphics settings
Turn off "Hardware-Accelerated GPU Scheduling".
Reboot the system.
Note: these steps might differ slightly on Windows 10; a quick search will show you how to turn it off there.