Reduce Idleness Between Batch Loads

12 workers supply the GPU with data. During inference, the GPU shows:

  • 100% GPU utilisation
  • all GPU RAM is used

htop shows, more or less, that:

  • none of the CPUs are bottlenecked.

But between batch loads, there are moments when the GPU % drops to zero.

I suspect these drops are the main reason for low overall GPU utilisation.

Is there any way to reduce this idleness?

I have many other avenues for improving performance, but this one seems blindingly obvious; I am just unsure how to address it.
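One way to confirm that the gaps are loader-bound is to time the data-loading step separately from the GPU step. A minimal sketch (illustrative only, not the poster's code):

```python
import time
import torch

def run_epoch(loader, model, device):
    """Split wall time into 'waiting on the loader' vs. 'GPU compute'."""
    data_time = 0.0
    compute_time = 0.0
    end = time.perf_counter()
    for images, _ in loader:
        data_time += time.perf_counter() - end   # time spent waiting on the loader
        start = time.perf_counter()
        images = images.to(device, non_blocking=True)
        with torch.no_grad():
            model(images)
        if device.type == "cuda":
            torch.cuda.synchronize()             # make GPU work visible to the host clock
        compute_time += time.perf_counter() - start
        end = time.perf_counter()
    return data_time, compute_time
```

If `data_time` dominates, the workers cannot keep up and the idle gaps are the loader's fault.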


Can you describe the structure of the batch and the size of the tensors?
I assume you have memory pinning enabled and non_blocking=True?
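For context, the setup being asked about would look roughly like this (dataset and names are illustrative, not the poster's actual code):

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

# Illustrative stand-in dataset; the poster's dataset/collate function is not shown.
dataset = TensorDataset(
    torch.randn(128, 3, 256, 256),
    torch.zeros(128, dtype=torch.long),
)

loader = DataLoader(
    dataset,
    batch_size=32,
    num_workers=12,           # parallel CPU workers feeding the GPU
    pin_memory=True,          # page-locked host memory enables async H2D copies
    persistent_workers=True,  # keep workers alive between epochs
    prefetch_factor=2,        # batches each worker pre-loads ahead of time
)

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
for images, labels in loader:
    # non_blocking=True can overlap the copy with GPU compute when memory is pinned
    images = images.to(device, non_blocking=True)
    labels = labels.to(device, non_blocking=True)
    # ... inference ...
```

Without pinned memory, `non_blocking=True` silently falls back to a synchronous copy, so the two settings only help together.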


Here are some more details:



Collate function:


non_blocking etc.:


Image size (actually the stack of images): torch.Size([2010, 3, 256, 256])
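As a side note, a stack that size is substantial in host memory; assuming float32 (4 bytes per element), it works out to roughly 1.5 GiB, which matters when that buffer is pinned:

```python
# Memory footprint of the stacked image tensor from the post,
# assuming float32 (4 bytes per element).
num_elements = 2010 * 3 * 256 * 256   # 395,182,080 elements
size_bytes = num_elements * 4         # 1,580,728,320 bytes
size_gib = size_bytes / 2**30         # ≈ 1.47 GiB
print(f"{size_bytes} bytes ≈ {size_gib:.2f} GiB")
```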

OK, I realised I didn't read the num_workers argument correctly! All good now.
