How to load images for inference in batch

codegypsy1 · December 29, 2018, 7:17am

Hey there I am new to PyTorch. I have a inference code that predicts and classify images. I can predict and classify images one by one, can anyone please help me to classify all the images of a folder in a batch.

Directory structure:
images
… img1.jpg
… img2.jpg
… img3.jpg

Output:
elephant
lion
tiger

How can I load all the image in the folder and predict one by one.
I am using the prediction code as follows:

def predict_image(image_path):
    print("prediciton in progress")
    image = Image.open(image_path)

    transformation = transforms.Compose([
        transforms.RandomResizedCrop(224),
        transforms.ToTensor(),
        transforms.Normalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225])
        ])

    image_tensor = transformation(image).float()
    image_tensor = image_tensor.unsqueeze_(0)

    if cuda:
        image_tensor.cuda()

    input = Variable(image_tensor)
    output = model(input)

    index = output.data.numpy().argmax()
    return index

This works for single images, if called again and again the time of execution will increase. Also I am doing inference on a CPU machine.

Amrit_Das · January 2, 2019, 10:54am

As per my understanding, I wrote this piece of code. To load images and predict.

data_transforms = {
    'predict': transforms.Compose([
        transforms.RandomResizedCrop(224),
        transforms.RandomHorizontalFlip(),
        transforms.ToTensor(),
        transforms.Normalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225])
    ])
    }

dataset = {'predict' : datasets.ImageFolder("./data", data_transforms['predict'])}
dataloader = {'predict': torch.utils.data.DataLoader(dataset['predict'], batch_size = 1, shuffle=False, num_workers=4)}

outputs = list()
since = time.time()
for inputs, labels in dataloader['predict']:
    inputs = inputs.to(device)
    output = model(inputs)
    output = output.to(device)
    index = output.data.numpy().argmax()
    print index

I hope this helps you and solves your problem

codegypsy1 · January 2, 2019, 10:58am

Thank You worked like a charm

Ben_Bowles · April 25, 2019, 11:25pm

What is a device? torch.cuda.current_device() ?

Manuel_Alejandro_Dia · May 15, 2019, 8:14am

Usually a device is where you do your computations, for example if you use a GPU your device will be something like cuda:0 or just 0.

So what torch.cuda.current_device() does is to return the identifier of which GPU is currently being used. This can be really helpful for systems with multiple GPUs.

Docs for more information.

adijindal30 · June 27, 2021, 5:04pm

Does inferencing on the batch reduce the per image inference time?

Manuel_Alejandro_Dia · June 30, 2021, 3:31pm

I guess the answer would depend on your network and input, but according to some tests I ran, time is the same for batch or single image.
So yes, it reduces the per-image inference time.

import torch

single_image = torch.rand(1,3,300,300).cuda()
multiple_images = torch.rand(8,3,300,300).cuda()

network = torch.hub.load('pytorch/vision:v0.9.0', 'resnet50').cuda().eval()

with torch.no_grad():
    %timeit network(single_image)
       #33.2 ms ± 123 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)
    %timeit network(multiple_images)
       #33.2 ms ± 399 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)

Running multiple image inference depends on your application and hardware. For example:

If you are training, the bigger the batch size the better. Because it will allow you to have a better gradient update, and in the end a faster training time. Remember that inference is not all the computation you perform when executing your model. Things like pre and post-processing also take time, and if you can do these steps at the same time for many images your computation time will be reduced!
If you are using a live system, you don’t want to wait and have many images to do inference in all of them at once. Usually, you want to do inference on the latest available image (batch of 1). Even though the computation time per image is reduced, it may be too slow the overhead caused by the rest of the system, making single image inference the best option.

adijindal30 · July 6, 2021, 5:28am

yes you are right and I guess the difference in inference time is quite large when I just using CPU otherwise in the case of GPU, I guess only a little difference in inference time when I did the batch inference. Here is the graph for Resnet-18 inference using GPU, on 256 images.
download (1)

sal_zak · May 11, 2023, 1:45pm

how to turn list of images to a batch to do inference per batch?

Manuel_Alejandro_Dia · May 11, 2023, 2:53pm

What do you mean with list of images?
Can you give an example of your code?

Your question is too vague

sal_zak · May 12, 2023, 12:06pm

a python list of numpy array each array represent an image and i fed the list to this function and i want to return the array of predictions
Screenshot from 2023-05-11 14-41-01

tiramisuNcustard · May 12, 2023, 3:28pm

Since you are working with images, it might be easier to work with ImageFolder. Take a look at its documentation here:

https://pytorch.org/vision/main/generated/torchvision.datasets.ImageFolder.html