How to using multiple gpus to do inference in this situation

I should average 10 softmax scores to report the final score. I have changed the code of dataloader, _get_item() will return a list that contains 10 sampled image tensors.

How to write code to do inference with using multiple batches and multiple gpus?

Thanks.