I am using the MSE loss described here.
mse_loss(input, target, size_average=None, reduce=None, reduction='mean') → Tensor
My input and target are both of size [16, 2, 48, 120], i.e. a batch of 16 items where each item is a tensor of size [2, 48, 120].
Supplying the argument
whereas the argument
This probably means that the method averages over the total number of pixels instead of averaging only over the batch size. What can I do to sum over every pixel and then divide by the batch size?
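To make the question concrete, here is a minimal sketch of the computation I am after, assuming random tensors of the shape above. The names `pred`, `batch_size`, `loss_per_item` and `loss_alt` are only for illustration. It uses `reduction='sum'` and divides by the batch size by hand, and I am not sure this is the intended way to do it.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)

batch_size = 16
# Illustrative random data with the shapes from the question.
pred = torch.randn(batch_size, 2, 48, 120)
target = torch.randn(batch_size, 2, 48, 120)

# Default behaviour: average over all 16 * 2 * 48 * 120 elements.
loss_mean = F.mse_loss(pred, target, reduction='mean')

# Sum the squared errors over every element, then divide by the batch size only.
loss_per_item = F.mse_loss(pred, target, reduction='sum') / batch_size

# The same quantity computed explicitly: per-item sums, then the mean over the batch.
loss_alt = (
    F.mse_loss(pred, target, reduction='none')
    .flatten(start_dim=1)  # shape [16, 2 * 48 * 120]
    .sum(dim=1)            # one summed squared error per item
    .mean()                # average over the batch
)

print(loss_mean.item(), loss_per_item.item(), loss_alt.item())
```

With these shapes, `loss_per_item` should differ from `loss_mean` by a factor of 2 * 48 * 120 = 11520, the number of elements per item.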