How to get gradients of loss with individual sample in min-batch

LeeDoYup · July 22, 2020, 11:57pm

Hello.

I want to get gradients of each sample’s loss w.r.t. each parameters.
Specifically, when

I want to get the gradients for each sample, that is

To get the gradients for all samples,
I tried to get the gradients for each mini-batch.

However, after i use

loss = torch.sum(-onehot*pred, dim=1)
# onehot: [B, # of class]
# pred: [B, # of class]
loss.backward(gradient=torch.ones_like(loss))

I found that the gradients of loss w.r.t. parameters

print(conv1.weight.grad.shape)

have the shape of [C_in, C_out, w, h] not [B, C_in, C_out, w, h].

That is, the gradients of loss for each sample are accumulated.
How can i get the gradient for each sample?

Is the only way to solve using for-loop?

Thank you…

Doyup Lee

ptrblck · July 25, 2020, 3:16am

This topic deals with the same question and suggests to use @Yaroslav_Bulatov’s repository.