Hi,
I am doing binary image classification and using BCEWithLogitLoss.
Initally, I was getting RuntimeError: result type Float can’t be cast to the desired output type Long
So after searching, I converted the pred and target to float but now I am getting RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn
I really don’t have an idea where I am doing wrong -

My training loop looks like the below -

model.train()
for batch_idx, (data, target) in enumerate(loader['train']):
# move to GPU
if torch.cuda.is_available():
data, target = data.to('cuda', non_blocking=True), target.to('cuda', non_blocking = True) # noqa
optimizer.zero_grad()
output = model(data)
pred = torch.argmax(output, dim=1)
loss = criterion(pred.float(), target.float()). # Conversion of pred and target to float
loss.backward()
optimizer.step()
train_loss += ((1 / (batch_idx + 1)) * ((loss.data) - train_loss))

My transfer learning & loss function loading code snippet looks like below -

Hi @bing, you can’t differentiate torch.argmax with respect to output (as torch.argmax has no grad_fn) so you need to find another way to convert your output tensor to a prediction with an operation that has a grad_fn. A minimal example below to show that torch.argmax has no grad_fn.

I did tried calculating the loss function as you suggested earlier but I was getting ValueError: Target size (torch.Size([1])) must be the same as input size (torch.Size([1, 2]))
So I pivoted to calculating the argmax
Below is the shape of my output and target

Shape of output: torch.Size([1, 2])
Shape of target: torch.Size([1])

So, I had a quick read through the docs for BCEWithLogitsLoss (docs here). These shapes represent the [batch, num_classes] respectively (and obviously should have the same size).

So check your target Tensor has the right shape

Or perhaps you need to reduce output to match the shape of target (as that’s what torch.argmax was effectively doing)

I am initially trying to run the network on 1 sample only, so the target is supposed to be of shape 1, I tried unsqueezing also but it didn’t work out.
Yes, the output is supposed to be a single value but I really don’t know if not to use argmax then how to do it.
Below are my Target and output values -

If your target is a scalar for a single sample, it should have a shape of [1,1] because the shape is defined as [num_samples, size_of_one_sample] which corresponds to [1,1].

If you’re trying to get the position of the max value of output, surely you should be using torch.max instead of torch.argmax (as torch.max has a grad_fn)?

So,

pred = torch.max(output, dim=1, keepdim=True)[0] #need the [0] to return values, [1] is indices

and make sure to have keepdim=True (so your shape is correct!), and this approach has a grad_fn,