Hi everyone,
I know this topic was previously discussed; however, the proposed solutions didn't work for me.
I am trying to perform classification of precomputed features into 7 categories using logistic regression.
I got the following error when training the classifier:
Based on the error message, it seems you have an additional dimension in your model output.
If you would just like to classify the data into 7 classes (i.e. not a pixel-wise classification etc.), your output should have the shape [batch_size, nb_classes], while your target should be a torch.LongTensor containing the class indices in the shape [batch_size].
Try to squeeze the model output and convert your target as:
target = target.long()
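For example, a minimal sketch of the expected setup (the linear model and the sizes are just placeholders for your actual code):

import torch
import torch.nn as nn

batch_size, nb_features, nb_classes = 10, 32, 7
model = nn.Linear(nb_features, nb_classes)  # placeholder classifier
criterion = nn.CrossEntropyLoss()

data = torch.randn(batch_size, nb_features)
target = torch.randint(0, nb_classes, (batch_size,))  # class indices in [0, 6]

output = model(data)       # [batch_size, nb_classes]
output = output.squeeze()  # removes additional size-1 dims, if any
loss = criterion(output, target.long())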
If that doesn’t help, could you post the input, output and target shapes, so that we could debug this issue further?
Also a small issue in your code:
You should call the model directly (output = model(data)), since this will make sure to register all hooks etc. Currently you are using model.forward(data) which might yield some issues in the future.
While your code should work fine using your manual batching approach, you could use a Dataset and DataLoader instead, which will make shuffling, preprocessing the data etc. a bit easier.
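E.g. a sketch using TensorDataset (assuming your precomputed features and labels are already tensors; the names and sizes are placeholders):

import torch
import torch.nn as nn
from torch.utils.data import TensorDataset, DataLoader

features = torch.randn(100, 32)       # placeholder for your precomputed features
labels = torch.randint(0, 7, (100,))  # placeholder class indices

dataset = TensorDataset(features, labels)
loader = DataLoader(dataset, batch_size=10, shuffle=True)

model = nn.Linear(32, 7)
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.SGD(model.parameters(), lr=1e-2)

for data, target in loader:
    optimizer.zero_grad()
    output = model(data)  # call the model directly instead of model.forward
    loss = criterion(output, target)
    loss.backward()
    optimizer.step()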
I have a similar error, so I am posting it here. I tried this method and it's not working.
I was trying to compute the cross entropy loss and got this error: “Expected target size (1, 5), got torch.Size([1, 17451])”.
My cls_preds.size() = torch.Size([1, 17451, 5]) and cls_targets.size() = torch.Size([1, 17451]),
and I was trying to calculate the loss with the following code:
loss = nn.CrossEntropyLoss()
output = loss(cls_preds, cls_targets)
nn.CrossEntropyLoss expects a model output in the shape [batch_size, nb_classes, *additional_dims] and a target in the shape [batch_size, *additional_dims] containing the class indices in the range [0, nb_classes-1].
Based on your output shape it seems you are dealing with 17451 classes and a temporal dimension of 5. The target should thus have a shape of [1, 5] (note the missing channel dimension).
Also, it seems your target might be one-hot encoded, as it's using the class dimension in dim1, but is missing the temporal dimension?
Maybe it's the other way around and you are dealing with 5 classes.
In this case, you would have to permute the output so that it has the shape [1, 5, 17451].
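I.e. something like this sketch (random values stand in for your actual predictions and targets):

import torch
import torch.nn as nn

criterion = nn.CrossEntropyLoss()

cls_preds = torch.randn(1, 17451, 5)           # [batch_size, nb_anchors, nb_classes]
cls_targets = torch.randint(0, 5, (1, 17451))  # class index per anchor

# move the class dimension to dim1: [1, 5, 17451]
loss = criterion(cls_preds.permute(0, 2, 1), cls_targets)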
Yes, you are right: my class_num = 5, my batch size = 1, and the other dimension is the number of anchors.
I did cls_preds.permute(0, 2, 1) and my shape is now [1, 5, 17451].
Now I am getting an error saying “RuntimeError: Assertion `cur_target >= 0 && cur_target < n_classes’ failed. at /pytorch/aten/src/THNN/generic/SpatialClassNLLCriterion.c:111” in nn.nll_loss2d.
If you are dealing with the shape [1, 5, 17451], I assume that each “anchor” belongs to one of the 5 classes?
If so, then the target should contain class indices in the range [0, 4].
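You could quickly check this before computing the loss, e.g.:

# sanity check: class indices must be in [0, nb_classes-1]
nb_classes = 5
print(cls_targets.min().item(), cls_targets.max().item())
assert cls_targets.min().item() >= 0
assert cls_targets.max().item() < nb_classes

If some anchors should be ignored (e.g. marked with -1 in the target), you could pass ignore_index=-1 to nn.CrossEntropyLoss so that they don't contribute to the loss.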
Hi, I face the same problem as Naveen. The shape of my output is torch.Size([1, 20, 4]), where 20 is the time sequence length and 4 is the number of classes. Softmax was used as the final activation function, and the loss is nn.CrossEntropyLoss(). I use the following expression:
loss = loss_func(output, torch.max(label, 2)[1])
I got this error:
ValueError: Expected target size (2, 4), got torch.Size([2, 20])
But if I change my time sequence length to 4, this error disappears. Why?