Image classification tensor

Hello @ptrblck,

In fact, I want to concatenate the image pixels (train_x) and their labels (train_y), and then convert the result to a tensor. I don't know how to do this.

Thank You

Some more details about your objective might help.

In order to concat 2 tensors, they need to be the same size, except in the dim being concatenated. You could do:

labels = torch.empty_like(train_x)  # same shape as train_x, so the cat sizes match
labels[:, 0:1, :labels.size()[1], 0:1] = train_y.view(-1, 1, labels.size()[1], 1)

data_labels = torch.cat([train_x, labels], dim=1)
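To make that size rule concrete, here is a toy example (shapes picked arbitrarily):

import torch

a = torch.rand(2, 3, 4)
b = torch.rand(2, 5, 4)  # differs from a only in dim 1

c = torch.cat([a, b], dim=1)  # works, since all other dims match
print(c.size())  # torch.Size([2, 8, 4])

# torch.cat([a, b], dim=0) would raise a RuntimeError,
# because a and b differ in dim 1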

But, in most cases, you might find it easier to just use:

data_labels = [train_x, train_y]
...
train_x, train_y = data_labels[0], data_labels[1]
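And if you later want to iterate over them in batches, TensorDataset keeps the pairing for you. A minimal sketch, assuming train_x and train_y are already tensors with the same length along dim 0 (sizes made up):

import torch
from torch.utils.data import TensorDataset, DataLoader

train_x = torch.rand(200, 1, 130, 130)
train_y = torch.rand(200)

dataset = TensorDataset(train_x, train_y)  # pairs each image with its label
loader = DataLoader(dataset, batch_size=32, shuffle=True)

for batch_x, batch_y in loader:
    pass  # train on the batch here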

Thank you so much! I will try it.

But I have an error:

Call view on a PyTorch tensor, not a NumPy array:

import torch
import numpy as np

x = torch.randn(2, 3, 4, 5)
x = x.view(-1, 1, 1, 1)  # works

x = np.random.randn(2, 3, 4, 5)
x = x.view(-1, 1, 1, 1)  # fails
# TypeError: view() takes from 0 to 2 positional arguments but 4 were given
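As a side note, the NumPy equivalent of view is reshape, which also works on tensors:

import numpy as np
import torch

x = np.random.randn(2, 3, 4, 5)
x = x.reshape(-1, 1, 1, 1)  # NumPy equivalent of view

y = torch.randn(2, 3, 4, 5)
y = y.reshape(-1, 1, 1, 1)  # reshape also works on tensors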

As @ptrblck said, if it's a NumPy array, you'll need to convert it into a PyTorch tensor via torch.from_numpy():

labels[:, 0:1, :labels.size()[1], 0:1] = torch.from_numpy(train_y).view(-1, 1, labels.size()[1], 1)

data_labels = torch.cat([train_x, labels], dim=1)

https://pytorch.org/docs/stable/generated/torch.from_numpy.html
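One thing to keep in mind: torch.from_numpy() shares memory with the source array rather than copying it, so in-place changes on either side show up in both:

import numpy as np
import torch

arr = np.array([1.0, -1.0])
t = torch.from_numpy(arr)  # no copy; shares memory with arr

arr[0] = 5.0
print(t)  # tensor([ 5., -1.], dtype=torch.float64)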

I have an error:

You changed train_y.view(-1, 1, labels.size()[1], 1) to train_y.view(-1, 1, labels.size()[0], 1). The -1 is intended to capture the batch size, as this can be variable, while the labels.size()[1] captures the number of labels per item. Change it back to dim 1 so the sizes match.
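A quick example of what the -1 infers (sizes made up):

import torch

train_y = torch.rand(8, 3)  # 8 items with 3 labels each
v = train_y.view(-1, 1, 3, 1)
print(v.size())  # torch.Size([8, 1, 3, 1]); the -1 became the batch size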

Additionally, it seems your images are missing a dim. Images should have 4 dimensions: batch size, channels, height, width.

If you are getting your images/labels from a NumPy dataloader, it could be squeezing the channels dim. You'll need to add that back in before passing them to a model for inference if it has convolutional layers, i.e. train_x = train_x.unsqueeze(1).
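For example, with made-up sizes:

import torch

train_x = torch.rand(200, 130, 130)  # (batch, height, width), channels squeezed out
train_x = train_x.unsqueeze(1)       # add the channels dim back
print(train_x.size())                # torch.Size([200, 1, 130, 130])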

If the model is strictly linear layers, then you can resize the labels view accordingly.
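A sketch of what that would look like with flattened 2D data (sizes assumed):

import torch

train_x = torch.rand(200, 16900)  # 200 flattened 130x130 images
train_y = torch.rand(200)

labels = torch.empty_like(train_x)
labels[:, :1] = train_y.view(-1, 1)  # 2D view to match the 2D data

data_labels = torch.cat([train_x, labels], dim=1)
print(data_labels.size())  # torch.Size([200, 33800])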

But train_x contains the pixels for each image.

I changed it because I got this error:

As I suspected, you're missing the channels dim. Does your model have Conv2d layers?

I did not train the model yet. In fact, for the image classification, I have a folder with human faces and two CSV files: train_csv contains the name of each image (e.g. 0001.jpg) and its label (1 or -1). So I start by collecting them into a dataframe.
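Roughly like this (the file name is just a placeholder):

import pandas as pd

# columns: image file name (e.g. 0001.jpg) and label (1 or -1)
train_df = pd.read_csv("train.csv")
print(train_df.head())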

Then just remove the 2nd dim in the .view():

labels[:, 0:labels.size()[1], 0:1] = train_y.view(-1, labels.size()[1], 1)

It did not change anything:

Can you print the sizes of train_x and train_y?

Yes:

import torch

train_x = torch.rand(200, 130, 130)  # (batch, height, width)
train_y = torch.rand(200)            # one label per image

labels = torch.empty_like(train_x)
labels[:, :1, :1] = train_y.view(-1, 1, 1)  # write each label into one slot

data_labels = torch.cat([train_x, labels], dim=1)

print(data_labels.size())  # torch.Size([200, 260, 130])