Probably better like this, though:
import torch
train_x=torch.rand(200, 130, 130)
train_y=torch.rand(200)
labels = torch.empty_like(train_x)
labels[:,: 1, :1] = train_y.view(-1, 1, 1)
data_labels=torch.stack([train_x, labels], dim=1)
print(data_labels.size())
Then dim=1 will contain your image/label pairs.