Equivalence of torch.tensor([[1]]) and torch.tensor(1)

marcindulak · December 2, 2019, 9:40pm

I’ve encountered the following, and looking for an explanation

import torch
print(torch.__version__)
1.3.1
a = torch.tensor([[1]])
b = torch.tensor(1)
print(a.shape, b.shape)
torch.Size([1, 1]), torch.Size([])
print(a == b)
tensor([[True]])

richard · December 2, 2019, 9:43pm

If you want to detect equivalence of sizes and shape, torch.equal is the right thing to use.

In your example above, a == b, we are broadcasting a and b and then performing the operation: https://pytorch.org/docs/stable/notes/broadcasting.html.

The steps look roughly like the following:

a has size [1, 1], b has size []
If the sizes are broadcastable (they are), we make it so that they have the same size before evaluation. So a_broadcasted has size [1, 1], b_broadcasted has size [1, 1]
We compare a_broadcasted with b_broadcasted. This produces a tensor with size [1, 1] that has one element True.

marcindulak · December 2, 2019, 9:45pm

Understood. Is there a proper name for a torch.Size([])?

marcindulak · December 2, 2019, 9:48pm

OK, it looks it’s called a scalar https://discuss.pytorch.org/t/what-does-torch-size-0-means.

kartikey_shaurya · December 2, 2019, 9:57pm

thanks for answering