Shuffling a Tensor

brookisme · September 18, 2018, 8:40pm

Hi Everyone -

Is there a way to shuffle/randomize a tensor. Something equivalent to numpy’s random.shuffle.

Thanks!

asml · September 18, 2018, 9:36pm

Just index with a tensor of random indices.

ptrblck · September 18, 2018, 11:10pm

You could use torch.randperm to create random indices and then index it using @asml’s suggestion.

brookisme · September 18, 2018, 11:46pm

@asml , @ptrbick - thanks both. is something like this

t=torch.tensor([[1,2],[3,4]])
r=torch.randperm(2)
c=torch.randperm(2)
t=t[r][:,c]

The most elegant solution? Its simple enough but the indexing in t[r][:,c] feels a bit odd

ptrblck · September 18, 2018, 11:52pm

You could merge the indexing or use view instead:

t=torch.tensor([[1,2],[3,4]])
r=torch.randperm(2)
c=torch.randperm(2)
t=t[r[:, None], c]

# With view
idx = torch.randperm(t.nelement())
t = t.view(-1)[idx].view(t.size())

brookisme · September 19, 2018, 12:00am

Oh smart – I like the .view() solution, especially since nbelement and size are fixed. The merged-indexing is nice but probably no less awkward than my first go.
Thanks again!

victor1 · October 3, 2018, 11:18pm

Don’t do this, it is not a real random transformation!

indeed:
The number of possible transformations for a N x N square matrix: (N*N)!
Or, with two permutations of the lines and the columns as you do, there are (N!)*(N!) possible transformation
And (N*N)! is far higher than (N!)*(N!) when N is high…
with you code, the matrix

t=torch.tensor([[1,2],[3,4]])

will never be randomized into

t=torch.tensor([[1,4],[2,3]])

Use the code of @ptrblck with the view, it is a good one

tsap · January 7, 2021, 11:27am

Hello,
If we want to shuffle the order of image database (format: [batch_size, channels, height, width]), I think this is a good method:

t = torch.rand(4, 2, 3, 3)
idx = torch.randperm(t.shape[0])
t = t[idx].view(t.size())

t[idx] will retain the structure of channels, height, and width, while shuffling the order of the image.

ncuxomun · January 21, 2021, 1:35am

Exactly what I was looking for. Thanks, mate!

InfT · October 6, 2021, 11:07am

This code works, but the result changes at every run. How can I make it deterministic?

ptrblck · October 7, 2021, 6:04am

Seed the pseudorandom number generator via torch.manual_seed(SEED) before using the random operation.

YannDubs1 · November 10, 2021, 11:37pm

If it’s on CPU then the simplest way seems to be just converting the tensor to numpy array and use in place shuffling :

t = torch.arange(5)             
np.random.shuffle(t.numpy())
print(t) 
# tensor([0, 2, 3, 1, 4])

HashRocketSyntax · December 28, 2021, 5:22pm

For numpy parity, it would be handy to have torch.shuffle()

HashRocketSyntax · May 9, 2022, 3:38pm

For batch-first shuffling:

tzr = torch.tensor([
    [[1],[1],[1]],
    [[2],[2],[2]],
    [[3],[3],[3]],
    [[4],[4],[4]],
])

rand_indx = torch.randperm(len(tzr))

tzr[rand_indx]

returns

tensor([
    [[3], [3], [3]],
    [[4], [4], [4]],
    [[2], [2], [2]],
    [[1], [1], [1]]
])