Making a circulant matrix from a given vector in a memory-efficient way

I’d like to build a circulant matrix from a given vector of dimension N such that both the operation and the resulting matrix cost only O(N) memory. A higher-dimensional generalization of this operation is needed for a variant of the Transformer I’m trying to investigate. A naive approach is to apply ‘expand’ to the original vector and then apply ‘view’ and ‘slice’ alternately; however, this doesn’t work, as ‘view’ requires its input to be contiguous. Is there any viable way you can think of?
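For concreteness, here is a rough sketch (in PyTorch) of the straightforward construction I’d like to avoid, since it materializes the full N×N matrix:

import torch

N = 3
v = torch.arange(N)  # [0, 1, 2]
# Naive construction: row i is the vector cyclically shifted by i,
# so the whole N x N matrix is materialized (O(N^2) memory).
naive = torch.stack([torch.roll(v, i) for i in range(N)])
# tensor([[0, 1, 2],
#         [2, 0, 1],
#         [1, 2, 0]])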

It’s not really possible with strided arrays (https://en.wikipedia.org/wiki/Stride_of_an_array). PyTorch, NumPy, and many other scientific computing frameworks adopt this approach.
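Roughly speaking, a strided view maps each index tuple to a fixed linear offset into the flat buffer, so it can repeat data (as ‘expand’ does) but it cannot give every row its own wrap-around shift, which is what a circulant matrix needs. A minimal illustration:

import torch

v = torch.arange(3)
m = v.expand(3, 3)   # an O(N) view: every row reads the same underlying buffer
print(m.stride())    # (0, 1): stride 0 just repeats the row
# A circulant matrix would need row i to start at offset i and wrap past the
# end of the length-N buffer, which no single (offset, strides) pair can express.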


Thank you so much for your reply. I guess I’ll give up my current attempt and look for another direction.


What’s the exact use case? I might be able to help.

In the end, I realized that my approach to the bidirectional language model, which required a circulant matrix for managing the Transformer’s cache, was unnecessarily complicated. I’m sorry.

No worries! Good luck with your project!

A bit late, but here is a generic function for PyTorch tensors that builds the circulant matrix along one dimension. It’s based on the unfold operation.

import torch

def circulant(tensor, dim):
    """Get a circulant version of the tensor along the {dim} dimension.

    The additional axis is appended as the last dimension.
    E.g. tensor=[0,1,2], dim=0 --> [[0,1,2],[2,0,1],[1,2,0]]
    """
    S = tensor.shape[dim]
    # Reverse along dim, then append the first S-1 reversed elements so that
    # every length-S sliding window is one cyclic shift (in reversed order).
    flipped = tensor.flip((dim,))
    tmp = torch.cat([flipped, torch.narrow(flipped, dim=dim, start=0, length=S - 1)], dim=dim)
    # unfold builds the sliding windows as a strided view; the last flip restores the order.
    return tmp.unfold(dim, S, 1).flip((-1,))
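
For example, a quick sanity check against the docstring (assuming the function above is in scope):

v = torch.tensor([0, 1, 2])
print(circulant(v, dim=0))
# tensor([[0, 1, 2],
#         [2, 0, 1],
#         [1, 2, 0]])

# It also works along one axis of a higher-dimensional tensor:
batch = torch.arange(6).reshape(2, 3)
print(circulant(batch, dim=1).shape)  # torch.Size([2, 3, 3])

One caveat: unfold itself returns a strided view of the concatenated buffer, but torch.flip copies its input, so the returned circulant matrix is materialized rather than being a pure O(N) view.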