"expand" on non-singleton dimensions without actually repeating the memory?

I am not familiar with the low level implementation, but I understand the reason you explained. It is a pity that it can not be achieved easily. I have no idea how complicated it can be, but maybe we could have some kind of circular indexing implemented for this?

Anyway, thank you for your fast reply!