Sorry for the typo; I meant “out of memory”. Actually, torch.bmm does not help much in my task. After running more experiments today, I have reorganized the weird phenomena I came across and posted them in more detail here: https://discuss.pytorch.org/t/many-weird-phenomena-about-torch-matmul-operation/158208