Model trained with PyTorch 2.0 on multiple GPUs does not work on one GPU

I didn’t change anything except the device number. Interestingly, the model works on 1 GPU after I changed the compile backend to torch.compile(self.model, backend="aot_eager"), following the thread "PyTorch 2.0 compile problem in mac". Honestly, I have no idea how or why it works…
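For anyone landing here, this is roughly what the change looks like in isolation. It is only a sketch: the Wrapper class and the tiny nn.Linear model are hypothetical placeholders (my real model is not shown), and it falls back to CPU when no GPU is visible. As far as I understand, the "aot_eager" backend still traces the model through AOTAutograd but runs the resulting graph with ordinary eager kernels instead of the default Inductor code generation. That might hint at where the problem lies, but I have not confirmed it.

```python
import torch
import torch.nn as nn


class Wrapper:
    """Hypothetical stand-in for the class that owns self.model in the post."""

    def __init__(self, device: torch.device):
        # Tiny placeholder network; the real model is not shown in the post.
        self.model = nn.Linear(4, 2).to(device)
        # "aot_eager" traces with AOTAutograd but executes the graph with
        # eager kernels, so the default Inductor code generation is skipped.
        self.model = torch.compile(self.model, backend="aot_eager")

    def predict(self, x: torch.Tensor) -> torch.Tensor:
        # Inference only, so gradients are not needed.
        with torch.no_grad():
            return self.model(x)


# Use the single GPU if one is visible, otherwise fall back to CPU.
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
wrapper = Wrapper(device)
out = wrapper.predict(torch.randn(3, 4, device=device))
print(out.shape)
```

Dropping the backend argument goes back to the default backend, which makes it easy to compare the two on the same machine.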