Cannot run pytorch with CUDA 12.1 on a server with 8 x A100

Same as here which is not reproducible.