Slim difference in outputs of a model for same seed/inputs

Check this post:

Set number of thread will do the trick: torch.set_num_threads(1)