Random seed initialization

Bixqu · September 26, 2017, 7:13am

I have a problem regarding a large variation in the result I get, by running my model multiple times. The exact same architecture and training gives anywhere from 91.5% to 93.4% accuracy on image classification (cifar 10).

The problem is that I don’t know how to use the torch random seed in order to get the better results, not the worse ones. I tried various values for the random seed, with:

torch.manual_seed(7)

and I get the lower bound of the results. Any ideas?

smth · September 27, 2017, 5:34am

if you are using GPU, you might also need to set torch.cuda.manual_seed_all.
http://pytorch.org/docs/master/cuda.html#random-number-generator

vsag · January 29, 2018, 9:54am

@smth do we need to set torch.manual_seed() and torch.cuda.manual_seed_all() or the second is enough? thanks.

smth · January 29, 2018, 9:55am

with the latest pytorch 0.3 version you only need to set torch.manual_seed which will seed all devices

sid · February 5, 2018, 12:41am

Is the random number generator platform independent?

smth · February 5, 2018, 7:09pm

the CPU RNG is platform-independent. I am not sure about the CUDA RNG and what guarantees NVIDIA gives across GPU models, CUDA versions and platforms.

voxmenthe · March 29, 2018, 9:04am

Funny, even though I have included both:

torch.manual_seed(999)

and

if torch.cuda.is_available():
torch.cuda.manual_seed_all(999)

I am still getting inconsistent results, fluctuating 1-2% by re-running the model. I wonder why that could be?

ptrblck · March 29, 2018, 9:44am

Could you try to add torch.backends.cudnn.deterministic = True to your code?
CUDNN has some non-deterministic methods, so small fluctuations might come from this.

voxmenthe · March 31, 2018, 5:29am

I added:

torch.backends.cudnn.deterministic = True in addition to:

torch.manual_seed(999) and

if torch.cuda.is_available(): torch.cuda.manual_seed_all(999)

but accuracy for same model/same data still varies considerably across runs. I’ve even tried duplicating the above in the code and even tried switching to the latest version of pytorch (3.1) but still getting the same variability in accuracy across runs for same model/same data. Weird.

amirkargar · April 9, 2018, 4:20pm

Hi,

I’m having the same issue, did you figure out a way to make the results consistent across runs?

Thanks,
Amir

Darren_Lee · April 12, 2018, 3:09pm

Hi,

Have you figured out how to make the results reproducible now?

Thanks,
Darren

ivan-bilan · May 16, 2018, 6:34pm

Same problem here, running on PyTorch 0.4. I am using RRelu though, even though I’ve set all the flags mentioned above, results differ by a margin of +/- 0.5% from run to run.

Yufeng_Ma · May 17, 2018, 12:58am

What’s the number of workers for your dataloader? The following post might be helpful for deterministic results.

wtsangmed · September 4, 2018, 5:33pm

Was following this post b/c I ran into same issues training an autoencoder. I don’t know if the OP has solved the problem. but I did a test last night on a AWS GPU and cuda on w/ the parameters below gave me consistent results.
torch.backends.cudnn.deterministic = True
torch.manual_seed(999)

Further I explicitly specify model.eval() after training when computing the decoders and encoders.

Alternatively when I have, below, the results were inconsistent.
torch.backends.cudnn.deterministic = True
torch.cuda.manual_seed_all(999)

As an above poster mentioned it seems as though torch.manual_seed() applies to both cuda and cpu devices for the latest version. So if you’re not getting consistent result w/ torch.cuda.manual_seed_all, try just torch.manual_seed. This may depend on the pytorch version you have installed…Hope this helps.

mutaku · November 6, 2018, 3:30am

Good info.

The docs also suggest setting: torch.backends.cudnn.benchmark = False

and remember that Numpy should be seeded as well.

–> Randomness [Docs]

Noam_Salomonski · February 18, 2019, 11:26pm

did anyone solve this yet?

isalirezag · March 17, 2019, 9:05pm

Sounds like there is another question related here.

anyways, I think this can be a solution:

manualSeed = 1

np.random.seed(manualSeed)
random.seed(manualSeed)
torch.manual_seed(manualSeed)
# if you are suing GPU
torch.cuda.manual_seed(manualSeed)
torch.cuda.manual_seed_all(manualSeed)


torch.backends.cudnn.enabled = False 
torch.backends.cudnn.benchmark = False
torch.backends.cudnn.deterministic = True

also in the dataloader i set num_workers = 0

based on here
you also need to change worker_init_fn as :

def _init_fn():
    np.random.seed(manualSeed)
    

DataLoding = data.DataLoader(..., batch_size = ..., 
                             collate_fn = ..., 
                             num_workers =..., 
                             shuffle = ..., 
                             pin_memory = ...,
                             worker_init_fn=_init_fn)

I noticed if we dont do torch.backends.cudnn.enabled = False the results are very close, but some times not match
p.s. im using pytorch 1.0.1

ShangxuanWu · July 28, 2019, 2:56am

Thanks!

num_workers = 0 and torch.backends.cudnn.enabled = False are the real thing that works! And I also see that if you train one step 10 times, only using num_workers = 0 we can get exactly same output 8 times and different output 2 times.

arnabsinha · June 9, 2020, 6:50am

np.random.seed(0)
random.seed(0)
torch.manual_seed(0)
torch.cuda.manual_seed(0)
torch.cuda.manual_seed_all(0)
torch.backends.cudnn.enabled = False
torch.backends.cudnn.benchmark = False
torch.backends.cudnn.deterministic = True

and setting dataloader like the following:

torch.utils.data.DataLoader(training, shuffle = True, batch_size=BATCH_SIZE, worker_init_fn=np.random.seed(0),num_workers=0)

WORKED FOR ME!

I am using Pytorch version 1.0.0.

Tethys_Sun · August 25, 2020, 4:01am

I tried exactly same setting, even with torch.backends.cudnn.enabled =False, the results are not the same… Do you have any idea?