Different results on each training run, even with the same code, environment, … everything

Hello everybody!
I did the following and ran into a problem:

Code:

import os
import random

import numpy as np
import torch

def seed_torch(seed=0):
    random.seed(seed)
    os.environ["PYTHONHASHSEED"] = str(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)  # if you are using multi-GPU
    torch.backends.cudnn.benchmark = False
    torch.backends.cudnn.deterministic = True

seed_torch()

def main():
    rawnet.train()
    for i, data in enumerate(train_dataloader):
        seed_torch()
        # ... forward pass, loss, backward, optimizer step ...

    seed_torch()
    rawnet.eval()
    seed_torch()
    with torch.no_grad():
        for i, data in enumerate(val_dataloader):
            seed_torch()
            # ... forward pass, validation loss ...

[Screenshot: training and validation loss curves — training loss is nearly identical across runs, validation loss differs]

I get different results on each training run, even with the same code, environment, … everything. In particular, the training loss is almost identical across runs, but the validation loss is DIFFERENT, even with seed_torch() called everywhere in the loops. Please help me solve this problem! Thank you!

Could you check if your model uses any of the non-deterministic methods mentioned in the reproducibility docs?
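
For example, you can make PyTorch raise an error on any known non-deterministic op, so you can find the culprit instead of guessing. A minimal sketch (torch.use_deterministic_algorithms is available from PyTorch 1.8 onward, and the CUBLAS_WORKSPACE_CONFIG value comes from the reproducibility docs):

import os
import torch

# Must be set before the first CUDA call for deterministic cuBLAS GEMMs
os.environ["CUBLAS_WORKSPACE_CONFIG"] = ":4096:8"

# Raise a RuntimeError whenever a known non-deterministic op is used
torch.use_deterministic_algorithms(True)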

Are your results very different? I have an MNIST app, and its loss and accuracy vary a little every time I run it, but the variation is tiny. In terms of accuracy, for example, the change is below 0.5%.

Yes! I wrote my code following the reproducibility docs.

You can see my function above:

def seed_torch(seed=0):
    random.seed(seed)
    os.environ["PYTHONHASHSEED"] = str(seed)
    np.random.seed(seed)                       # as in the reproducibility docs
    torch.manual_seed(seed)                    # as in the reproducibility docs
    torch.cuda.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)
    torch.backends.cudnn.benchmark = False     # as in the reproducibility docs
    torch.backends.cudnn.deterministic = True  # as in the reproducibility docs

Do you know what the problem is?

Yes. The validation loss is very different, but the training loss differs only slightly. You can see it in the figure above.
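
One thing seed_torch() does not cover is the DataLoader workers: with num_workers > 0, each worker process gets its own RNG state, which can change shuffling and augmentation between runs. A minimal sketch of the worker-seeding pattern from the reproducibility docs (train_dataset and the loader settings are placeholders for your own):

import random

import numpy as np
import torch
from torch.utils.data import DataLoader

def seed_worker(worker_id):
    # Each worker derives its seed from the loader's base seed
    worker_seed = torch.initial_seed() % 2**32
    np.random.seed(worker_seed)
    random.seed(worker_seed)

g = torch.Generator()
g.manual_seed(0)

train_dataloader = DataLoader(
    train_dataset,               # placeholder: your Dataset
    batch_size=32,
    shuffle=True,
    num_workers=4,
    worker_init_fn=seed_worker,  # reseeds numpy/random inside every worker
    generator=g,                 # fixes the shuffling order across runs
)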

Hi @Vincent24, did you manage to solve this issue? I am facing exactly the same problem.
