Why the .eval() function is causing fluctuations in output loss?

xaggi · May 31, 2019, 12:30pm

I am trying to write a neural network code for which a basic sanity check I tried was to train and validate it using the same set of input data i.e. validation data = train data.

With .eval() active at the validation time the loss output plots like this:

In a similar setting when I run the same code without activating .eval() function i.e. inferring the network in training mode, the loss function seems very smooth and as expected, train and validation loss are pretty much the same:

What might be causing this behavior in a network’s implementation?

My code is written in Pytorch 0.4.0

al3x · May 31, 2019, 1:45pm

I think torch.no_grad() is the recommended way of dealing with validation. torch.eval() is usually reserved for testing time. What happens when you use this instead?