When you estimate a model with LBFGS, you have to wrap the forward and backward passes in a closure that the optimizer can re-evaluate, so each optimization step looks something like the following:
```python
def closure():
    optimizer.zero_grad()                 # clear gradients from the previous evaluation
    prediction = model(data)
    loss = criterion(prediction, target)
    loss.backward()
    return loss

# LBFGS may call the closure several times per step (inner iterations, line search)
optimizer.step(closure)
```
Having the loss inside the closure makes it awkward to stash the current loss at each iteration (i.e. you can't just do `losses += [loss.detach().item()]` after the step, since `loss` is local to the closure).
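For instance, the obvious workaround of appending from inside the closure over-counts, because LBFGS may evaluate the closure several times per `.step()` call. A minimal sketch, reusing the `model`/`criterion`/`optimizer`/`data`/`target` names from above:

```python
losses = []

def closure():
    optimizer.zero_grad()
    prediction = model(data)
    loss = criterion(prediction, target)
    loss.backward()
    # runs once per closure evaluation, which can be many times
    # per optimizer.step() (inner iterations, line search)
    losses.append(loss.detach().item())
    return loss

optimizer.step(closure)
```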
One option is to re-evaluate the model and criterion each iteration outside of the closure (with `torch.no_grad()`), but this wastes a forward pass.
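That is, something like this sketch (again reusing the names above, and assuming the `losses` list from earlier):

```python
optimizer.step(closure)

# a second forward pass purely to read off the current loss
with torch.no_grad():
    losses.append(criterion(model(data), target).item())
```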
Is there a better way?