Saving and loading a model in Pytorch?

ptrblck · September 4, 2018, 2:10pm

In this tutorial the weight updates are performed manually in this line of code.
Since you don’t have internal estimates you don’t have to store anything regarding the optimization.

sleebapaul · September 4, 2018, 2:19pm

I saved the model which performed the following graph.

On reproducing the results with the model saved, my test error is,

End of training | test loss 17.28 | test ppl 32104905.14

Generated result is gibberish.

Something is terribly wrong. I’m not sure where I should check

ptrblck · September 4, 2018, 2:21pm

Something looks fishy. Could you create a new thread and post your complete issue there?
It would also be easier to debug, if you could post your code so that we can have a look.

sleebapaul · September 4, 2018, 2:32pm

I started a new thread at here.

naderAsadi · September 19, 2018, 8:39am

Hi! I have a problem with loading my model. I’m training VGG19 on cifar10 in colab, when I load it in colab it is OK but when I load it on my laptop with same code it gives error. They’re both python3 and trained with cuda.
Error:

Save code

def save_checkpoint(state, filename):
    torch.save({'state_dict': net.state_dict(),
                'optimizer': optimizer.state_dict(),
                }, filename)

Load

checkpoint = torch.load('./vgg19_200.pth')
net.load_state_dict(checkpoint['state_dict'])
optimizer.load_state_dict(checkpoint['optimizer'])

laptopmutia · September 19, 2018, 5:12pm

hello I’m trying to save my adam optimizer,

but why whenever I load it, the state_dict is always different
if I restart my environment?

I’ve also make my own thread here, Saved model have higher loss

thank you

Lee_Jim · September 20, 2018, 6:37am

no such thing as mistake or understand or not, think any is ok

soorajviraat · November 29, 2018, 1:51am

Is there a way to save and load models from s3 directly?

xiao · December 12, 2018, 8:00am

Have you solved this problem? I have encountered this problem, I don’t know where it is wrong.

mutaku · December 13, 2018, 3:06pm

Yes, if you use StringIO you can create a file stream, write your model state to it, then push that to s3.

What I additionally do is use joblib to add compression and pickle after writing to the stream, push that to s3, then unload with joblib back to a file stream object and read the model state back into a model object to resume.

shivam13juna · January 8, 2019, 1:37am

It’s not necessary, you can use .copy() it’ll work fine too.

wizardk · January 8, 2019, 6:46am

He just mean when you need to evaluate or infer.

pranjali97 · January 23, 2019, 11:33am

Hi, I am new to pytorch and was wondering how to create the model class for a trained pytorch model. I wish to use that to save and load the model for serving it with flask.

cuixing158_1 · January 29, 2019, 4:51am

MyModel.eval() insert before or after state_dict method ? i think is after,beacuse after load parameters and freeze some parameters,right ?

ranklord · January 30, 2019, 9:32am

Yes, you are right! Put MyModel.eval() after loading state_dict.

metro.smiles · August 4, 2019, 9:48pm

Hi,

So a sort of related question but in the context of the saving the optimizer. Is saving/loading the full optimizer object, the same as saving/loading only the optimizer’s state_dict() when resuming training? Aside from the obvious that saving only the state_dict(), saves memory…

Rocket · January 5, 2020, 3:57pm

In my case, the results are correct without eval() but not with it.

I am trained Gener. Adv. Net.

sirgogo · February 11, 2020, 8:46pm

Wow. I did not think of that, but this also worked for me and I have no idea why.

Sanpreet_Singh · March 13, 2020, 4:23pm

Hello Bixqu, how are you

To save the model to .pt file and load it please see github repository on this from below link

Simple way to save and load model in pytorch

You can also read the blog on it from below link

But sorry I have not written anything how to continue training from last epoch. I would write on this also. But i hope you will gain some insights from this repository.

With Regards
Sanpreet Singh

pinocchio · April 29, 2020, 8:19pm

What is wrong with doing:

def save_ckpt(path_to_ckpt):
    from pathlib import Path
    import dill as pickle
    ## Make dir. Throw no exceptions if it already exists
    path_to_ckpt.mkdir(parents=True, exist_ok=True)
    ckpt_path_plus_path = path_to_ckpt / Path('db')

    ## Pickle args
    db['crazy_mdl'] = crazy_mdl
    with open(ckpt_path_plus_path , 'ab') as db_file:
        pickle.dump(db, db_file)