Any guideline on load a pretrained Pytorch NLP model to evaluate new dataset

zorozy · April 9, 2020, 11:35pm

Hi,

I am very new to NLP and would like to ask a very stupid question.

After I trained one NLP model, like GRU, what should I save and how could I reload the model again later on a new test dataset?

Besides, what should I do to record the prediction result for each input in the original sequence?

I am so sorry for this baby question.

ptrblck · April 10, 2020, 5:19am

To save and reload a model you should serialize its state_dict, which contains all parameters and buffers, as explained in the serialization docs.

You could append all outputs or predictions in a list after detaching them and just store the list.
Detaching is necessary, as otherwise the whole computation graph will be added to the list and thus your memory usage might grow:

outputs = []
for data in loader:
    output = model(data)
    outputs.append(output.detach().cpu())

If you are wrapping the code in a with torch.no_grad() block, as is common during evaluation, you don’t need to detach the outputs.