How to get a 'cpu' state dict

timbmg · September 7, 2018, 5:49pm

Assuming my model is already on a GPU, is there a way to get a state_dict with CPU tensors without first moving the model to the CPU and then back to the GPU?
Something like:

state_dict =  model.state_dict()
state_dict = state_dict.cpu()

SimonW · September 7, 2018, 6:58pm

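state_dict = model.state_dict()
for k, v in state_dict.items():
    # Replace each entry with a CPU tensor; the model itself stays on the GPU.
    state_dict[k] = v.cpu()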

francois-rozet · October 17, 2020, 4:38pm

There is also a one-liner to create a CPU copy of the state_dict:

{k: v.cpu() for k, v in model.state_dict()}

RobS · April 25, 2023, 8:03pm

Small correction: iterating over a dict directly yields only its keys, so you need to call items() on it to get the key-value pairs:

{k: v.cpu() for k, v in model.state_dict().items()}
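
For reference, here is a minimal end-to-end sketch of the corrected one-liner. The `nn.Linear(4, 2)` model and the name `cpu_state_dict` are stand-ins chosen for illustration, not part of the original posts, and the device selection falls back to the CPU so the snippet also runs on a machine without a GPU.

```python
import torch
import torch.nn as nn

# Use the GPU when one is available; otherwise the sketch still runs on the CPU.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Hypothetical stand-in for a real model.
model = nn.Linear(4, 2).to(device)

# Build a new dict of CPU tensors; the model itself is never moved.
cpu_state_dict = {k: v.cpu() for k, v in model.state_dict().items()}

# Every entry now lives on the CPU, while the parameters stay on `device`.
assert all(v.device.type == "cpu" for v in cpu_state_dict.values())
assert next(model.parameters()).device.type == device.type
```

One caveat: when a tensor is already on the CPU, `.cpu()` returns the original tensor rather than a copy, so on a CPU-only model the dict entries would still share memory with the parameters. If an independent snapshot is needed in that case, `v.detach().clone()` is one option.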