Are there any recommended methods to clone a model?

disreputableDog · August 9, 2019, 1:15pm

Can confirm, deepcopy does not work (changes to original still reflected in copy) but pickle does work.

yong_xu · February 1, 2020, 7:41am

classifier = pickle.loads(pickle.dumps(self.classifier))
TypeError: can’t pickle module objects
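
An error like this usually means that some attribute on the object holds a reference to a Python module. Below is a minimal sketch for locating such attributes. `Classifier` and `unpicklable_attributes` are hypothetical names used only for illustration, and the check covers top-level instance attributes only, not nested ones.

```python
import math
import pickle


class Classifier:
    """Hypothetical object standing in for self.classifier above."""

    def __init__(self):
        self.threshold = 0.5
        self.backend = math  # a module reference, which pickle rejects


def unpicklable_attributes(obj):
    """Return the names of instance attributes that pickle cannot serialize."""
    bad = []
    for name, value in vars(obj).items():
        try:
            pickle.dumps(value)
        except Exception:
            bad.append(name)
    return bad


culprits = unpicklable_attributes(Classifier())
print(culprits)
```

Once the offending attribute is known, it can be removed or replaced with something picklable before cloning.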

Kamil_Wojcicki · February 21, 2020, 10:55pm

Using Adam’s suggestion:

threw:

TypeError: can't pickle dict_keys objects

for the model I am working with.

I am using python 3.7 and the model was trained on multiple GPUs.

Has anyone run into this issue with their models? Any ideas how to fix it?

Searching online, I found similar issue with deepcopy (but not in the context of PyTorch):

Apparently in python3 you have to wrap dict.keys() in list() — otherwise the deepcopy issue appears.

github.com/neuropycon/ephypype

deepcopy error when running workflow in python3

opened 03:27PM - 29 Dec 17 UTC

closed 12:08PM - 31 Dec 17 UTC

dmalt

Can't make nipype work under python3. I've created a simple pipeline to disentangle command-line code from nipype and ephypype. When I run the workflow I get the following error:

```
Traceback (most recent call last):
  File "test_cli.py", line 91, in <module>
    workflow.run(plugin='Linear')
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/site-packages/nipype/pipeline/engine/workflows.py", line 570, in run
    flatgraph = self._create_flat_graph()
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/site-packages/nipype/pipeline/engine/workflows.py", line 830, in _create_flat_graph
    workflowcopy = deepcopy(self)
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/copy.py", line 180, in deepcopy
    y = _reconstruct(x, memo, *rv)
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/copy.py", line 280, in _reconstruct
    state = deepcopy(state, memo)
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/copy.py", line 150, in deepcopy
    y = copier(x, memo)
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/copy.py", line 240, in _deepcopy_dict
    y[deepcopy(key, memo)] = deepcopy(value, memo)
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/copy.py", line 180, in deepcopy
    y = _reconstruct(x, memo, *rv)
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/copy.py", line 280, in _reconstruct
    state = deepcopy(state, memo)
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/copy.py", line 150, in deepcopy
    y = copier(x, memo)
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/copy.py", line 240, in _deepcopy_dict
    y[deepcopy(key, memo)] = deepcopy(value, memo)
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/copy.py", line 150, in deepcopy
    y = copier(x, memo)
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/copy.py", line 240, in _deepcopy_dict
    y[deepcopy(key, memo)] = deepcopy(value, memo)
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/copy.py", line 180, in deepcopy
    y = _reconstruct(x, memo, *rv)
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/copy.py", line 280, in _reconstruct
    state = deepcopy(state, memo)
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/copy.py", line 150, in deepcopy
    y = copier(x, memo)
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/copy.py", line 240, in _deepcopy_dict
    y[deepcopy(key, memo)] = deepcopy(value, memo)
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/copy.py", line 150, in deepcopy
    y = copier(x, memo)
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/copy.py", line 215, in _deepcopy_list
    append(deepcopy(a, memo))
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/copy.py", line 150, in deepcopy
    y = copier(x, memo)
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/copy.py", line 220, in _deepcopy_tuple
    y = [deepcopy(a, memo) for a in x]
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/copy.py", line 220, in <listcomp>
    y = [deepcopy(a, memo) for a in x]
  File "/home/dmalt/Code/python/neuropycon/npyc3/lib/python3.6/copy.py", line 169, in deepcopy
    rv = reductor(4)
TypeError: can't pickle dict_keys objects
```

It seems to me that something goes wrong with looped links inside the workflow object. The error appears when nipype tries to deepcopy the workflow instance. In python2.7 the same works fine. I tried to google for similar problems but I got only one google groups issue like that without responses. Have you guys seen a problem like this?

Kamil_Wojcicki · February 24, 2020, 5:21pm

The answer turned out to be pretty simple. The instance attributes of your model have to be picklable. In my particular case, storing dict_keys caused the issue. Converting those to a list resolved the issue:

import copy

model.attribute = list(model.attribute)  # where attribute was dict_keys
model_clone = copy.deepcopy(model)
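
To make the failure and the fix easy to reproduce without a trained model, here is a minimal pure-Python sketch. `TinyModel`, `layer_names` and `try_deepcopy` are hypothetical names, and the class is a plain object rather than an `nn.Module`; the assumption is only that `copy.deepcopy` hits the same `TypeError` on a stored `dict_keys` view.

```python
import copy


class TinyModel:
    """Hypothetical stand-in for a model; not an nn.Module."""

    def __init__(self):
        self.weights = {"layer1": [0.1, 0.2], "layer2": [0.3]}
        self.layer_names = self.weights.keys()  # dict_keys view, not picklable


def try_deepcopy(obj):
    """Return a deep copy of obj, or None if deepcopy raises TypeError."""
    try:
        return copy.deepcopy(obj)
    except TypeError as err:
        print(f"deepcopy failed: {err}")
        return None


model = TinyModel()
failed_clone = try_deepcopy(model)  # fails because of the dict_keys attribute

model.layer_names = list(model.layer_names)  # replace the view with a list
model_clone = try_deepcopy(model)  # now succeeds

# The clone is independent: editing it leaves the original untouched.
model_clone.weights["layer1"][0] = 99.0
```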

syomantak · July 13, 2020, 8:18am

If I just want to copy the state dict, would temp = model.state_dict() work, or do I need a deep copy for the state_dict as well? I keep training afterwards, so would the temp variable change?

albanD · July 13, 2020, 1:44pm

Hi,

Yes, you need to deepcopy it if you want a deep copy.
If you only assign temp = model.state_dict(), the tensors in temp share memory with the model's parameters, so temp will change when you update the model.
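
A small sketch of the difference, assuming a toy `nn.Linear` model and using an in-place `add_` as a stand-in for an optimizer update (the variable names are illustrative only):

```python
import copy

import torch
import torch.nn as nn

torch.manual_seed(0)
model = nn.Linear(2, 1)

shallow = model.state_dict()                  # tensors share memory with the model
snapshot = copy.deepcopy(model.state_dict())  # independent copy of the tensors

# Simulate a training update by modifying the weights in place.
with torch.no_grad():
    model.weight.add_(1.0)

# The shallow reference follows the model; the deep copy keeps the old values.
shallow_tracks = torch.equal(shallow["weight"], model.weight)
snapshot_tracks = torch.equal(snapshot["weight"], model.weight)
print(shallow_tracks, snapshot_tracks)
```

The snapshot can later be restored with `model.load_state_dict(snapshot)`.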

Lin_Jia · August 29, 2020, 5:53pm

I use the PyTorch C++ interface and need to deep copy modules. I think I am going to go this route: 1) dump one module to disk using torch save, 2) load the dumped file into a new module instance.

fulltopic · September 11, 2020, 2:58am

What are the corresponding methods in the C++ API?

afshin67 · March 30, 2021, 12:05am

Did you find any solution?

drscotthawley · April 3, 2021, 10:16pm

With Pytorch 1.7, I’m not finding copy.deepcopy(model) to work.

model_clone = copy.deepcopy(model)

---------------------------------------------------------------------------
AttributeError                            Traceback (most recent call last)
<ipython-input-173-0885027ec90d> in <module>()
----> 1 model_clone = copy.deepcopy(model)

AttributeError: 'function' object has no attribute 'deepcopy'

drscotthawley · April 3, 2021, 10:31pm

…oh, woops. Have to
import copy
first. Now it works!

DreamJim · March 2, 2024, 10:23am

Excuse me, I ran into a problem when copying a model.
I am new to machine learning and PyTorch, so my question may look stupid. I am currently using Python 3.11 and torch 2.0.1. I am trying to copy a model with copy.deepcopy() after training and then resume training on the copy. However, I get the error message “AttributeError: ‘NoneType’ object has no attribute ‘data’”. It seems that the params in the optimizer's param groups have no grad. Then I found
“`copy.deepcopy` does not copy gradients of nn.Parameter · Issue #95711 · pytorch/pytorch · GitHub”,
which states that the copied model does not contain the parameter gradients.
Is there a proper way to copy the model? Or is there another way to restore the “grad” attribute on the copied model?
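
One possible approach, offered as a sketch rather than an official recipe: deep copy the model, then copy each parameter's gradient across by hand. `clone_with_grads` is a hypothetical helper name, and the example assumes that the optimizer for the copy is built from the copy's own parameters.

```python
import copy

import torch
import torch.nn as nn


def clone_with_grads(model):
    """Deep copy a model and also copy any existing parameter gradients."""
    clone = copy.deepcopy(model)
    for src, dst in zip(model.parameters(), clone.parameters()):
        if src.grad is not None:
            dst.grad = src.grad.detach().clone()
    return clone


model = nn.Linear(3, 1)
model(torch.ones(4, 3)).sum().backward()  # populate .grad on the original

clone = clone_with_grads(model)

# Build the optimizer from the clone's parameters, not the original's.
optimizer = torch.optim.SGD(clone.parameters(), lr=0.1)
optimizer.step()
```

If the old gradients are not actually needed, running a fresh forward and backward pass on the copy before calling `optimizer.step()` also populates `.grad`.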