I got the below error when I tried to pretrain a model from github on my own dataset:
File "/usr/local/lib/python3.7/dist-packages/torch/nn/modules/module.py", line 1224, in load_state_dict
self.__class__.__name__, "\n\t".join(error_msgs)))
RuntimeError: Error(s) in loading state_dict for Generator:
size mismatch for linear.weight: copying a param with shape torch.Size([16384, 128]) from checkpoint, the shape in current model is torch.Size([24576, 17]).
size mismatch for linear.bias: copying a param with shape torch.Size([16384]) from checkpoint, the shape in current model is torch.Size([24576]).
size mismatch for linear.u0: copying a param with shape torch.Size([1, 16384]) from checkpoint, the shape in current model is torch.Size([1, 24576]).
size mismatch for blocks.0.0.conv1.weight: copying a param with shape torch.Size([1024, 1024, 3, 3]) from checkpoint, the shape in current model is torch.Size([1536, 1536, 3, 3]).
size mismatch for blocks.0.0.conv1.bias: copying a param with shape torch.Size([1024]) from checkpoint, the shape in current model is torch.Size([1536]).
size mismatch for blocks.0.0.conv1.u0: copying a param with shape torch.Size([1, 1024]) from checkpoint, the shape in current model is torch.Size([1, 1536]).
size mismatch for blocks.0.0.conv2.weight: copying a param with shape torch.Size([1024, 1024, 3, 3]) from checkpoint, the shape in current model is torch.Size([1536, 1536, 3, 3]).
size mismatch for blocks.0.0.conv2.bias: copying a param with shape torch.Size([1024]) from checkpoint, the shape in current model is torch.Size([1536]).
size mismatch for blocks.0.0.conv2.u0: copying a param with shape torch.Size([1, 1024]) from checkpoint, the shape in current model is torch.Size([1, 1536]).
size mismatch for blocks.0.0.conv_sc.weight: copying a param with shape torch.Size([1024, 1024, 1, 1]) from checkpoint, the shape in current model is torch.Size([1536, 1536, 1, 1]).
size mismatch for blocks.0.0.conv_sc.bias: copying a param with shape torch.Size([1024]) from checkpoint, the shape in current model is torch.Size([1536]).
size mismatch for blocks.0.0.conv_sc.u0: copying a param with shape torch.Size([1, 1024]) from checkpoint, the shape in current model is torch.Size([1, 1536]).
size mismatch for blocks.0.0.bn1.stored_mean: copying a param with shape torch.Size([1024]) from checkpoint, the shape in current model is torch.Size([1536]).
size mismatch for blocks.0.0.bn1.stored_var: copying a param with shape torch.Size([1024]) from checkpoint, the shape in current model is torch.Size([1536]).
size mismatch for blocks.0.0.bn2.stored_mean: copying a param with shape torch.Size([1024]) from checkpoint, the shape in current model is torch.Size([1536]).
size mismatch for blocks.0.0.bn2.stored_var: copying a param with shape torch.Size([1024]) from checkpoint, the shape in current model is torch.Size([1536]).
size mismatch for blocks.1.0.conv1.weight: copying a param with shape torch.Size([512, 1024, 3, 3]) from checkpoint, the shape in current model is torch.Size([768, 1536, 3, 3]).
size mismatch for blocks.1.0.conv1.bias: copying a param with shape torch.Size([512]) from checkpoint, the shape in current model is torch.Size([768]).
size mismatch for blocks.1.0.conv1.u0: copying a param with shape torch.Size([1, 512]) from checkpoint, the shape in current model is torch.Size([1, 768]).
size mismatch for blocks.1.0.conv2.weight: copying a param with shape torch.Size([512, 512, 3, 3]) from checkpoint, the shape in current model is torch.Size([768, 768, 3, 3]).
size mismatch for blocks.1.0.conv2.bias: copying a param with shape torch.Size([512]) from checkpoint, the shape in current model is torch.Size([768]).
size mismatch for blocks.1.0.conv2.u0: copying a param with shape torch.Size([1, 512]) from checkpoint, the shape in current model is torch.Size([1, 768]).
size mismatch for blocks.1.0.conv_sc.weight: copying a param with shape torch.Size([512, 1024, 1, 1]) from checkpoint, the shape in current model is torch.Size([768, 1536, 1, 1]).
size mismatch for blocks.1.0.conv_sc.bias: copying a param with shape torch.Size([512]) from checkpoint, the shape in current model is torch.Size([768]).
size mismatch for blocks.1.0.conv_sc.u0: copying a param with shape torch.Size([1, 512]) from checkpoint, the shape in current model is torch.Size([1, 768]).
size mismatch for blocks.1.0.bn1.stored_mean: copying a param with shape torch.Size([1024]) from checkpoint, the shape in current model is torch.Size([1536]).
size mismatch for blocks.1.0.bn1.stored_var: copying a param with shape torch.Size([1024]) from checkpoint, the shape in current model is torch.Size([1536]).
size mismatch for blocks.1.0.bn2.stored_mean: copying a param with shape torch.Size([512]) from checkpoint, the shape in current model is torch.Size([768]).
size mismatch for blocks.1.0.bn2.stored_var: copying a param with shape torch.Size([512]) from checkpoint, the shape in current model is torch.Size([768]).
size mismatch for blocks.2.0.conv1.weight: copying a param with shape torch.Size([512, 512, 3, 3]) from checkpoint, the shape in current model is torch.Size([768, 768, 3, 3]).
size mismatch for blocks.2.0.conv1.bias: copying a param with shape torch.Size([512]) from checkpoint, the shape in current model is torch.Size([768]).
size mismatch for blocks.2.0.conv1.u0: copying a param with shape torch.Size([1, 512]) from checkpoint, the shape in current model is torch.Size([1, 768]).
size mismatch for blocks.2.0.conv2.weight: copying a param with shape torch.Size([512, 512, 3, 3]) from checkpoint, the shape in current model is torch.Size([768, 768, 3, 3]).
size mismatch for blocks.2.0.conv2.bias: copying a param with shape torch.Size([512]) from checkpoint, the shape in current model is torch.Size([768]).
size mismatch for blocks.2.0.conv2.u0: copying a param with shape torch.Size([1, 512]) from checkpoint, the shape in current model is torch.Size([1, 768]).
size mismatch for blocks.2.0.conv_sc.weight: copying a param with shape torch.Size([512, 512, 1, 1]) from checkpoint, the shape in current model is torch.Size([768, 768, 1, 1]).
size mismatch for blocks.2.0.conv_sc.bias: copying a param with shape torch.Size([512]) from checkpoint, the shape in current model is torch.Size([768]).
size mismatch for blocks.2.0.conv_sc.u0: copying a param with shape torch.Size([1, 512]) from checkpoint, the shape in current model is torch.Size([1, 768]).
size mismatch for blocks.2.0.bn1.stored_mean: copying a param with shape torch.Size([512]) from checkpoint, the shape in current model is torch.Size([768]).
size mismatch for blocks.2.0.bn1.stored_var: copying a param with shape torch.Size([512]) from checkpoint, the shape in current model is torch.Size([768]).
size mismatch for blocks.2.0.bn2.stored_mean: copying a param with shape torch.Size([512]) from checkpoint, the shape in current model is torch.Size([768]).
size mismatch for blocks.2.0.bn2.stored_var: copying a param with shape torch.Size([512]) from checkpoint, the shape in current model is torch.Size([768]).
size mismatch for blocks.3.0.conv1.weight: copying a param with shape torch.Size([256, 512, 3, 3]) from checkpoint, the shape in current model is torch.Size([384, 768, 3, 3]).
size mismatch for blocks.3.0.conv1.bias: copying a param with shape torch.Size([256]) from checkpoint, the shape in current model is torch.Size([384]).
size mismatch for blocks.3.0.conv1.u0: copying a param with shape torch.Size([1, 256]) from checkpoint, the shape in current model is torch.Size([1, 384]).
size mismatch for blocks.3.0.conv2.weight: copying a param with shape torch.Size([256, 256, 3, 3]) from checkpoint, the shape in current model is torch.Size([384, 384, 3, 3]).
size mismatch for blocks.3.0.conv2.bias: copying a param with shape torch.Size([256]) from checkpoint, the shape in current model is torch.Size([384]).
size mismatch for blocks.3.0.conv2.u0: copying a param with shape torch.Size([1, 256]) from checkpoint, the shape in current model is torch.Size([1, 384]).
size mismatch for blocks.3.0.conv_sc.weight: copying a param with shape torch.Size([256, 512, 1, 1]) from checkpoint, the shape in current model is torch.Size([384, 768, 1, 1]).
size mismatch for blocks.3.0.conv_sc.bias: copying a param with shape torch.Size([256]) from checkpoint, the shape in current model is torch.Size([384]).
size mismatch for blocks.3.0.conv_sc.u0: copying a param with shape torch.Size([1, 256]) from checkpoint, the shape in current model is torch.Size([1, 384]).
size mismatch for blocks.3.0.bn1.stored_mean: copying a param with shape torch.Size([512]) from checkpoint, the shape in current model is torch.Size([768]).
size mismatch for blocks.3.0.bn1.stored_var: copying a param with shape torch.Size([512]) from checkpoint, the shape in current model is torch.Size([768]).
size mismatch for blocks.3.0.bn2.stored_mean: copying a param with shape torch.Size([256]) from checkpoint, the shape in current model is torch.Size([384]).
size mismatch for blocks.3.0.bn2.stored_var: copying a param with shape torch.Size([256]) from checkpoint, the shape in current model is torch.Size([384]).
size mismatch for blocks.4.0.conv1.weight: copying a param with shape torch.Size([128, 256, 3, 3]) from checkpoint, the shape in current model is torch.Size([192, 384, 3, 3]).
size mismatch for blocks.4.0.conv1.bias: copying a param with shape torch.Size([128]) from checkpoint, the shape in current model is torch.Size([192]).
size mismatch for blocks.4.0.conv1.u0: copying a param with shape torch.Size([1, 128]) from checkpoint, the shape in current model is torch.Size([1, 192]).
size mismatch for blocks.4.0.conv2.weight: copying a param with shape torch.Size([128, 128, 3, 3]) from checkpoint, the shape in current model is torch.Size([192, 192, 3, 3]).
size mismatch for blocks.4.0.conv2.bias: copying a param with shape torch.Size([128]) from checkpoint, the shape in current model is torch.Size([192]).
size mismatch for blocks.4.0.conv2.u0: copying a param with shape torch.Size([1, 128]) from checkpoint, the shape in current model is torch.Size([1, 192]).
size mismatch for blocks.4.0.conv_sc.weight: copying a param with shape torch.Size([128, 256, 1, 1]) from checkpoint, the shape in current model is torch.Size([192, 384, 1, 1]).
size mismatch for blocks.4.0.conv_sc.bias: copying a param with shape torch.Size([128]) from checkpoint, the shape in current model is torch.Size([192]).
size mismatch for blocks.4.0.conv_sc.u0: copying a param with shape torch.Size([1, 128]) from checkpoint, the shape in current model is torch.Size([1, 192]).
size mismatch for blocks.4.0.bn1.stored_mean: copying a param with shape torch.Size([256]) from checkpoint, the shape in current model is torch.Size([384]).
size mismatch for blocks.4.0.bn1.stored_var: copying a param with shape torch.Size([256]) from checkpoint, the shape in current model is torch.Size([384]).
size mismatch for blocks.4.0.bn2.stored_mean: copying a param with shape torch.Size([128]) from checkpoint, the shape in current model is torch.Size([192]).
size mismatch for blocks.4.0.bn2.stored_var: copying a param with shape torch.Size([128]) from checkpoint, the shape in current model is torch.Size([192]).
size mismatch for blocks.5.0.conv1.weight: copying a param with shape torch.Size([64, 128, 3, 3]) from checkpoint, the shape in current model is torch.Size([96, 192, 3, 3]).
size mismatch for blocks.5.0.conv1.bias: copying a param with shape torch.Size([64]) from checkpoint, the shape in current model is torch.Size([96]).
size mismatch for blocks.5.0.conv1.u0: copying a param with shape torch.Size([1, 64]) from checkpoint, the shape in current model is torch.Size([1, 96]).
size mismatch for blocks.5.0.conv2.weight: copying a param with shape torch.Size([64, 64, 3, 3]) from checkpoint, the shape in current model is torch.Size([96, 96, 3, 3]).
size mismatch for blocks.5.0.conv2.bias: copying a param with shape torch.Size([64]) from checkpoint, the shape in current model is torch.Size([96]).
size mismatch for blocks.5.0.conv2.u0: copying a param with shape torch.Size([1, 64]) from checkpoint, the shape in current model is torch.Size([1, 96]).
size mismatch for blocks.5.0.conv_sc.weight: copying a param with shape torch.Size([64, 128, 1, 1]) from checkpoint, the shape in current model is torch.Size([96, 192, 1, 1]).
size mismatch for blocks.5.0.conv_sc.bias: copying a param with shape torch.Size([64]) from checkpoint, the shape in current model is torch.Size([96]).
size mismatch for blocks.5.0.conv_sc.u0: copying a param with shape torch.Size([1, 64]) from checkpoint, the shape in current model is torch.Size([1, 96]).
size mismatch for blocks.5.0.bn1.stored_mean: copying a param with shape torch.Size([128]) from checkpoint, the shape in current model is torch.Size([192]).
size mismatch for blocks.5.0.bn1.stored_var: copying a param with shape torch.Size([128]) from checkpoint, the shape in current model is torch.Size([192]).
size mismatch for blocks.5.0.bn2.stored_mean: copying a param with shape torch.Size([64]) from checkpoint, the shape in current model is torch.Size([96]).
size mismatch for blocks.5.0.bn2.stored_var: copying a param with shape torch.Size([64]) from checkpoint, the shape in current model is torch.Size([96]).
size mismatch for output_layer.0.gain: copying a param with shape torch.Size([64]) from checkpoint, the shape in current model is torch.Size([96]).
size mismatch for output_layer.0.bias: copying a param with shape torch.Size([64]) from checkpoint, the shape in current model is torch.Size([96]).
size mismatch for output_layer.0.stored_mean: copying a param with shape torch.Size([64]) from checkpoint, the shape in current model is torch.Size([96]).
size mismatch for output_layer.0.stored_var: copying a param with shape torch.Size([64]) from checkpoint, the shape in current model is torch.Size([96]).
size mismatch for output_layer.2.weight: copying a param with shape torch.Size([3, 64, 3, 3]) from checkpoint, the shape in current model is torch.Size([3, 96, 3, 3]).