Is there pretrained CNN (e.g. ResNet) for CIFAR-10 or CIFAR-100?

skrish13 · March 5, 2017, 12:04am

Ahh. I saw your edits though, thanks

jekbradbury · March 5, 2017, 12:06am

Here’s code for a network known to do well on CIFAR, although they don’t include a pretrained model file https://github.com/xternalz/WideResNet-pytorch

skrish13 · March 5, 2017, 12:07am

Oh okay! I’ll see if I can train it. Thanks again!

skrish13 · March 5, 2017, 12:10am

Didn’t we have a repo of links to PyTorch implementation of various papers? (Or I’m confusing that with Chainer’s similar repo :S)

jekbradbury · March 5, 2017, 12:28am

There’s this https://github.com/ritchieng/the-incredible-pytorch (Chainer keeps a semi-official one, but it probably makes more sense to leave it to the community)

skrish13 · March 5, 2017, 12:29am

Hmm yeah I guess. Thanks for the repo

prlz77 · May 16, 2017, 3:21pm

Hi, I uploaded ResNeXt models to https://github.com/prlz77/ResNeXt.pytorch. They are not the full-sized ones, but you can train those yourself with this code too.

Ismail_Elezi · May 18, 2017, 12:33pm

Sorry for the dumb question, but how do you load a .pytorch file? Is it the same as the .pth extension?

This doesn’t seem to work:

net = torch.load('model.pytorch')

prlz77 · May 18, 2017, 1:38pm

I think this is the way: http://stackoverflow.com/questions/42703500/best-way-to-save-a-trained-model-in-pytroch
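For readers who cannot follow the link, the pattern usually recommended is to save the model's `state_dict` and load it into a freshly built instance of the same class. Below is a minimal sketch of that round trip. `TinyNet` and `roundtrip_state_dict` are hypothetical names used only for illustration; `TinyNet` stands in for whatever class produced the file and is not the ResNeXt model from the repo.

```python
import io

import torch
import torch.nn as nn


class TinyNet(nn.Module):
    """Hypothetical stand-in for the real model class."""

    def __init__(self, num_classes=10):
        super().__init__()
        self.fc = nn.Linear(8, num_classes)

    def forward(self, x):
        return self.fc(x)


def roundtrip_state_dict(model, fresh_model):
    """Save model's weights to an in-memory buffer and load them into fresh_model."""
    buffer = io.BytesIO()  # a file path such as "model.pth" works the same way
    torch.save(model.state_dict(), buffer)
    buffer.seek(0)
    state = torch.load(buffer, map_location="cpu")
    fresh_model.load_state_dict(state)
    return fresh_model


trained = TinyNet(num_classes=10)
restored = roundtrip_state_dict(trained, TinyNet(num_classes=10))
```

The file extension (.pth, .pytorch, .pth.tar) is only a naming convention. What matters is whether the file holds a `state_dict`, a whole pickled model, or a dictionary wrapping a `state_dict` under some key.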

Ismail_Elezi · May 18, 2017, 3:42pm

What would be TheModelClass for your model?

Ismail_Elezi · May 21, 2017, 4:09pm

@jekbradbury

Did you manage to train/load the model successfully? I managed to train it, but I get a "sizes do not match" error when I try to load it. I am trying to load it with the following code:

model = wrn.WideResNet(layers, 10)
model = model.cuda()
checkpoint = torch.load('runs/WideResNet-28-10/checkpoint.pth.tar')
model.load_state_dict(checkpoint['state_dict'])

but getting:

RuntimeError: sizes do not match at /data/users/soumith/miniconda2/conda-bld/pytorch-0.1.7_1485444530918/work/torch/lib/THC/THCTensorCopy.cu:31

karlTUM · May 24, 2017, 7:09am

@Ismail_Elezi Did you solve your problem? I got the same error when I tried to resume training a network.

Ismail_Elezi · May 24, 2017, 10:17am

Yes, I managed to load ResNets that I trained on CIFAR datasets. The code for that is:

# num_classes, depth and widen_factor must match the checkpoint being loaded
model = wrn.WideResNet(depth=number_of_layers, num_classes=10, widen_factor=4)
checkpoint = torch.load('runs/WideResNet-28-10/cifar_10.pth.tar')
model.load_state_dict(checkpoint['state_dict'])
model = model.cuda()

The parameters for the model and for the net you are loading should agree.
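One way to track down a "sizes do not match" error is to compare tensor shapes between the model and the checkpoint before calling `load_state_dict`. The sketch below is only an illustration: `find_shape_mismatches` is a hypothetical helper, and two small `nn.Linear` layers stand in for a 10-class model and a 100-class checkpoint.

```python
import torch.nn as nn


def find_shape_mismatches(model, state_dict):
    """Return {name: (model_shape, checkpoint_shape)} for tensors that disagree."""
    mismatches = {}
    for name, tensor in model.state_dict().items():
        if name in state_dict and tuple(tensor.shape) != tuple(state_dict[name].shape):
            mismatches[name] = (tuple(tensor.shape), tuple(state_dict[name].shape))
    return mismatches


# Hypothetical stand-ins: a checkpoint saved with 100 classes, a model built with 10.
checkpoint_state = nn.Linear(16, 100).state_dict()
model = nn.Linear(16, 10)

mismatches = find_shape_mismatches(model, checkpoint_state)
for name, (model_shape, ckpt_shape) in mismatches.items():
    print(f"{name}: model {model_shape} vs checkpoint {ckpt_shape}")
```

With the WideResNet code above, a mismatch in the final classifier layer would point at `num_classes`, while mismatches in the convolution weights would point at `widen_factor`.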

For what it's worth, the accuracy I got was:

Cifar-10: 0.9548
Cifar-100: 0.7868

with these hyperparameters:

layers: 40 convs
learning rate: 0.1
momentum: Nesterov, 0.9
regularization: Tikhonov (L2 weight decay), 5e-4
widen_factor: 4
batch size: 128
number of epochs: 200
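As a rough sketch, the optimizer-related settings in this list map onto `torch.optim.SGD` as shown below. The `nn.Linear` module is only a placeholder for the WideResNet instance. Batch size and number of epochs belong to the data loader and training loop, and no learning-rate schedule is given above, so none is shown.

```python
import torch
import torch.nn as nn

# Placeholder module; in the thread this would be the WideResNet instance.
model = nn.Linear(4, 2)

optimizer = torch.optim.SGD(
    model.parameters(),
    lr=0.1,             # learning rate
    momentum=0.9,       # momentum coefficient
    nesterov=True,      # use Nesterov momentum
    weight_decay=5e-4,  # L2 (Tikhonov) regularization strength
)
```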

Would be interesting to see what happens if I use some more advanced optimizer like Adam.

Anyway, in case you don’t have time to train them, I can upload the models today, during the afternoon.

prlz77 · May 26, 2017, 12:57pm

I just read your question! Nice to know you managed

raaj043 · June 12, 2017, 11:26am

Hi,

Then on which dataset are all the models pretrained?

smth · June 13, 2017, 1:57am

They’re all pre-trained on ImageNet-12 (the ILSVRC 2012 dataset).

raaj043 · June 14, 2017, 1:53pm

Thank you very much. It’s helpful.

akamaster · January 16, 2018, 12:34pm

Here is the link to a repo that has pretrained ResNets for CIFAR10. These models are the lean ResNets discussed in the original paper. If you directly apply the ResNets from torchvision to train your own net, you’ll get something that is not in the original paper, because torchvision’s nets are designed for ImageNet, not CIFAR10.
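To make the difference concrete: the CIFAR ResNets in the original paper use three stages of n residual blocks with 16, 32 and 64 filters, for 6n + 2 weighted layers in total, whereas torchvision's ImageNet ResNets use four stages starting at 64 channels. A small sketch of the CIFAR layout follows; `cifar_resnet_layout` is a hypothetical helper for illustration and is not taken from the linked repo.

```python
def cifar_resnet_layout(n):
    """Layout of a CIFAR ResNet from the original paper for a given n.

    Three stages of n residual blocks (two 3x3 convs each) with 16, 32 and
    64 filters, plus the first conv and the final linear layer, give
    6n + 2 weighted layers.
    """
    return {
        "depth": 6 * n + 2,
        "blocks_per_stage": [n, n, n],
        "channels": [16, 32, 64],
    }


# n = 3, 5, 7, 9, 18 give the depths 20, 32, 44, 56 and 110.
depths = [cifar_resnet_layout(n)["depth"] for n in (3, 5, 7, 9, 18)]
print(depths)
```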

huyvnphan · July 8, 2019, 10:12pm

I also searched for pretrained PyTorch models for CIFAR-10, but I could not find any repo that shares weights, so I made one:

NicoHambauer · July 21, 2021, 9:36am

Thanks for that question! It is really confusing at first, if one is new to ML, to find out which datasets pretrained models were trained on.