Is there pretrained CNN (e.g. ResNet) for CIFAR-10 or CIFAR-100?

yunjey · March 4, 2017, 2:14pm

torchvision.models contains several pretrained CNNs (e.g AlexNet, VGG, ResNet). However, it seems that when input image size is small such as CIFAR-10, the above model can not be used.
Should i implement it myself? Or, Does PyTorch offer pretrained CNN with CIFAR-10?

smth · March 4, 2017, 2:17pm

We dont offer pre-trained resnet with cifar. You might have to train one yourself.

skrish13 · March 4, 2017, 11:29pm

Maybe NIN with CIFAR? :3

skrish13 · March 5, 2017, 12:04am

Ahh. I saw your edits though, thanks

jekbradbury · March 5, 2017, 12:06am

Here’s code for a network known to do well on CIFAR, although they don’t include a pretrained model file https://github.com/xternalz/WideResNet-pytorch

skrish13 · March 5, 2017, 12:07am

Oh okay! I’ll see if I can train it. Thanks again!

skrish13 · March 5, 2017, 12:10am

Didn’t we have a repo of links to PyTorch implementation of various papers? (Or I’m confusing that with Chainer’s similar repo :S)

jekbradbury · March 5, 2017, 12:28am

There’s this https://github.com/ritchieng/the-incredible-pytorch (Chainer keeps a semi-official one, but it probably makes more sense to leave it to the community)

skrish13 · March 5, 2017, 12:29am

Hmm yeah I guess. Thanks for the repo

prlz77 · May 16, 2017, 3:21pm

Hi, I uploaded resnet-v3 models in https://github.com/prlz77/ResNeXt.pytorch, however they are not the full-sized ones, which you can train yourself with this code too

Ismail_Elezi · May 18, 2017, 12:33pm

Sorry for the dumb question, but how do you load a .pytorch file. Is it the same extension as .pth?

This doesn’t seem to work:

net = torch.load(‘model.pytorch’)

prlz77 · May 18, 2017, 1:38pm

I think this is the way: http://stackoverflow.com/questions/42703500/best-way-to-save-a-trained-model-in-pytroch

Ismail_Elezi · May 18, 2017, 3:42pm

What would be TheModelClass for your model?

Ismail_Elezi · May 21, 2017, 4:09pm

@jekbradbury

Did you manage to train/load the model successfully? I managed to train it, but I am having a sizes do not match error, when I am trying to load it. I am trying to load it with the following code:

model = wrn.WideResNet(layers, 10)
model = model.cuda()
checkpoint = torch.load(‘runs/WideResNet-28-10/checkpoint.pth.tar’)
model.load_state_dict(checkpoint[‘state_dict’])

but getting:

RuntimeError: sizes do not match at /data/users/soumith/miniconda2/conda-bld/pytorch-0.1.7_1485444530918/work/torch/lib/THC/THCTensorCopy.cu:31

karlTUM · May 24, 2017, 7:09am

@Ismail_Elezi Did you solve your problem? I also got the same when I tried to resume to train a network.

Ismail_Elezi · May 24, 2017, 10:17am

Yes, I managed to load ResNets that I trained on CIFAR datasets. The code for that is:

model = wrn.WideResNet(depth=number_of_layers, num_classes=100, widen_factor=4)
checkpoint = torch.load(‘runs/WideResNet-28-10/cifar_10.pth.tar’)
model.load_state_dict(checkpoint[‘state_dict’])
model = model.cuda()

The parameters for the model and for the net you are loading should agree.

For what is worth, the accuracy I got was:

Cifar-10: 0.9548
Cifar-100: 0.7868

with these hyperparameters:

layers: 40 convs
learning rate: 0.1
momentum: nesterov with 0.9 param
regularization: Tikhonov with 5e-4 param
widen_factor: 4
batch size: 128
number of epochs: 200

Would be interesting to see what happens if I use some more advanced optimizer like Adam.

Anyway, in case you don’t have time to train them, I can upload the models today, during the afternoon.

prlz77 · May 26, 2017, 12:57pm

I just read your question! Nice to know you managed

raaj043 · June 12, 2017, 11:26am

Hi,

Then on which dataset all the models are pretrained?

smth · June 13, 2017, 1:57am

they’re all pre-trained on Imagenet-12

raaj043 · June 14, 2017, 1:53pm

Thank you very much. Its helpfulu