Yes, the times are identical with and without pretrained weights. I’m using the latest PyPI versions of torch (1.4.0) and torchvision (0.5.0), and it happens on multiple machines.
Specifically, the slow part is drawing the truncated normal samples via X.rvs(m.weight.numel()). Could a change to scipy.stats have slowed this down? Also, is it possible to draw the same truncated normal with torch’s built-in tensor operations?
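On the second question: yes, a truncated normal can be drawn with torch’s own tensor ops via inverse-CDF sampling, avoiding scipy entirely. A minimal sketch, assuming a recent PyTorch (the function name truncated_normal_ is mine; newer releases also ship an equivalent nn.init.trunc_normal_, which isn’t in 1.4):

```python
import math
import torch

def truncated_normal_(tensor, a=-2.0, b=2.0, std=0.1):
    # Inverse-CDF sampling: draw uniforms inside [Phi(a), Phi(b)],
    # then map them back through the normal quantile function.
    lo = 0.5 * (1 + math.erf(a / math.sqrt(2)))
    hi = 0.5 * (1 + math.erf(b / math.sqrt(2)))
    with torch.no_grad():
        tensor.uniform_(lo, hi)            # u ~ U(Phi(a), Phi(b))
        tensor.mul_(2).sub_(1)             # map to erfinv's (-1, 1) domain
        tensor.erfinv_()                   # standard normal quantiles
        tensor.mul_(math.sqrt(2) * std)    # rescale to N(0, std^2) truncated
    return tensor

w = truncated_normal_(torch.empty(1000), std=0.1)
```

All samples land in [a*std, b*std], matching scipy’s truncnorm(-2, 2, scale=std), and the whole thing runs as vectorized tensor ops.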
If I change the weight initialization to something like
    import scipy.stats as stats

    for m in self.modules():
        if isinstance(m, nn.Conv2d) or isinstance(m, nn.Linear):
            stddev = m.stddev if hasattr(m, 'stddev') else 0.1
            X = stats.truncnorm(-2, 2, scale=stddev)
            # values = torch.as_tensor(X.rvs(m.weight.numel()), dtype=m.weight.dtype)
            # values = values.view(m.weight.size())
            foo = X.rvs(m.weight.numel())  # sample, but discard the result
            values = torch.zeros_like(m.weight)  # zeros_like takes a tensor, not a size
            with torch.no_grad():
                m.weight.copy_(values)
        elif isinstance(m, nn.BatchNorm2d):
            nn.init.constant_(m.weight, 1)
            nn.init.constant_(m.bias, 0)
Then I still see the slowdown, which disappears if I remove the X.rvs() call. A single call to X.rvs() doesn’t take a particularly long time, but the loop iterates over ~300 layers.
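To confirm where the time goes, here is a quick micro-benchmark of the rvs() calls alone; the layer and parameter counts are rough stand-ins for the real model, not its actual sizes:

```python
import time
import scipy.stats as stats

X = stats.truncnorm(-2, 2, scale=0.1)
n_layers, n_params = 300, 10_000  # illustrative stand-ins only

start = time.perf_counter()
for _ in range(n_layers):
    X.rvs(n_params)
elapsed = time.perf_counter() - start
print(f"{n_layers} rvs() calls took {elapsed:.2f}s")
```

Running this under scipy 1.3.x and 1.4.x on the same machine should make any per-call regression obvious.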
That scipy.stats regression does look like the same issue we’re seeing here. In the meantime, I’ve simply removed the weight initialization, since I always use the pretrained weights anyway.
Downgrading scipy from 1.4 to 1.3.3 fixed the issue for me, and the loading time is much faster now. I just had to run pip install --upgrade scipy==1.3.3