Broadcasting the depthwise convolution kernel

ptrblck · April 13, 2020, 3:14am

You could use expand(256, -1, -1, -1) instead of repeat.
Not however, that you would only save ~9kB of memory, since:

print(lpf.nelement() * 4 / 1024)
> 9.0